Received: by 2002:a05:6a10:a841:0:0:0:0 with SMTP id d1csp2504224pxy; Sat, 24 Apr 2021 18:31:40 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw5/C2aE7HYJ3CLEkqfkH7se5HSvtFLVZwtbBNo6v4a/bTG5C7UdEOHxxmor3zmhv/YgDja X-Received: by 2002:a17:902:6a87:b029:e6:6a3d:29e8 with SMTP id n7-20020a1709026a87b02900e66a3d29e8mr10862784plk.10.1619314300099; Sat, 24 Apr 2021 18:31:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619314300; cv=none; d=google.com; s=arc-20160816; b=NYbuKD6KKrh01m3rUccPZQi5PZUiv1EbFQjNdpUoeXMWU2cDfN89Hllr/DFggmRcRP bI4ZeFacnmHCH5A7F9xp8VSG3N/A/Q6Ic3fuRODVK+BjSrz3Fcu0C5j3k9TF5cuXiufp 9+DsMWBkY6HMNbJOO7V1mBnqbNndmSgBg8F9bCl1f2wHQ8AO7azAKue2832sfIu84U1m OV+38OpYVJGNiX/2Xkpoy2g4C6EcrslOr3ymXNP9j8rhxQxS2XalbeTUYUOS0hkxL29T IgUty2kShF+xq2FOeKayowXLG7EGZniFpp1eNBfpqI4mS15B74LIO37asknCL8p8mrQe 70Vg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :ironport-sdr:ironport-sdr; bh=4YOZm5aor669VQ+jL1AWOr+ScVEL1rKoMYoqMtNbWI0=; b=EqEZMWuq6tRaW6gZ2O3EHo3oVh4ZhYjqCiT8vz/m8Kmb+BREmskGGkdD9YX69itgjo jlsi5o/Mx3UC0RnK5nJoEcMCrPobvXcm3nEIkkMbcIIpzsIUTxkAMOeq0qdLU481HUND UZUjAztKrPvJksrPzRQtV+pV9QoylwvIw8/L2laLPlm0CxpjCokdz4NU/35qFaoD9Wls 49+VoebBeuSEJunAA4Pz8CtqMfHQ7mX6yhrh93D5iMaXfPUYZ/+G39oNLlC7ESL5ac4o bWnDAJYphhAZ1GnOZp9mpZJzIyUH/jcL7jg1rMGcCQSUI5rBx08o/Q3V1hHFPwLBEbWO AzFA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q4si15817578plb.191.2021.04.24.18.31.28; Sat, 24 Apr 2021 18:31:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229795AbhDYBbi (ORCPT + 99 others); Sat, 24 Apr 2021 21:31:38 -0400 Received: from mga09.intel.com ([134.134.136.24]:2384 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229687AbhDYBbh (ORCPT ); Sat, 24 Apr 2021 21:31:37 -0400 IronPort-SDR: QGGNzWBJYDvMtTOMVdWU4wZUV5rwfUHif6L1RdcC6WWILmFB8Hr1ZJo90T4+jMgMsq706wNLom 4BkCme41yCYw== X-IronPort-AV: E=McAfee;i="6200,9189,9964"; a="196316936" X-IronPort-AV: E=Sophos;i="5.82,249,1613462400"; d="scan'208";a="196316936" Received: from orsmga008.jf.intel.com ([10.7.209.65]) by orsmga102.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Apr 2021 18:30:58 -0700 IronPort-SDR: VERNXzieqiffuNkWMmdxlLF2JeDjc9IOthaXXepEVXPMUiWVL/L0oe3jvyhv7MtohcYM6XzE3s kLeTHCVX8BHg== X-IronPort-AV: E=Sophos;i="5.82,249,1613462400"; d="scan'208";a="428899909" Received: from xsang-optiplex-9020.sh.intel.com (HELO xsang-OptiPlex-9020) ([10.239.159.140]) by orsmga008-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 24 Apr 2021 18:30:56 -0700 Date: Sun, 25 Apr 2021 09:48:16 +0800 From: Oliver Sang To: Linus Torvalds Cc: "Aneesh Kumar K.V" , Harish Sriram , Andrew Morton , LKML , lkp@lists.01.org, kernel test robot Subject: Re: [mm/vunmap] e47110e905: WARNING:at_mm/vmalloc.c:#__vunmap Message-ID: <20210425014816.GB5251@xsang-OptiPlex-9020> References: <20210423063227.GA17429@xsang-OptiPlex-9020> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Linus, On Fri, Apr 23, 2021 at 10:18:18AM -0700, Linus Torvalds wrote: > On Thu, Apr 22, 2021 at 11:15 PM kernel test robot > wrote: > > > > commit: e47110e90584a22e9980510b00d0dfad3a83354e ("mm/vunmap: add cond_resched() in vunmap_pmd_range") > > Funky. That commit doesn't seem to have anything to do with the oops. > > The oops is odd too: > > > [ 198.731223] WARNING: CPU: 0 PID: 1948 at mm/vmalloc.c:2247 __vunmap (kbuild/src/consumer/mm/vmalloc.c:2247 (discriminator 1)) > > That's the warning for an unaligned vunmap(): > > 2247 if (WARN(!PAGE_ALIGNED(addr), "Trying to vfree() bad > address (%p)\n", > 2248 addr)) > 2249 return; > > > [ 198.744933] Call Trace: > > [ 198.745229] free_module (kbuild/src/consumer/kernel/module.c:2251) > > 2248 /* This may be empty, but that's OK */ > 2249 module_arch_freeing_init(mod); > 2250 module_memfree(mod->init_layout.base); > 2251 kfree(mod->args); > > That's the "module_memfree()" - the return address points to the > return point, which is the next line. > > And as far as I can tell, the only thing that assigns anything but > NULL to that init_layout.base is > > ptr = module_alloc(mod->init_layout.size); > > which uses __vmalloc_node_range() for the allocation. > > So absolutely nothing in this report makes sense to me. I suspect it's > some odd memory corruption. > > Oliver - how reliable is that bisection? we will check further if any issue in our test env. by bot auto tests, we saw 12 issue instances out of 74 runs. but not happen out of 100 runs of parent. f3f99d63a8156c7a e47110e90584a22e9980510b00d ---------------- --------------------------- fail:runs %reproduction fail:runs | | | 1:100 -1% :74 dmesg.BUG:kernel_reboot-without-warning_in_test_stage 2:100 0% 2:74 dmesg.BUG:unable_to_handle_page_fault_for_address :100 12% 12:74 dmesg.Kernel_panic-not_syncing:Fatal_exception 2:100 0% 2:74 dmesg.Oops:#[##] 1:100 -1% :74 dmesg.RIP:__is_module_percpu_address :100 12% 12:74 dmesg.RIP:__vunmap <----- :100 12% 12:74 dmesg.RIP:kfree :100 1% 1:74 dmesg.RIP:kobject_add_internal 2:100 -1% 1:74 dmesg.RIP:print_modules 1:100 -1% :74 dmesg.RIP:skip_spaces 1:100 -1% :74 dmesg.RIP:usercopy_abort :100 1% 1:74 dmesg.WARNING:at_lib/kobject.c:#kobject_add_internal :100 12% 12:74 dmesg.WARNING:at_mm/vmalloc.c:#__vunmap 3:100 10% 13:74 dmesg.boot_failures 1:100 -1% :74 dmesg.canonical_address#:#[##] 2:100 -2% :74 dmesg.invalid_opcode:#[##] 2:100 -2% :74 dmesg.kernel_BUG_at_mm/usercopy.c :100 11% 11:74 dmesg.stack_segment:#[##] > > Does anybody else see what might be up? > > Linus