Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp3140194imu; Sun, 27 Jan 2019 22:43:47 -0800 (PST) X-Google-Smtp-Source: ALg8bN6dq3A0afznUHnT5CPRFSEFOcLuVQUU6in2fx0ZRc365ATtJfvXwrbbM2M5kxXhnsk2yA8X X-Received: by 2002:a62:35c7:: with SMTP id c190mr21360878pfa.76.1548657827474; Sun, 27 Jan 2019 22:43:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548657827; cv=none; d=google.com; s=arc-20160816; b=SQvFCul21CHt9dI85g1Dex8JMe0s8fE3FKRJvbTeOwL6KeUzsD1BInOGHvNG/BBR1J dV5Ygrnl4taOFskcOQH/Hwe9jfh54BCnvhpdMEAxoecRT7ho2+MRWgSVGPPazw8Ds8nB AIs/ea+q6kT/JZs16cDu5AUI7z3eCzulnNWf6d7/QwsNsOkgg1fLGW69+MJ5SZ0SO2oV 7fRH2XnAn+6Q8/nREych2p/ArHCYJ7rLXFA6F1AWgTPYubEdvSqd1o/5PJzWqEjaRamk GxiNbVPX9PRFr6Ona8/mbPIccpyx/5RSSkKmMxbAZ0dkTJi40AH+WZQomarYYMgBFP/s ygAQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=tIYifZuX7+W5yXCTJBrqrHSvWgKArl5LFpM1Zbav92g=; b=kngK1/qy4T90xlLWrPQKuPMVhVCCievfZ7Iq8L2hpXoUGRrY2douLxyCHccPcJdxDp BK3zMSrkja1+qoadDQNo/V3guu7bQnUbV/Uo9OsXtpA51j7ks9HtO/DDBE9C06SZ0bxA ivZk+aHr3y6+/gVkgpnsEB5zBbTPvidfrHwgpJCUYIL7W/TDIwgELpiZnun3frjV+cT7 Re7oJbD+YMwIfa0nD3rIQmszHqzc9wDoNaO1ujmgvbKm+ceTy6Ndbd4G7sLYn7uXz9/g /u1Up+xevhihHnCyOtwYpZCIQr5c0fSIib0TBw76JCclOsQn8jOQ/vUQqzHn0432cdIB huPA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a8si30954617pgi.359.2019.01.27.22.43.32; Sun, 27 Jan 2019 22:43:47 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726694AbfA1GnH (ORCPT + 99 others); Mon, 28 Jan 2019 01:43:07 -0500 Received: from mx2.suse.de ([195.135.220.15]:39200 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726612AbfA1GnH (ORCPT ); Mon, 28 Jan 2019 01:43:07 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 0F284AEB2; Mon, 28 Jan 2019 06:43:05 +0000 (UTC) Date: Mon, 28 Jan 2019 07:43:03 +0100 From: Michal Hocko To: Mikhail Gavrilov Cc: robert shteynfeld , Linus Torvalds , Mikhail Zaslonko , Linux List Kernel Mailing , Gerald Schaefer , Dave Hansen , Alexander Duyck , Andrew Morton , Pavel Tatashin , Steven Sistare , Daniel Jordan , Bob Picco Subject: Re: kernel panic due to https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/commit/?id=2830bf6f05fb3e05bc4743274b806c821807a684 Message-ID: <20190128064303.GD18811@dhcp22.suse.cz> References: <20190125073704.GC3560@dhcp22.suse.cz> <20190125081924.GF3560@dhcp22.suse.cz> <20190125082952.GG3560@dhcp22.suse.cz> <20190125155810.GQ3560@dhcp22.suse.cz> <20190125163938.GA20411@dhcp22.suse.cz> <20190125173315.GC20411@dhcp22.suse.cz> <20190125181549.GE20411@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon 28-01-19 11:37:00, Mikhail Gavrilov wrote: > > Linus, could you take the revert please? > > > > From 817b18d3db36a6900ca9043af8c1416c56358be3 Mon Sep 17 00:00:00 2001 > > From: Michal Hocko > > Date: Fri, 25 Jan 2019 19:08:58 +0100 > > Subject: [PATCH] Revert "mm, memory_hotplug: initialize struct pages for the > > full memory section" > > > > This reverts commit 2830bf6f05fb3e05bc4743274b806c821807a684. > > > > The underlying assumption that one sparse section belongs into a single > > numa node doesn't hold really. Robert Shteynfeld has reported a boot > > failure. The boot log was not captured but his memory layout is as > > follows: > > [ 0.286954] Early memory node ranges > > [ 0.286955] node 1: [mem 0x0000000000001000-0x0000000000090fff] > > [ 0.286955] node 1: [mem 0x0000000000100000-0x00000000dbdf8fff] > > [ 0.286956] node 1: [mem 0x0000000100000000-0x0000001423ffffff] > > [ 0.286956] node 0: [mem 0x0000001424000000-0x0000002023ffffff] > > > > This means that node0 starts in the middle of a memory section which is > > also in node1. memmap_init_zone tries to initialize padding of a section > > even when it is outside of the given pfn range because there are code > > paths (e.g. memory hotplug) which assume that the full worth of memory > > section is always initialized. In this particular case, though, such a > > range is already intialized and most likely already managed by the page > > allocator. Scribbling over those pages corrupts the internal state and > > likely blows up when any of those pages gets used. > > > > Reported-by: Robert Shteynfeld > > Fixes: 2830bf6f05fb ("mm, memory_hotplug: initialize struct pages for the full memory section") > > Cc: stable > > Signed-off-by: Michal Hocko > > --- > > mm/page_alloc.c | 12 ------------ > > 1 file changed, 12 deletions(-) > > > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c > > index d295c9bc01a8..35fdde041f5c 100644 > > --- a/mm/page_alloc.c > > +++ b/mm/page_alloc.c > > @@ -5701,18 +5701,6 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone, > > cond_resched(); > > } > > } > > -#ifdef CONFIG_SPARSEMEM > > - /* > > - * If the zone does not span the rest of the section then > > - * we should at least initialize those pages. Otherwise we > > - * could blow up on a poisoned page in some paths which depend > > - * on full sections being initialized (e.g. memory hotplug). > > - */ > > - while (end_pfn % PAGES_PER_SECTION) { > > - __init_single_page(pfn_to_page(end_pfn), end_pfn, zone, nid); > > - end_pfn++; > > - } > > -#endif > > } > > > > #ifdef CONFIG_ZONE_DEVICE > > Michal, I suppose that revert the commit > 2830bf6f05fb3e05bc4743274b806c821807a68 are return my issue > https://marc.info/?l=linux-mm&m=154499704718428 > Are any other better approach would be proposed for fixing my issue? As I've said above. We will need to remove the hardcoded PAGES_PER_SECTION assumption from the hotplug code. Mikhail had a patch which was dealing with the two specific sysfs file handlers and I will build on top of that. -- Michal Hocko SUSE Labs