Date: Mon, 10 Dec 2018 14:24:51 +0100
From: Michal Hocko
To: Mikhail Zaslonko
Cc: akpm@linux-foundation.org, linux-kernel@vger.kernel.org, linux-mm@kvack.org, Pavel.Tatashin@microsoft.com, schwidefsky@de.ibm.com, heiko.carstens@de.ibm.com, gerald.schaefer@de.ibm.com
Subject: Re: [PATCH 1/1] mm, memory_hotplug: Initialize struct pages for the full memory section
Message-ID: <20181210132451.GO1286@dhcp22.suse.cz>
References: <20181210130712.30148-1-zaslonko@linux.ibm.com> <20181210130712.30148-2-zaslonko@linux.ibm.com>
In-Reply-To: <20181210130712.30148-2-zaslonko@linux.ibm.com>

On Mon 10-12-18 14:07:12, Mikhail Zaslonko wrote:
> If memory end is not aligned with the sparse memory section boundary, the
> mapping of such a section is only partly initialized.

It would be great to mention how you can end up in a situation like
this (a user-provided memmap or strange HW).

> This may lead to
> VM_BUG_ON due to uninitialized struct page access from
> is_mem_section_removable() or test_pages_in_a_zone() function triggered by
> memory_hotplug sysfs handlers:
>
>  page:000003d082008000 is uninitialized and poisoned
>  page dumped because: VM_BUG_ON_PAGE(PagePoisoned(p))
>  Call Trace:
>  ([<0000000000385b26>] test_pages_in_a_zone+0xde/0x160)
>   [<00000000008f15c4>] show_valid_zones+0x5c/0x190
>   [<00000000008cf9c4>] dev_attr_show+0x34/0x70
>   [<0000000000463ad0>] sysfs_kf_seq_show+0xc8/0x148
>   [<00000000003e4194>] seq_read+0x204/0x480
>   [<00000000003b53ea>] __vfs_read+0x32/0x178
>   [<00000000003b55b2>] vfs_read+0x82/0x138
>   [<00000000003b5be2>] ksys_read+0x5a/0xb0
>   [<0000000000b86ba0>] system_call+0xdc/0x2d8
>  Last Breaking-Event-Address:
>   [<0000000000385b26>] test_pages_in_a_zone+0xde/0x160
>  Kernel panic - not syncing: Fatal exception: panic_on_oops
>
> Fix the problem by initializing the last memory section of the highest zone
> in memmap_init_zone() till the very end, even if it goes beyond the zone
> end.

Why do we need to restrict this to the highest zone? In other words, why
can't we do what I was suggesting earlier [1]? What prevents other zones
from having an incomplete section boundary?
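
To make the question concrete, here is a small userspace sketch (not
kernel code) of the arithmetic involved. The struct zone_range type, the
helpers section_round_up() and uncovered_tail(), and the
PAGES_PER_SECTION value are all hypothetical and chosen only for
illustration. It assumes zones are sorted and non-overlapping, and it
says nothing about whether such a layout actually occurs on real
hardware.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical value for illustration; the real one is arch-dependent. */
#define PAGES_PER_SECTION 0x10000UL

/* Simplified stand-in for a zone's pfn range [start_pfn, end_pfn). */
struct zone_range {
	unsigned long start_pfn;
	unsigned long end_pfn;
};

/* Round pfn up to the next section boundary (identity if already aligned). */
static unsigned long section_round_up(unsigned long pfn)
{
	return (pfn + PAGES_PER_SECTION - 1) / PAGES_PER_SECTION
		* PAGES_PER_SECTION;
}

/*
 * Number of pfns between the end of zone i and either the end of its
 * last section or the start of the next zone, whichever comes first.
 * In this model, no zone covers those pfns.
 */
static unsigned long uncovered_tail(const struct zone_range *zones,
				    size_t nr, size_t i)
{
	unsigned long end, limit;

	assert(i < nr);
	end = zones[i].end_pfn;
	limit = section_round_up(end);

	/* A following zone that starts inside the section covers the rest. */
	if (i + 1 < nr && zones[i + 1].start_pfn < limit)
		limit = zones[i + 1].start_pfn;

	return limit - end;
}
```

With a hypothetical layout of [0x0, 0x18000) followed by
[0x1c000, 0x38000), both zones report a non-zero tail in this model,
while a check of the form end_pfn == max_pfn only matches the second
one.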

[1] http://lkml.kernel.org/r/20181105183533.GQ4361@dhcp22.suse.cz

> Signed-off-by: Mikhail Zaslonko
> Reviewed-by: Gerald Schaefer
> Cc:
> ---
>  mm/page_alloc.c | 15 +++++++++++++++
>  1 file changed, 15 insertions(+)
>
> diff --git a/mm/page_alloc.c b/mm/page_alloc.c
> index 2ec9cc407216..41ef5508e5f1 100644
> --- a/mm/page_alloc.c
> +++ b/mm/page_alloc.c
> @@ -5542,6 +5542,21 @@ void __meminit memmap_init_zone(unsigned long size, int nid, unsigned long zone,
>  			cond_resched();
>  		}
>  	}
> +#ifdef CONFIG_SPARSEMEM
> +	/*
> +	 * If there is no zone spanning the rest of the section
> +	 * then we should at least initialize those pages. Otherwise we
> +	 * could blow up on a poisoned page in some paths which depend
> +	 * on full sections being initialized (e.g. memory hotplug).
> +	 */
> +	if (end_pfn == max_pfn) {
> +		while (end_pfn % PAGES_PER_SECTION) {
> +			__init_single_page(pfn_to_page(end_pfn), end_pfn, zone,
> +					   nid);
> +			end_pfn++;
> +		}
> +	}
> +#endif
>  }
>
>  #ifdef CONFIG_ZONE_DEVICE
> --
> 2.16.4

-- 
Michal Hocko
SUSE Labs
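
For readers following the quoted hunk, the padding loop can be modelled
in userspace as below. This is only a sketch under stated assumptions:
it counts the pfns the loop would visit and does not touch any struct
page. The helper names pad_pages_loop(), pad_pages_closed() and
pad_pages(), and the PAGES_PER_SECTION value, are hypothetical and not
part of the patch.

```c
#include <assert.h>

/* Hypothetical value for illustration; the real one is arch-dependent. */
#define PAGES_PER_SECTION 0x10000UL

/*
 * Same loop shape as the quoted patch: advance end_pfn until it reaches
 * a section boundary, counting the pfns that would be initialized.
 */
static unsigned long pad_pages_loop(unsigned long end_pfn)
{
	unsigned long n = 0;

	while (end_pfn % PAGES_PER_SECTION) {
		end_pfn++;
		n++;
	}
	return n;
}

/* Closed form of the same count: distance to the next section boundary. */
static unsigned long pad_pages_closed(unsigned long end_pfn)
{
	unsigned long rem = end_pfn % PAGES_PER_SECTION;

	return rem ? PAGES_PER_SECTION - rem : 0;
}

/* Convenience wrapper that cross-checks the two forms against each other. */
static unsigned long pad_pages(unsigned long end_pfn)
{
	unsigned long n = pad_pages_closed(end_pfn);

	assert(n == pad_pages_loop(end_pfn));
	return n;
}
```

An already aligned end_pfn yields zero in both forms, so the loop is a
no-op in that case; an end_pfn one page short of a boundary yields
exactly one page.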