Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp4432911imu; Tue, 29 Jan 2019 01:13:47 -0800 (PST) X-Google-Smtp-Source: ALg8bN7J6SvUz3IfyB1Qk28YIDu6hsENu+EYFHRbXa5EII+rY/kc6zx/fTfevmRe1Vs5+cKMmUnw X-Received: by 2002:a17:902:b40d:: with SMTP id x13mr25575702plr.237.1548753227742; Tue, 29 Jan 2019 01:13:47 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548753227; cv=none; d=google.com; s=arc-20160816; b=Ke+dywhhSQJ+DXosc66uGtqPSYC5pRFtv/yJj57BqDgzF4MuBSLjMFIN51V7URHrmJ 6qKYgLSosbonrt8wPoLBMGqECeTJnPPlGYFwx21FM7sDQp8RHCYnUjDK/I3oU+5DLFOl bfiGdoPHL7kZwyl6lunBFZiCdb/CL/yVdWr0ChyKYwBWtDqUfI4Gk5LgkgQkQfZiHzR9 i5srvlsmBXpbo7esCQSz64XavBrHFoI1QU8Vwom48NXDQMciT0pmZGaMFQ59KDY+cbBF L1t7kXVqAKMEPEzwLrWx30flbdCPqEaGY9BtAbBAmyL8GQiPTmGv+lUcvvGo5gBX+IxK bc2w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=SC7CocLDdA6XyArTDMOCU2Ngc7FJpHL2Kcc3DR+AUNM=; b=SPRZ7TfC/8FnIk0EcC3LwxtSe5RURAF4oZUShHa5dDjDC+zVicZ3J2zgSkkoN/1FfT YeHNyR2JoQql2x6j37CSMmaWfmy+SKFdOEzBhSOpm3ysu0xo9U6QddIvAak+lvtbntUF flqOA17LXsvBz+LKIhHjBgytX2z36fe1nquh/ClL/8oum5ovo6lhv1hKt2Qz9nEguNrT jWD9Iv1hhGEZKPqmrUZ0DH846GaY/tm8r8h9TKkIxCd8aGzeia+RJR8EcOXPKodFvYxe aWS7fLU+/cVrZuSDbxkb1KS5IOsLmXZKh7am3cQMAXXSyIC9K4WHEsHGGvKe86bBvTJF B8kA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 133si12404948pfw.64.2019.01.29.01.13.32; Tue, 29 Jan 2019 01:13:47 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727724AbfA2JNY (ORCPT + 99 others); Tue, 29 Jan 2019 04:13:24 -0500 Received: from mx2.suse.de ([195.135.220.15]:55248 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725298AbfA2JNX (ORCPT ); Tue, 29 Jan 2019 04:13:23 -0500 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id CFC79AE5E; Tue, 29 Jan 2019 09:13:22 +0000 (UTC) Date: Tue, 29 Jan 2019 10:13:21 +0100 From: Michal Hocko To: Oscar Salvador Cc: Mikhail Zaslonko , Mikhail Gavrilov , Andrew Morton , Pavel Tatashin , schwidefsky@de.ibm.com, heiko.carstens@de.ibm.com, gerald.schaefer@de.ibm.com, linux-mm@kvack.org, LKML Subject: Re: [PATCH 2/2] mm, memory_hotplug: test_pages_in_a_zone do not pass the end of zone Message-ID: <20190129091321.GH18811@dhcp22.suse.cz> References: <20190128144506.15603-1-mhocko@kernel.org> <20190128144506.15603-3-mhocko@kernel.org> <20190129090908.oms43oyjicozkvzu@d104.suse.de> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20190129090908.oms43oyjicozkvzu@d104.suse.de> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue 29-01-19 10:09:08, Oscar Salvador wrote: > On Mon, Jan 28, 2019 at 03:45:06PM +0100, Michal Hocko wrote: > > From: Mikhail Zaslonko > > > > If memory end is not aligned with the sparse memory section boundary, the > > mapping of such a section is only partly initialized. This may lead to > > VM_BUG_ON due to uninitialized struct pages access from test_pages_in_a_zone() > > function triggered by memory_hotplug sysfs handlers. > > > > Here are the the panic examples: > > CONFIG_DEBUG_VM_PGFLAGS=y > > kernel parameter mem=2050M > > -------------------------- > > page:000003d082008000 is uninitialized and poisoned > > page dumped because: VM_BUG_ON_PAGE(PagePoisoned(p)) > > Call Trace: > > ([<0000000000385b26>] test_pages_in_a_zone+0xde/0x160) > > [<00000000008f15c4>] show_valid_zones+0x5c/0x190 > > [<00000000008cf9c4>] dev_attr_show+0x34/0x70 > > [<0000000000463ad0>] sysfs_kf_seq_show+0xc8/0x148 > > [<00000000003e4194>] seq_read+0x204/0x480 > > [<00000000003b53ea>] __vfs_read+0x32/0x178 > > [<00000000003b55b2>] vfs_read+0x82/0x138 > > [<00000000003b5be2>] ksys_read+0x5a/0xb0 > > [<0000000000b86ba0>] system_call+0xdc/0x2d8 > > Last Breaking-Event-Address: > > [<0000000000385b26>] test_pages_in_a_zone+0xde/0x160 > > Kernel panic - not syncing: Fatal exception: panic_on_oops > > > > Fix this by checking whether the pfn to check is within the zone. > > > > [mhocko@suse.com: separated this change from > > http://lkml.kernel.org/r/20181105150401.97287-2-zaslonko@linux.ibm.com] > > Signed-off-by: Mikhail Zaslonko > > Signed-off-by: Michal Hocko > > Looks good to me: > > Reviewed-by: Oscar Salvador > > > --- > > mm/memory_hotplug.c | 3 +++ > > 1 file changed, 3 insertions(+) > > > > diff --git a/mm/memory_hotplug.c b/mm/memory_hotplug.c > > index 07872789d778..7711d0e327b6 100644 > > --- a/mm/memory_hotplug.c > > +++ b/mm/memory_hotplug.c > > @@ -1274,6 +1274,9 @@ int test_pages_in_a_zone(unsigned long start_pfn, unsigned long end_pfn, > > i++; > > if (i == MAX_ORDER_NR_PAGES || pfn + i >= end_pfn) > > continue; > > + /* Check if we got outside of the zone */ > > + if (zone && !zone_spans_pfn(zone, pfn + i)) > > + return 0; > > page = pfn_to_page(pfn + i); > > Since we are already checking if the zone spans that pfn, is it safe to get > rid of the below check? Or maybe not because we might have intersected zones? Exactly. The new zone might start at the next pfn. Look for the cover leter for such an example. > > if (zone && page_zone(page) != zone) > > return 0; > > -- > Oscar Salvador > SUSE L3 -- Michal Hocko SUSE Labs