Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp1377972ybi; Fri, 14 Jun 2019 13:43:28 -0700 (PDT) X-Google-Smtp-Source: APXvYqwgqUmYtLIqoO6V7FLYQ/EGDJANMltE8k/oATT4o8dgk1h4AzTx8pxgAW6PaPLJNrxDznFk X-Received: by 2002:a17:902:7083:: with SMTP id z3mr30527699plk.205.1560545008780; Fri, 14 Jun 2019 13:43:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1560545008; cv=none; d=google.com; s=arc-20160816; b=DQHllIrrjl3Otb/jl6e/c4Dli+nf91VmEwnMASerf5bUMv63yBE1GyX9xfDeAwlqRX zfG8rmwQP6Dp+SjOzuIFnFCQkIByiVUDC3cf2Lb4CEOnBiWJqxv5bwC6A9Scuvoe3eEw tD7RQf/1oslBi73YyGf/Z7oltEKncP+7Pu6Tq0AsZqt34lmZAEc2i28zDQhW3Hqcee8g 3g7R2A1zAmsBwOjT5PEkBX5WT1k67M9JjsJo+6vcj0n6gxSIBRUkVBp2JdziqF48aynN 6rsmdykpgpNcxNQhjCDen9aMBO1D1R6M19IvG25X/EwL2x3R4qVXsm2/brMp4y6g0zOG zK6A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id :dkim-signature; bh=TOfdLcBnSkKFa0etKu8MZcwr2ijJWBkKs1bxmcxDpBY=; b=wYmrxAs+rVIkzNxbDZeSB/HZ+seDiAzTXRFhax5sdtPFxb+ueOm5BkByFYPc8m09v7 5EBGDpejERCjyOSatzooN10plUmw2uTmxyrPrudRFAxMDqcH2tOcDqgd4zXTJEOu8c8u KuZyHyY+vdzfKU1839rZyylPIRiStqWp1vCNZwYaZ8HUgDQpVXzqE3YWu1p8pjYAINib 0hMMFrHGY6e1S3Xlq2x4PXJlPOjIz+IpoEdN430i6YExWpTGUZnuijEbOc5/ttIjmQvb KHIIHGgyO33IQtqgLc9NYd5+xhpMxFEDr38fAwzR1K/vahTVLkct7pmDYGXvveU4lHUz r5GQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=SWqmPKao; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e8si3310871pgs.308.2019.06.14.13.43.13; Fri, 14 Jun 2019 13:43:28 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@lca.pw header.s=google header.b=SWqmPKao; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726448AbfFNUnG (ORCPT + 99 others); Fri, 14 Jun 2019 16:43:06 -0400 Received: from mail-qt1-f196.google.com ([209.85.160.196]:45668 "EHLO mail-qt1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725991AbfFNUnG (ORCPT ); Fri, 14 Jun 2019 16:43:06 -0400 Received: by mail-qt1-f196.google.com with SMTP id j19so4001968qtr.12 for ; Fri, 14 Jun 2019 13:43:05 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=lca.pw; s=google; h=message-id:subject:from:to:cc:date:in-reply-to:references :mime-version:content-transfer-encoding; bh=TOfdLcBnSkKFa0etKu8MZcwr2ijJWBkKs1bxmcxDpBY=; b=SWqmPKaoD8N4ys6eJDesRML4qgnkieruLHQDsaJRoVoDe7pwP3byPAnIChbS0DQeeh /gKDidz47xsO8PVAyAvwJzMUs2DWwVfHFIk4de6zNlhLpnBqZHu3xIrPahMgIkEgrsdG msYg9a8jJrE7/3cbOBb/nTuLtBGn+70qHG3hsUMFduXhQDqZOnjwRkAPHoKxX7W+Y5zV H3On47vL/D8hx+Efoc8ND5H6CHPa17KL35JV0TxDVQBMoCGgV45+0kB/Q978UB6A3uFd kklSDals/I+OpMLvGvcfgqDkUGcdQby91dP8xdiIlslHEzHXphEg1zy101w+XV+ss70d nSzQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:message-id:subject:from:to:cc:date:in-reply-to :references:mime-version:content-transfer-encoding; bh=TOfdLcBnSkKFa0etKu8MZcwr2ijJWBkKs1bxmcxDpBY=; b=uRrRwhSce535y9wYYFPR9+PPTqO46HPPLcKtjGBWmvnYTGaNuYlI3glDZVE6rhW/VN FV31d5xOVxjxsSY8bzhW14UnSGX9ScBon0/BkDFOA+4b4YR6zt/A8X7nAm1W2ql9ljIu swB50Ylj8P5ZZQAt4g9v4k6LMWaCKiqoK/f9D6Hy6Jt8IvYElPGDL55324I+uhRr80qs JRC+LDYuozi4utD917GG+Odn+B7HMTznYux6lCSxsgsqVBq+6w8PjlLTRG25AbVI2WOR 4itZv0ZfdP7tkDl1V+rgKBPSLw8SpYvBQbOmoKUHtAWkIJrM4llPv/tbojyHJyxss0yX Zguw== X-Gm-Message-State: APjAAAV0jqYcAXysaUB+yqDXiBRRtaS230PCv6oLBwvFXSfrJP5YIUST oa+7UXFtrwX8n7eaTkkFHgU7Ow== X-Received: by 2002:ac8:18f0:: with SMTP id o45mr81653407qtk.273.1560544984932; Fri, 14 Jun 2019 13:43:04 -0700 (PDT) Received: from dhcp-41-57.bos.redhat.com (nat-pool-bos-t.redhat.com. [66.187.233.206]) by smtp.gmail.com with ESMTPSA id j62sm1793286qte.89.2019.06.14.13.43.03 (version=TLS1_2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128/128); Fri, 14 Jun 2019 13:43:04 -0700 (PDT) Message-ID: <1560544982.5154.24.camel@lca.pw> Subject: Re: [PATCH -next] mm/hotplug: skip bad PFNs from pfn_to_online_page() From: Qian Cai To: Dan Williams Cc: "Aneesh Kumar K.V" , Andrew Morton , Oscar Salvador , Linux MM , Linux Kernel Mailing List Date: Fri, 14 Jun 2019 16:43:02 -0400 In-Reply-To: References: <1560366952-10660-1-git-send-email-cai@lca.pw> <1560376072.5154.6.camel@lca.pw> <87lfy4ilvj.fsf@linux.ibm.com> <1560524365.5154.21.camel@lca.pw> <1560541220.5154.23.camel@lca.pw> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.22.6 (3.22.6-10.el7) Mime-Version: 1.0 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, 2019-06-14 at 12:48 -0700, Dan Williams wrote: > On Fri, Jun 14, 2019 at 12:40 PM Qian Cai wrote: > > > > On Fri, 2019-06-14 at 11:57 -0700, Dan Williams wrote: > > > On Fri, Jun 14, 2019 at 11:03 AM Dan Williams > > > wrote: > > > > > > > > On Fri, Jun 14, 2019 at 7:59 AM Qian Cai wrote: > > > > > > > > > > On Fri, 2019-06-14 at 14:28 +0530, Aneesh Kumar K.V wrote: > > > > > > Qian Cai writes: > > > > > > > > > > > > > > > > > > > 1) offline is busted [1]. It looks like test_pages_in_a_zone() > > > > > > > missed > > > > > > > the > > > > > > > same > > > > > > > pfn_section_valid() check. > > > > > > > > > > > > > > 2) powerpc booting is generating endless warnings [2]. In > > > > > > > vmemmap_populated() at > > > > > > > arch/powerpc/mm/init_64.c, I tried to change PAGES_PER_SECTION to > > > > > > > PAGES_PER_SUBSECTION, but it alone seems not enough. > > > > > > > > > > > > > > > > > > > Can you check with this change on ppc64.  I haven't reviewed this > > > > > > series > > > > > > yet. > > > > > > I did limited testing with change . Before merging this I need to go > > > > > > through the full series again. The vmemmap poplulate on ppc64 needs > > > > > > to > > > > > > handle two translation mode (hash and radix). With respect to vmemap > > > > > > hash doesn't setup a translation in the linux page table. Hence we > > > > > > need > > > > > > to make sure we don't try to setup a mapping for a range which is > > > > > > arleady convered by an existing mapping. > > > > > > > > > > It works fine. > > > > > > > > Strange... it would only change behavior if valid_section() is true > > > > when pfn_valid() is not or vice versa. They "should" be identical > > > > because subsection-size == section-size on PowerPC, at least with the > > > > current definition of SUBSECTION_SHIFT. I suspect maybe > > > > free_area_init_nodes() is too late to call subsection_map_init() for > > > > PowerPC. > > > > > > Can you give the attached incremental patch a try? This will break > > > support for doing sub-section hot-add in a section that was only > > > partially populated early at init, but that can be repaired later in > > > the series. First things first, don't regress. > > > > > > diff --git a/mm/page_alloc.c b/mm/page_alloc.c > > > index 874eb22d22e4..520c83aa0fec 100644 > > > --- a/mm/page_alloc.c > > > +++ b/mm/page_alloc.c > > > @@ -7286,12 +7286,10 @@ void __init free_area_init_nodes(unsigned long > > > *max_zone_pfn) > > > > > >         /* Print out the early node map */ > > >         pr_info("Early memory node ranges\n"); > > > -       for_each_mem_pfn_range(i, MAX_NUMNODES, &start_pfn, &end_pfn, > > > &nid) { > > > +       for_each_mem_pfn_range(i, MAX_NUMNODES, &start_pfn, &end_pfn, > > > &nid) > > >                 pr_info("  node %3d: [mem %#018Lx-%#018Lx]\n", nid, > > >                         (u64)start_pfn << PAGE_SHIFT, > > >                         ((u64)end_pfn << PAGE_SHIFT) - 1); > > > -               subsection_map_init(start_pfn, end_pfn - start_pfn); > > > -       } > > > > > >         /* Initialise every node */ > > >         mminit_verify_pageflags_layout(); > > > diff --git a/mm/sparse.c b/mm/sparse.c > > > index 0baa2e55cfdd..bca8e6fa72d2 100644 > > > --- a/mm/sparse.c > > > +++ b/mm/sparse.c > > > @@ -533,6 +533,7 @@ static void __init sparse_init_nid(int nid, > > > unsigned long pnum_begin, > > >                 } > > >                 check_usemap_section_nr(nid, usage); > > >                 sparse_init_one_section(__nr_to_section(pnum), pnum, > > > map, usage); > > > +               subsection_map_init(section_nr_to_pfn(pnum), > > > PAGES_PER_SECTION); > > >                 usage = (void *) usage + mem_section_usage_size(); > > >         } > > >         sparse_buffer_fini(); > > > > It works fine except it starts to trigger slab debugging errors during boot. > > Not > > sure if it is related yet. > > If you want you can give this branch a try if you suspect something > else in -next is triggering the slab warning. > > https://git.kernel.org/pub/scm/linux/kernel/git/djbw/nvdimm.git/log/?h=subsect > ion-v9 > > It's the original v9 patchset + dependencies backported to v5.2-rc4. > > I otherwise don't see how subsections would effect slab caches. It works fine there.