Received: by 10.213.65.68 with SMTP id h4csp20757imn; Mon, 12 Mar 2018 05:29:02 -0700 (PDT) X-Google-Smtp-Source: AG47ELvz+j9fwggDbeQCVXQ2QqNtEr+Fe1qy1P9dbGFNQGs8JnBJJ5Q/pw+4kGvLurVKmK64UBNL X-Received: by 10.99.164.25 with SMTP id c25mr6554735pgf.235.1520857742700; Mon, 12 Mar 2018 05:29:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1520857742; cv=none; d=google.com; s=arc-20160816; b=mFQS3dSsMb6mARVzxavUqqkcPE/FbFqEs8w1dTU+zSiWZZNzvDY9/RY5+O3JxTNG9R KMqybXbmVJOGzwSEI1G49l1NxUy/E5jptgbv6+9TnfdSrjASSvHxelsd4RtLhFhszVnx pfQFYtaU1Jg3mHwolh3OrbrY5pSU0eVc3JDKbU+PiSk0nXpA8rOZ/XDG4ZKBDfW3dA5K baXTOk3GmpIPmCV1OLziAlstS6XO5speVsT2ceSu1hX+F+vxws/zUO6M3uCGJnmke/He lNXLksX13GjyxsVjGaVJQSGF7j3m90uBT8yeLetH3uy6kFxMzPA570Y0gFXiTOQi7Qfn lDFw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=918F79RLMuLI2TuaViLFArHBgDz+JHJrLMsja2W1KI4=; b=iZ+CZgFyYqDyFibfyAgZykajtcomuucIZfLrALhOKnAAnXvNU2X0gz0mIE0xuw0VRH PpaagqxqBU7agFb+k+60awM8eM5WYVZWrXJiaHVJ1ufHGTPpRfM/qXEDCqNj/PRyyc8A sceP8ylQE2KtPAQlDvcixeMIKviQrQnPOzIKtlDA8SVZHM4zUJoWtTgbNLWIPNC/azMI ecCHKfgiW9cdbEPuXPiqthJh+giWUmQ1uCN39HohN+ygAwvhefbYDReIAv9Nin1oV2qn IV7srpY3/xleQZCGa3voa92SgrSD5tbpWltQwPGl1F5VjPasJcYu9ovbyol6uCXXCxeT DinQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=JCaDN2R3; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 5si5687800pfx.208.2018.03.12.05.28.47; Mon, 12 Mar 2018 05:29:02 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=JCaDN2R3; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932294AbeCLM07 (ORCPT + 99 others); Mon, 12 Mar 2018 08:26:59 -0400 Received: from mail-qt0-f193.google.com ([209.85.216.193]:38581 "EHLO mail-qt0-f193.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751358AbeCLM05 (ORCPT ); Mon, 12 Mar 2018 08:26:57 -0400 Received: by mail-qt0-f193.google.com with SMTP id n12so18494795qtl.5; Mon, 12 Mar 2018 05:26:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:sender:in-reply-to:references:from:date:message-id :subject:to:cc; bh=918F79RLMuLI2TuaViLFArHBgDz+JHJrLMsja2W1KI4=; b=JCaDN2R3y3ij8/YBkgSbXW8CTDzSDMGTiutb+ALI32KhFcgZLio7I4eeBxa7212X2l tdkkHzhwZs630A3S13JJOjopsDhdsKtym93r/o8AbD20A/QIW1w1CcSaWj0qTI6TUIhs XupqGfVPu0RtEqhSxRTJi2k+ieaVESZoqRJsL9xqx6OaG38JivvxrX6VPwki6/q+xldh v33zWqn8ay2wmi6W1Q9s3j79Hti/jEcgBpzJPic6YEZzDWVK/vhWSvVyoayi7maZU16L DF3iDzgsHIk5unGLBDGF/n1r4+VK2wlrTB7cvx/sNBX0FeifSyxp1LCnbaANhWT3e0il QGeg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:sender:in-reply-to:references:from :date:message-id:subject:to:cc; bh=918F79RLMuLI2TuaViLFArHBgDz+JHJrLMsja2W1KI4=; b=t6w199AZUIOTw3iEOsY5/ZgRys4Hukzttb7L6Odc9vIQoNAi3MA/jYzDfus9dObZVE i3uA+LFGbg18IUfMgwr+qvcBkogmW5o6Fd5DHnYsw+a9UvpiaK/3owAtFP245zBqbDl8 mp1hKY5MS+zmoy7sNpTDZSBwsGC/4GI6FzgUeODo536oQmsuiJsjoDwh/zqh++AIR+UH /0EaxCwqfpcqxinHfw2m2Sioy+mIBI/enhsus7Mr6btis/ONbkJBz+DCdoZqhDFXm6hL B1ad95ST4JDdd8LQR8LMJy68Ci2nbkYGM4XXyUCBDZGeT/0VcDWUT3Unvq6JWmreP0WB fv+g== X-Gm-Message-State: AElRT7EezaE80P920z13VAK6GZFggwCHNcXTgjWSRzvv/MfKzGe6v7w0 DKxgnyM/G42o5YA0nQRIORybaTPfXnr5DIIjxmI= X-Received: by 10.200.112.19 with SMTP id x19mr11835948qtm.85.1520857617196; Mon, 12 Mar 2018 05:26:57 -0700 (PDT) MIME-Version: 1.0 Received: by 10.12.195.207 with HTTP; Mon, 12 Mar 2018 05:26:56 -0700 (PDT) In-Reply-To: References: <1519908465-12328-1-git-send-email-neelx@redhat.com> <0485727b2e82da7efbce5f6ba42524b429d0391a.1520011945.git.neelx@redhat.com> <20180302164052.5eea1b896e3a7125d1e1f23a@linux-foundation.org> From: Sudeep Holla Date: Mon, 12 Mar 2018 12:26:56 +0000 X-Google-Sender-Auth: 2vd7CMveUFQ6z_Ut_Es_SGY8QNA Message-ID: Subject: Re: [PATCH v3 2/2] mm/page_alloc: fix memmap_init_zone pageblock alignment To: Daniel Vacek Cc: Andrew Morton , open list , linux-mm@kvack.org, Michal Hocko , Vlastimil Babka , Mel Gorman , Pavel Tatashin , Paul Burton , stable@vger.kernel.org, Sudeep Holla Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, I couldn't find the exact mail corresponding to the patch merged in v4.16-rc5 but commit 864b75f9d6b01 "mm/page_alloc: fix memmap_init_zone pageblock alignment" cause boot hang on my ARM64 platform. Log: [ 0.000000] NUMA: No NUMA configuration found [ 0.000000] NUMA: Faking a node at [mem 0x0000000000000000-0x00000009ffffffff] [ 0.000000] NUMA: NODE_DATA [mem 0x9fffcb480-0x9fffccf7f] [ 0.000000] Zone ranges: [ 0.000000] DMA32 [mem 0x0000000080000000-0x00000000ffffffff] [ 0.000000] Normal [mem 0x0000000100000000-0x00000009ffffffff] [ 0.000000] Movable zone start for each node [ 0.000000] Early memory node ranges [ 0.000000] node 0: [mem 0x0000000080000000-0x00000000f8f9afff] [ 0.000000] node 0: [mem 0x00000000f8f9b000-0x00000000f908ffff] [ 0.000000] node 0: [mem 0x00000000f9090000-0x00000000f914ffff] [ 0.000000] node 0: [mem 0x00000000f9150000-0x00000000f920ffff] [ 0.000000] node 0: [mem 0x00000000f9210000-0x00000000f922ffff] [ 0.000000] node 0: [mem 0x00000000f9230000-0x00000000f95bffff] [ 0.000000] node 0: [mem 0x00000000f95c0000-0x00000000fe58ffff] [ 0.000000] node 0: [mem 0x00000000fe590000-0x00000000fe5cffff] [ 0.000000] node 0: [mem 0x00000000fe5d0000-0x00000000fe5dffff] [ 0.000000] node 0: [mem 0x00000000fe5e0000-0x00000000fe62ffff] [ 0.000000] node 0: [mem 0x00000000fe630000-0x00000000feffffff] [ 0.000000] node 0: [mem 0x0000000880000000-0x00000009ffffffff] [ 0.000000] Initmem setup node 0 [mem 0x0000000080000000-0x00000009ffffffff] On Sat, Mar 3, 2018 at 1:08 AM, Daniel Vacek wrote: > On Sat, Mar 3, 2018 at 1:40 AM, Andrew Morton wrote: >> On Sat, 3 Mar 2018 01:12:26 +0100 Daniel Vacek wrote: >> >>> Commit b92df1de5d28 ("mm: page_alloc: skip over regions of invalid pfns >>> where possible") introduced a bug where move_freepages() triggers a >>> VM_BUG_ON() on uninitialized page structure due to pageblock alignment. >> >> b92df1de5d28 was merged a year ago. Can you suggest why this hasn't >> been reported before now? > > Yeah. I was surprised myself I couldn't find a fix to backport to > RHEL. But actually customers started to report this as soon as 7.4 > (where b92df1de5d28 was merged in RHEL) was released. I remember > reports from September/October-ish times. It's not easily reproduced > and happens on a handful of machines only. I guess that's why. But > that does not make it less serious, I think. > > Though there actually is a report here: > https://bugzilla.kernel.org/show_bug.cgi?id=196443 > > And there are reports for Fedora from July: > https://bugzilla.redhat.com/show_bug.cgi?id=1473242 > and CentOS: https://bugs.centos.org/view.php?id=13964 > and we internally track several dozens reports for RHEL bug > https://bugzilla.redhat.com/show_bug.cgi?id=1525121 > > Enough? ;-) > >> This makes me wonder whether a -stable backport is really needed... > > For some machines it definitely is. Won't hurt either, IMHO. > > --nX