Received: by 10.192.165.148 with SMTP id m20csp617927imm; Wed, 9 May 2018 19:58:03 -0700 (PDT) X-Google-Smtp-Source: AB8JxZryUCjE2I/Mwe80yNzxQvq8V6apeI+3QsS+ug5OpilyaRNhegTnrt8XXekRoSH8rxlvtVYY X-Received: by 2002:a63:4384:: with SMTP id q126-v6mr37006831pga.294.1525921082963; Wed, 09 May 2018 19:58:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525921082; cv=none; d=google.com; s=arc-20160816; b=CAFo80FXo1FEe5wrLE+B6T7+jH9Y2/dmePBTaUXDvgU3lqD5MYUAWjTe3ClRVHVAeN pp3awHXpSUVkk4DtMYJ/TwO6jnl5ZrA1dXJQlMAYRXk5Lyb0dH+2SuDSU6vz1llvVedM 9xppl/0VPgVt6rPyaEVkDkwPMh+d325IjbZQQsl7XE7XlPQqc94EJgw3jUJ1YAcBP/vX WYoE1TOCUDKe83INevTqAsJ+bqBH99P7bPcqhx2n1i1vTaZB3fPp4JuwnEj+AaBOrL/R uNGkPlpnM+Q4e2znxPe8I2eI3L7sId6XvE4uXSOObn8fW1IAu1i0k5HGFTSVO1Pw1cg1 uKiw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :references:in-reply-to:mime-version:dkim-signature :arc-authentication-results; bh=QIqxFVaRtlZYbxX4WWX10w7O5sXUDVknWlUGNGdOoP4=; b=v8G3vUGYviCzZBmvPpPsTEZ9tweWoclgPsYbIC7rJa/v+kxQ6326aA0CZhLuQKipaq qeD5mdxWNCgAYmi8mdQpwCeijXTH0sXzKk6Gj4VGcm9J9Wkcz7kIAnMQudCJ9Li1aAt1 RxjQ9bxcdMl2la6jt9Yl5MGh/Bh5qUckGa10IpoROgdbS03ukwgIXfHiIXsSQLcRrdYc OQISbYpWXvEx1lLKmwG2fAuJqQ44Nijy23T65dAsHp0MLqhoQbTE7wXtxI9K4vHgVaGo 1e6hyaRlFmJdvhJaSVgdcAB4B5WOBm0CScCOFPuXGtJef6JU8C6jOQ9qKyrS6iXCGBSX jeVQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=MeZe7tXi; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z14-v6si22801785pgc.617.2018.05.09.19.57.48; Wed, 09 May 2018 19:58:02 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=MeZe7tXi; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756545AbeEJC5i (ORCPT + 99 others); Wed, 9 May 2018 22:57:38 -0400 Received: from mail-ot0-f176.google.com ([74.125.82.176]:41059 "EHLO mail-ot0-f176.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1756359AbeEJC5g (ORCPT ); Wed, 9 May 2018 22:57:36 -0400 Received: by mail-ot0-f176.google.com with SMTP id t1-v6so694553oth.8; Wed, 09 May 2018 19:57:36 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc; bh=QIqxFVaRtlZYbxX4WWX10w7O5sXUDVknWlUGNGdOoP4=; b=MeZe7tXiS2x5LYsplQcr13MyE7kEWG3jL3QoTboETjKVEFjEl9pE/FxYQIpRZrOF5S HySA9dH0zeSPi22ADI901Gw9qUiikEAqxN22lC0S/TmT1ww8yPEawdy0n7fDxhwSos8+ bP1H+ZBjNFeVVlBCVp6vma+OAs4x8YDYWoao53bv7TS/I/OoUgaoVkebIESyEIrEHgsv ZCb8RBN0+pKtayAwmJO6lSKLBP6T8J5/q+vMgq1DfHwHWOjmI9whBI6uaO9dz/B41xnk 5o6bOmWYG8kH6C10of4Nmy0Y17ow30vfMFBUy7UGWHAI6GtPGbPFG9gYzMmr6950Ou+8 dhpQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc; bh=QIqxFVaRtlZYbxX4WWX10w7O5sXUDVknWlUGNGdOoP4=; b=nj+bL2a10RG1+PqmdePy6c/VEiFpU8wylm0hs6OpkKuSOjEKWZmVgYm+hrM1sbVeus bIKytWnnLQKT6eIiFt61WGB6xWbc5lwSsjaHxv1W+ly7G6ItdJpkSLjAHIDg7xLI4pkU cnXVP7LNGsC/cPyfIJnh0bPqVu5a1U2BZzrm9BAHcKau8E9gjNMVROK1uC2RGCpfbOxf NX051McRKDLuC7xnOs0ek0y6bCvpxw+fvnUkodNyi3frXas/DDYe1KacwnvtVQknMSTT gk2wEPNqihTSAJFdqZYL7dgzsZF5o0wjrVzQKHd85MjscOVhIBu/1bXz6p+WHf7ZgeX7 cqyw== X-Gm-Message-State: ALQs6tB07pkBSWnRWmO0FnjcQn0DzX+iW4vyz6lc6szjpXziuaSoweT8 pGg7SIDp1BjsOnVnvqbK8KB8QwdZFg2AKEadTGs= X-Received: by 2002:a9d:1814:: with SMTP id b20-v6mr15564836ote.116.1525921055722; Wed, 09 May 2018 19:57:35 -0700 (PDT) MIME-Version: 1.0 Received: by 10.201.88.68 with HTTP; Wed, 9 May 2018 19:57:35 -0700 (PDT) In-Reply-To: <20180509125611.GT32366@dhcp22.suse.cz> References: <20180411104832.GF23400@dhcp22.suse.cz> <20180509122454.GR32366@dhcp22.suse.cz> <20180509125611.GT32366@dhcp22.suse.cz> From: Ganapatrao Kulkarni Date: Thu, 10 May 2018 08:27:35 +0530 Message-ID: Subject: Re: fd3e45436660 ("ACPI / NUMA: ia64: Parse all entries of SRAT memory affinity table") To: Michal Hocko Cc: Ganapatrao Kulkarni , Tony Luck , "Rafael J. Wysocki" , tiantao6@huawei.com, LKML , linux-acpi@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, May 9, 2018 at 6:26 PM, Michal Hocko wrote: > On Wed 09-05-18 18:07:16, Ganapatrao Kulkarni wrote: >> Hi Michal >> >> >> On Wed, May 9, 2018 at 5:54 PM, Michal Hocko wrote: >> > On Wed 11-04-18 12:48:32, Michal Hocko wrote: >> >> Hi, >> >> my attention was brought to the %subj commit and either I am missing >> >> something or the patch is quite dubious. What is it actually trying to >> >> fix? If a BIOS/FW provides more memblocks than the limit then we would >> >> get misleading numa topology (numactl -H output) but is the situation >> >> much better with it applied? Numa init code will refuse to init more >> >> memblocks than the limit and falls back to dummy_numa_init (AFAICS) >> >> which will break the topology again and numactl -H will have a >> >> misleading output anyway. >> >> IIRC, the MEMBLOCK beyond max limit getting dropped from visible >> memory(partial drop from a node). >> this patch removed any upper limit on memblocks and allowed to parse >> all entries of SRAT. > > Yeah I've understood that much. My question is, however, why do we care > about parsing the NUMA topology when we fallback into a single NUMA node > anyway? Or do I misunderstand the code? I do not have any platform with > that many memblocks. IMHO, this fix is very much logical by removing the SRAT parsing restriction. below is the crash log which made us to debug and eventually fix with this patch. [ 0.000000] NUMA: Adding memblock [0x80000000 - 0xfeffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0x80000000-0xfeffffff] [ 0.000000] NUMA: Adding memblock [0x880000000 - 0xffcffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0x880000000-0xffcffffff] [ 0.000000] NUMA: Adding memblock [0xffd000000 - 0xfffffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0xffd000000-0xfffffffff] [ 0.000000] NUMA: Adding memblock [0x8800000000 - 0x8bfcffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0x8800000000-0x8bfcffffff] [ 0.000000] NUMA: Adding memblock [0x8bfd000000 - 0x8ffcffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0x8bfd000000-0x8ffcffffff] [ 0.000000] NUMA: Adding memblock [0x8ffd000000 - 0x93fcffffff] on node 0 [ 0.000000] ACPI: SRAT: Node 0 PXM 0 [mem 0x8ffd000000-0x93fcffffff] [ 0.000000] NUMA: Adding memblock [0x93fd000000 - 0x9bfcffffff] on node 1 [ 0.000000] ACPI: SRAT: Node 1 PXM 1 [mem 0x93fd000000-0x9bfcffffff] [ 0.000000] NUMA: Adding memblock [0x9bfd000000 - 0x9ffcffffff] on node 1 [ 0.000000] ACPI: SRAT: Node 1 PXM 1 [mem 0x9bfd000000-0x9ffcffffff] [ 0.000000] NUMA: Warning: invalid memblk node 4 [mem 0x9ffd000000-0xa7fcffffff] [ 0.000000] NUMA: Faking a node at [mem 0x0000000000000000-0x000000a7fcffffff] [ 0.000000] NUMA: Adding memblock [0x802f0000 - 0x802fffff] on node 0 [ 0.000000] NUMA: Adding memblock [0x80300000 - 0xbfffffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xc4000000 - 0xf5efffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xf5f00000 - 0xf5f6ffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xf5f70000 - 0xf603ffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xf6040000 - 0xf667ffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xf6680000 - 0xfe45ffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xfe460000 - 0xfe4effff] on node 0 [ 0.000000] NUMA: Adding memblock [0xfe4f0000 - 0xfe4fffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xfe500000 - 0xfe61ffff] on node 0 [ 0.000000] NUMA: Adding memblock [0xfe620000 - 0xfeffffff] on node 0 [ 0.000000] NUMA: Adding memblock [0x880000000 - 0xfffffffff] on node 0 [ 0.000000] NUMA: Adding memblock [0x8800000000 - 0x93fcffffff] on node 0 [ 0.000000] NUMA: Adding memblock [0x93fd000000 - 0x9ffcffffff] on node 0 [ 0.000000] NUMA: Warning: invalid memblk node 4 [mem 0x9ffd000000-0xa7fcffffff] [ 0.000000] Unable to handle kernel NULL pointer dereference at virtual address 00001b40 [ 0.000000] pgd = fffffc0009570000 [ 0.000000] [00001b40] *pgd=000000a7fcfe0003, *pud=000000a7fcfe0003, *pmd=000000a7fcfe0003, *pte=0000000000000000 [ 0.000000] Internal error: Oops: 96000006 [#1] SMP [ 0.000000] Modules linked in: [ 0.000000] CPU: 0 PID: 0 Comm: swapper Not tainted 4.11.12-11.cavium.ml.aarch64 #1 [ 0.000000] Hardware name: (null) (DT) [ 0.000000] task: fffffc0008d35780 task.stack: fffffc0008cf0000 [ 0.000000] PC is at sparse_early_usemaps_alloc_node+0x20/0xb4 [ 0.000000] LR is at sparse_init+0xec/0x204 [ 0.000000] pc : [] lr : [] pstate: 80000089 [ 0.000000] sp : fffffc0008cf3e40 thanks Ganapat > > -- > Michal Hocko > SUSE Labs