Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S966303AbcKJUwM (ORCPT ); Thu, 10 Nov 2016 15:52:12 -0500 Received: from mail.kernel.org ([198.145.29.136]:54796 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S966280AbcKJUwL (ORCPT ); Thu, 10 Nov 2016 15:52:11 -0500 MIME-Version: 1.0 In-Reply-To: <93b7769a-4907-176e-9f18-0bf6bd72d15d@caviumnetworks.com> References: <1477689302-32671-1-git-send-email-ddaney.cavm@gmail.com> <93b7769a-4907-176e-9f18-0bf6bd72d15d@caviumnetworks.com> From: Rob Herring Date: Thu, 10 Nov 2016 14:51:46 -0600 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [PATCH v2] of, numa: Return NUMA_NO_NODE from disable of_node_to_nid() if nid not possible. To: David Daney Cc: David Daney , "linux-kernel@vger.kernel.org" , Frank Rowand , "devicetree@vger.kernel.org" , "linux-arm-kernel@lists.infradead.org" , Will Deacon , Catalin Marinas , Robert Richter , Hanjun Guo , Ganapatrao Kulkarni , Gilbert Netzer , David Daney Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2766 Lines: 68 On Thu, Nov 3, 2016 at 10:11 AM, David Daney wrote: > On 11/02/2016 08:37 PM, Rob Herring wrote: >> >> On Fri, Oct 28, 2016 at 4:15 PM, David Daney >> wrote: >>> >>> From: David Daney >>> >>> On arm64 NUMA kernels we can pass "numa=off" on the command line to >>> disable NUMA. A side effect of this is that kmalloc_node() calls to >>> non-zero nodes will crash the system with an OOPS: >>> >>> [ 0.000000] ITS@0x0000901000020000: allocated 2097152 Devices >>> @10002000000 (flat, esz 8, psz 64K, shr 1) >>> [ 0.000000] Unable to handle kernel NULL pointer dereference at >>> virtual address 00001680 >>> [ 0.000000] pgd = fffffc0009470000 >>> [ 0.000000] [00001680] *pgd=0000010ffff90003, *pud=0000010ffff90003, >>> *pmd=0000010ffff90003, *pte=0000000000000000 >>> [ 0.000000] Internal error: Oops: 96000006 [#1] SMP >>> . >>> . >>> . >>> [ 0.000000] [] __alloc_pages_nodemask+0xa4/0xe68 >>> [ 0.000000] [] new_slab+0xd0/0x564 >>> [ 0.000000] [] ___slab_alloc+0x2e4/0x514 >>> [ 0.000000] [] __slab_alloc+0x48/0x58 >>> [ 0.000000] [] __kmalloc_node+0xd0/0x2dc >>> [ 0.000000] [] __irq_domain_add+0x7c/0x164 >>> [ 0.000000] [] its_probe+0x784/0x81c >>> [ 0.000000] [] its_init+0x48/0x1b0 >>> [ 0.000000] [] gic_init_bases+0x228/0x360 >>> [ 0.000000] [] gic_of_init+0x148/0x1cc >>> [ 0.000000] [] of_irq_init+0x184/0x298 >>> [ 0.000000] [] irqchip_init+0x14/0x38 >>> [ 0.000000] [] init_IRQ+0xc/0x30 >>> [ 0.000000] [] start_kernel+0x240/0x3b8 >>> [ 0.000000] [] __primary_switched+0x30/0x6c >>> [ 0.000000] Code: 912ec2a0 b9403809 0a0902fb 37b007db (f9400300) >>> . >>> . >>> . >>> >>> This is caused by code like this in kernel/irq/irqdomain.c >>> >>> domain = kzalloc_node(sizeof(*domain) + (sizeof(unsigned int) * >>> size), >>> GFP_KERNEL, of_node_to_nid(of_node)); >>> >>> When NUMA is disabled, the concept of a node is really undefined, so >>> of_node_to_nid() should unconditionally return NUMA_NO_NODE. >>> >>> Fix by returning NUMA_NO_NODE when the nid is not in the set of >>> possible nodes. >>> >>> Reported-by: Gilbert Netzer >>> Signed-off-by: David Daney >> >> >> Does this need to go in 4.9? > > > That would be my preference. Given how late this is now, my having nothing else for 4.9 and that his has never worked, I've applied for 4.10, but I did tag for stable. Rob