Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp669667imm; Wed, 20 Jun 2018 04:53:04 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKLNmmJEyAJWIL6CiQ/adRbb2yz+9CYM8KVI/1XU8n0MoCNqKqJKQcDjYboGEgFC0jmhTfv X-Received: by 2002:a62:660a:: with SMTP id a10-v6mr22455867pfc.156.1529495584380; Wed, 20 Jun 2018 04:53:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1529495584; cv=none; d=google.com; s=arc-20160816; b=jkDvbWzDaCZHFJ8/6KkdupNXf50SZMw23nIeH/0gMvjYBYik5/XCyjrA75THxjo38t PN6BGNvOCa0memMYbqegyRAdfj4MbPUpUvMOq+LVb7jIwNakvcS0dmG7d9gxc8NFcSeC WBibmalydTDz4jreqMh3814BL+0TXasrtIDj+iuMNHWAivnNIj0TY1L3/ecF1c7ECJeQ ZdQPDFOcd86rwuMKptACzC8tavlTAilbyCZfsOsTME86629oISQzGze+VUKt5tijEeVJ Nx+wGOsrTqwEF/SPbJBPyfe+eLOCB+rS1pRQnuffllZTSAzf21AiJ5NESiguBtJ8fE2T G8UA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:message-id :in-reply-to:date:references:subject:cc:to:from :arc-authentication-results; bh=JJ3b3SmtMDF4UMM1ZuKiBYGjI1izYO+74WnPWcgn7zQ=; b=xliRgp1LCrgG8a+Ayti4NqkiWz2oLKM4UjhRnrLgEefZ6THLAJM9SzPgqXDyYlIk0F U1+ZYNrCTb4XrUBotVjQZrpRHN7UZPRwnxcii/0t7egfmB6ZDoV95BtuBAD7FWiXUc04 jV3BrKs84RRQO+qZdgd++EjqrPQ+ggU96LCiv/JtM08JxJzxmiRng/yGKDbzuLwcWNM8 97wfgjVR04oZJThGQB5Ja4CokNmNuwjWbeS/WTAlqA2/MielnkHjYgYTMpqK1/v28rc9 nd1uVDfCPAOx6ckd5MaNGnkrOMcZPNO1uRVFqVJcG+0DnjVpjmoTnvfwd5YtsVPHEeay 7OXw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f3-v6si1785821pgp.496.2018.06.20.04.52.36; Wed, 20 Jun 2018 04:53:04 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753972AbeFTLv0 (ORCPT + 99 others); Wed, 20 Jun 2018 07:51:26 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:35872 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752827AbeFTLvY (ORCPT ); Wed, 20 Jun 2018 07:51:24 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 52C3D1596; Wed, 20 Jun 2018 04:51:24 -0700 (PDT) Received: from localhost (e105922-lin.cambridge.arm.com [10.1.206.33]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id C37093F557; Wed, 20 Jun 2018 04:51:23 -0700 (PDT) From: Punit Agrawal To: Xie XiuQi Cc: Lorenzo Pieralisi , Michal Hocko , Hanjun Guo , Bjorn Helgaas , , , Catalin Marinas , "Rafael J. Wysocki" , Will Deacon , Linux Kernel Mailing List , Jarkko Sakkinen , , , Greg Kroah-Hartman , Bjorn Helgaas , Andrew Morton , zhongjiang , linux-arm Subject: Re: [PATCH 1/2] arm64: avoid alloc memory on offline node References: <20180611145330.GO13364@dhcp22.suse.cz> <87lgbk59gs.fsf@e105922-lin.cambridge.arm.com> <87bmce60y3.fsf@e105922-lin.cambridge.arm.com> <8b715082-14d4-f10b-d2d6-b23be7e4bf7e@huawei.com> <20180619120714.GE13685@dhcp22.suse.cz> <874lhz3pmn.fsf@e105922-lin.cambridge.arm.com> <20180619140818.GA16927@e107981-ln.cambridge.arm.com> <87wouu3jz1.fsf@e105922-lin.cambridge.arm.com> <20180619151425.GH13685@dhcp22.suse.cz> <87r2l23i2b.fsf@e105922-lin.cambridge.arm.com> <20180619163256.GA18952@e107981-ln.cambridge.arm.com> <814205eb-ae86-a519-bed0-f09b8e2d3a02@huawei.com> Date: Wed, 20 Jun 2018 12:51:22 +0100 In-Reply-To: <814205eb-ae86-a519-bed0-f09b8e2d3a02@huawei.com> (Xie XiuQi's message of "Wed, 20 Jun 2018 11:31:34 +0800") Message-ID: <87602d3ccl.fsf@e105922-lin.cambridge.arm.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/25.2 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Xie XiuQi writes: > Hi Lorenzo, Punit, > > > On 2018/6/20 0:32, Lorenzo Pieralisi wrote: >> On Tue, Jun 19, 2018 at 04:35:40PM +0100, Punit Agrawal wrote: >>> Michal Hocko writes: >>> >>>> On Tue 19-06-18 15:54:26, Punit Agrawal wrote: >>>> [...] >>>>> In terms of $SUBJECT, I wonder if it's worth taking the original patch >>>>> as a temporary fix (it'll also be easier to backport) while we work on >>>>> fixing these other issues and enabling memoryless nodes. >>>> >>>> Well, x86 already does that but copying this antipatern is not really >>>> nice. So it is good as a quick fix but it would be definitely much >>>> better to have a robust fix. Who knows how many other places might hit >>>> this. You certainly do not want to add a hack like this all over... >>> >>> Completely agree! I was only suggesting it as a temporary measure, >>> especially as it looked like a proper fix might be invasive. >>> >>> Another fix might be to change the node specific allocation to node >>> agnostic allocations. It isn't clear why the allocation is being >>> requested from a specific node. I think Lorenzo suggested this in one of >>> the threads. >> >> I think that code was just copypasted but it is better to fix the >> underlying issue. >> >>> I've started putting together a set fixing the issues identified in this >>> thread. It should give a better idea on the best course of action. >> >> On ACPI ARM64, this diff should do if I read the code correctly, it >> should be (famous last words) just a matter of mapping PXMs to nodes for >> every SRAT GICC entry, feel free to pick it up if it works. >> >> Yes, we can take the original patch just because it is safer for an -rc >> cycle even though if the patch below would do delaying the fix for a >> couple of -rc (to get it tested across ACPI ARM64 NUMA platforms) is >> not a disaster. > > I tested this patch on my arm board, it works. I am assuming you tried the patch without enabling support for memory-less nodes. The patch de-couples the onlining of numa nodes (as parsed from SRAT) from NR_CPUS restriction. When it comes to building zonelists, the node referenced by the PCI controller also has zonelists initialised. So it looks like a fallback node is setup even if we don't have memory-less nodes enabled. I need to stare some more at the code to see why we need memory-less nodes at all then ...