Received: by 10.192.165.148 with SMTP id m20csp854571imm; Thu, 10 May 2018 01:32:11 -0700 (PDT) X-Google-Smtp-Source: AB8JxZqf3LBY25wn2agden4RsauKosS7YfKGZ0dVop3Ntx5LhH2Lt3pvPjiMOIqUztGnNuArs6zI X-Received: by 2002:a65:498e:: with SMTP id r14-v6mr400294pgs.78.1525941131696; Thu, 10 May 2018 01:32:11 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525941131; cv=none; d=google.com; s=arc-20160816; b=G3WbemfVdQFDbud2h9KSnVh0pXG+o3uznyQiFLo7n9tgib0uJH3G6RkXGEubmsiC1O ooVoVDCjnWtkMwGBHPBcguf6hy/T9YSYkXK4Ocd7l4JJibIHI1522DeAIQnBr3LOfpaQ gktPctegwIYfWw7KOYEQQ9LPRKY4BWoUI1qE+YCbIth3evPxPUrjlTnlr5lc5Z6RrCZk cRP4p9oDTTokBsIuctqntMdPOzKsbmHUN0ItDUHG8WjYOE+4KN6izlULxmaNTx9qjJYv rsWT2LGG5EX66MpQxXhrAuCJBAcdpl6ObPD8BmUi5+/hLZTcMhgzHJVR0Xp6plzHEcVK /2yg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=JdYifIXAaS8ovfqro8catLnru82xvVyn35lMJTqNGYw=; b=Tnl5RQ000Myy+JmjT5wyJ+mk1JBRGrjmWGi3JpQIXO20+IcPFIvKtQ0caMEA3lGxqe BPoy6ywOf8apXd5zx14Id3iaXTsN8qk5vEvW4V3x1HndnBbnaVHk5PVy1zoklzyGRSHK i4A6cp6X+hWlpn9oifuovJIQltOakXISLzEO2LddOLB4266hoVK3YLEEpQEw75qZQhiX uziRJ5O3rpGwcGXs9VrVg2SvC7RH6Fts1la9opSyqFMpO2YfjlzR/VwcI1HzccrA/koS 0dhengBkct7Qh4l2MWiDuso3TQYpzEohG3Z3EoYDpvOHU2kB/Q8fuRqIKL7NgELLd1ai FQdg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u8-v6si282036plm.134.2018.05.10.01.31.45; Thu, 10 May 2018 01:32:11 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756874AbeEJIW0 (ORCPT + 99 others); Thu, 10 May 2018 04:22:26 -0400 Received: from mx2.suse.de ([195.135.220.15]:52996 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751313AbeEJIWW (ORCPT ); Thu, 10 May 2018 04:22:22 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay2.suse.de (charybdis-ext.suse.de [195.135.220.254]) by mx2.suse.de (Postfix) with ESMTP id A6AC3AC81; Thu, 10 May 2018 08:22:20 +0000 (UTC) Date: Thu, 10 May 2018 10:22:19 +0200 From: Michal Hocko To: Ganapatrao Kulkarni Cc: Ganapatrao Kulkarni , Tony Luck , "Rafael J. Wysocki" , tiantao6@huawei.com, LKML , linux-acpi@vger.kernel.org Subject: Re: fd3e45436660 ("ACPI / NUMA: ia64: Parse all entries of SRAT memory affinity table") Message-ID: <20180510082219.GH32366@dhcp22.suse.cz> References: <20180411104832.GF23400@dhcp22.suse.cz> <20180509122454.GR32366@dhcp22.suse.cz> <20180509125611.GT32366@dhcp22.suse.cz> <20180510073054.GD32366@dhcp22.suse.cz> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.5 (2018-04-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu 10-05-18 13:36:11, Ganapatrao Kulkarni wrote: > On Thu, May 10, 2018 at 1:00 PM, Michal Hocko wrote: > > On Thu 10-05-18 08:27:35, Ganapatrao Kulkarni wrote: > >> On Wed, May 9, 2018 at 6:26 PM, Michal Hocko wrote: > >> > On Wed 09-05-18 18:07:16, Ganapatrao Kulkarni wrote: > >> >> Hi Michal > >> >> > >> >> > >> >> On Wed, May 9, 2018 at 5:54 PM, Michal Hocko wrote: > >> >> > On Wed 11-04-18 12:48:32, Michal Hocko wrote: > >> >> >> Hi, > >> >> >> my attention was brought to the %subj commit and either I am missing > >> >> >> something or the patch is quite dubious. What is it actually trying to > >> >> >> fix? If a BIOS/FW provides more memblocks than the limit then we would > >> >> >> get misleading numa topology (numactl -H output) but is the situation > >> >> >> much better with it applied? Numa init code will refuse to init more > >> >> >> memblocks than the limit and falls back to dummy_numa_init (AFAICS) > >> >> >> which will break the topology again and numactl -H will have a > >> >> >> misleading output anyway. > >> >> > >> >> IIRC, the MEMBLOCK beyond max limit getting dropped from visible > >> >> memory(partial drop from a node). > >> >> this patch removed any upper limit on memblocks and allowed to parse > >> >> all entries of SRAT. > >> > > >> > Yeah I've understood that much. My question is, however, why do we care > >> > about parsing the NUMA topology when we fallback into a single NUMA node > >> > anyway? Or do I misunderstand the code? I do not have any platform with > >> > that many memblocks. > >> > >> IMHO, this fix is very much logical by removing the SRAT parsing restriction. > >> below is the crash log which made us to debug and eventually fix with > >> this patch. > > > > Ohh, I am not saying that the current code handles too many memblocks > > correctly. I just think that your fix is not correct or incomplete at > > least. Assuming that my understanding is correct which you haven't > > disputed yet. So can we focus on the proper solution now? Do we actually > > need the memblock restrictions? We do not need those for reserved > > memblocks so I do not see any real reason to simply remove the > > restriction altogether. Have you explored that option? > > my logic was simple, when i added this patch, when the cap on max > memblocks is arch specific, why to restrict SRAT parsing which is not > arch specific. Because there are other parts which are still restricted and that we want to have all parts in sync. > other way around argument is, why the restriction added in the first > place itself! Exactly. So have you explored that path? -- Michal Hocko SUSE Labs