Date: Thu, 14 Jul 2022 18:52:49 +0100
From: Ionela Voinescu
To: Sudeep Holla
Cc: linux-kernel@vger.kernel.org, conor.dooley@microchip.com,
    Greg Kroah-Hartman, Vincent Guittot, Dietmar Eggemann, Pierre Gondois,
    linux-arm-kernel@lists.infradead.org, linux-riscv@lists.infradead.org
Subject: Re: [PATCH -next] arch_topology: Fix cache attributes detection in the CPU hotplug path
References: <20220713133344.1201247-1-sudeep.holla@arm.com>
In-Reply-To: <20220713133344.1201247-1-sudeep.holla@arm.com>
X-Mailing-List: linux-kernel@vger.kernel.org

Hi Sudeep,

Thank you for the fix!

On Wednesday 13 Jul 2022 at 14:33:44 (+0100), Sudeep Holla wrote:
> init_cpu_topology() is called only once at the boot and all the cache
> attributes are detected early for all the possible CPUs. However when
> the CPUs are hotplugged out, the cacheinfo gets removed. While the
> attributes are added back when the CPUs are hotplugged back in as part
> of CPU hotplug state machine, it ends up called quite late after the
> update_siblings_masks() are called in the secondary_start_kernel()
> resulting in wrong llc_sibling_masks.
>
> Move the call to detect_cache_attributes() inside update_siblings_masks()
> to ensure the cacheinfo is updated before the LLC sibling masks are
> updated. This will fix the incorrect LLC sibling masks generated when
> the CPUs are hotplugged out and hotplugged back in again.
>
> Reported-by: Ionela Voinescu
> Signed-off-by: Sudeep Holla
> ---
>  drivers/base/arch_topology.c | 16 ++++++----------
>  1 file changed, 6 insertions(+), 10 deletions(-)
>
> Hi Conor,
>
> Ionela reported an issue with the CPU hotplug and as a fix I need to
> move the call to detect_cache_attributes() which I had thought to keep
> it there from first but for no reason had moved it to init_cpu_topology().
>
> Wonder if this fixes the -ENOMEM on RISC-V as this one is called on the
> cpu in the secondary CPUs init path while init_cpu_topology executed
> detect_cache_attributes() for all possible CPUs much earlier. I think
> this might help as the percpu memory might be initialised in this case.
>
> Anyways give this a try, also test the CPU hotplug and check if nothing
> is broken on RISC-V. We noticed this bug only on one platform while
>
> Regards,
> Sudeep
>
> diff --git a/drivers/base/arch_topology.c b/drivers/base/arch_topology.c
> index 441e14ac33a4..0424b59b695e 100644
> --- a/drivers/base/arch_topology.c
> +++ b/drivers/base/arch_topology.c
> @@ -732,7 +732,11 @@ const struct cpumask *cpu_clustergroup_mask(int cpu)
>  void update_siblings_masks(unsigned int cpuid)
>  {
>  	struct cpu_topology *cpu_topo, *cpuid_topo = &cpu_topology[cpuid];
> -	int cpu;
> +	int cpu, ret;
> +
> +	ret = detect_cache_attributes(cpuid);
> +	if (ret)
> +		pr_info("Early cacheinfo failed, ret = %d\n", ret);
>
>  	/* update core and thread sibling masks */
>  	for_each_online_cpu(cpu) {
> @@ -821,7 +825,7 @@ __weak int __init parse_acpi_topology(void)
>  #if defined(CONFIG_ARM64) || defined(CONFIG_RISCV)
>  void __init init_cpu_topology(void)
>  {
> -	int ret, cpu;
> +	int ret;
>
>  	reset_cpu_topology();
>  	ret = parse_acpi_topology();
> @@ -836,13 +840,5 @@ void __init init_cpu_topology(void)
>  		reset_cpu_topology();
>  		return;
>  	}
> -
> -	for_each_possible_cpu(cpu) {
> -		ret = detect_cache_attributes(cpu);
> -		if (ret) {
> -			pr_info("Early cacheinfo failed, ret = %d\n", ret);
> -			break;
> -		}
> -	}
>  }
>  #endif
> --
> 2.37.1
>

Reviewed-by: Ionela Voinescu
Tested-by: Ionela Voinescu

Kind regards,
Ionela.
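
For readers of the archive, the ordering problem described in the quoted commit message can be sketched with a toy userspace model. This is not kernel code and makes no claim about the real implementation: every toy_* name, the four-CPU layout, and the assumption that CPUs 0-1 and CPUs 2-3 each share a last-level cache are hypothetical, chosen only for illustration. The model assumes that a CPU without valid cacheinfo cannot be matched as an LLC sibling, so a replugged CPU whose cacheinfo is restored only after its masks are rebuilt ends up alone in its own mask.

```c
#include <assert.h>
#include <stdbool.h>

#define TOY_NR_CPUS 4

/* Hypothetical layout: CPUs 0-1 share one LLC, CPUs 2-3 share another. */
static const int toy_llc_id[TOY_NR_CPUS] = { 0, 0, 1, 1 };
static bool toy_cacheinfo_valid[TOY_NR_CPUS];
static bool toy_online[TOY_NR_CPUS];
static unsigned int toy_llc_sibling[TOY_NR_CPUS];	/* one bit per CPU */

/* Stand-in for cache attribute detection: marks cacheinfo as present. */
static void toy_detect_cache_attributes(int cpu)
{
	toy_cacheinfo_valid[cpu] = true;
}

/* Two CPUs can only be matched when both have valid cacheinfo. */
static bool toy_llc_is_shared(int a, int b)
{
	return toy_cacheinfo_valid[a] && toy_cacheinfo_valid[b] &&
	       toy_llc_id[a] == toy_llc_id[b];
}

/* detect_first models the patched ordering: detect, then build masks. */
static void toy_update_siblings_masks(int cpuid, bool detect_first)
{
	int cpu;

	if (detect_first)
		toy_detect_cache_attributes(cpuid);

	toy_online[cpuid] = true;
	for (cpu = 0; cpu < TOY_NR_CPUS; cpu++) {
		if (!toy_online[cpu] || !toy_llc_is_shared(cpu, cpuid))
			continue;
		toy_llc_sibling[cpuid] |= 1u << cpu;
		toy_llc_sibling[cpu] |= 1u << cpuid;
	}
}

/* Hotplug out: cacheinfo is dropped and the CPU leaves all masks. */
static void toy_hotplug_out(int cpuid)
{
	int cpu;

	toy_online[cpuid] = false;
	toy_cacheinfo_valid[cpuid] = false;
	for (cpu = 0; cpu < TOY_NR_CPUS; cpu++)
		toy_llc_sibling[cpu] &= ~(1u << cpuid);
	toy_llc_sibling[cpuid] = 1u << cpuid;	/* only itself remains */
}

/* Boot all CPUs, unplug CPU 1, plug it back in, return its LLC mask. */
static unsigned int toy_replug_cpu1(bool detect_first)
{
	int cpu;

	for (cpu = 0; cpu < TOY_NR_CPUS; cpu++) {
		toy_online[cpu] = false;
		toy_cacheinfo_valid[cpu] = false;
		toy_llc_sibling[cpu] = 1u << cpu;
	}

	/* Boot: cacheinfo is detected for every CPU before masks are built. */
	for (cpu = 0; cpu < TOY_NR_CPUS; cpu++)
		toy_detect_cache_attributes(cpu);
	for (cpu = 0; cpu < TOY_NR_CPUS; cpu++)
		toy_update_siblings_masks(cpu, false);

	toy_hotplug_out(1);
	toy_update_siblings_masks(1, detect_first);
	toy_detect_cache_attributes(1);	/* late hotplug step, too late to help */

	return toy_llc_sibling[1];
}

/* Checks both orderings in the toy model. */
void toy_self_check(void)
{
	assert(toy_replug_cpu1(false) == 0x2u);	/* CPU 1 sees only itself */
	assert(toy_replug_cpu1(true) == 0x3u);	/* CPU 1 sees CPUs 0 and 1 */
}
```

In this model, calling toy_replug_cpu1(false) leaves CPU 1 with a mask containing only itself, while toy_replug_cpu1(true) restores the shared mask with CPU 0. That mirrors the reasoning for moving the detection call ahead of the mask update.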