Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp624221rwd; Thu, 8 Jun 2023 05:46:57 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ5b8p0qBZRL8LdgYbFD3tAxeestNEdSAqoFfXjXbAE+lLvoXMWPX1eT56a+5otjhVTxK4OS X-Received: by 2002:a92:3204:0:b0:33d:6988:c017 with SMTP id z4-20020a923204000000b0033d6988c017mr8221402ile.18.1686228417322; Thu, 08 Jun 2023 05:46:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686228417; cv=none; d=google.com; s=arc-20160816; b=lqa5UvPHj1tdpbkbS3dgUG1NDkYHIp6qSCR30n49AFiZbjQLbu3S3tSht+z6c7+Msm kIIxsPTRqUK7VAjnOfb9aA/6maXfQiJuI6tIa/Ol751eDF43eM0cdv5f/AhTLwXPr9O3 eQlJFWCO9VsBgyVgNmQHNWaA6G6cLhRZnd8vdRv9nO64D8pp36hTk0udPhYyKKxm0dfs 0yGNRbDyqswuvwc/8v+0NciP1oqeNAaVuApxUndWgMEuAky8sMDRHrFUQIdIgYKcImv7 faADlfqPcfdta32NKTaPQ0frHu+OybFoP5kiNGMqA4njQeR2mQiyxl7faOc3jTrj2yQe AEsA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date; bh=SwQDS9F9WkUjw+lMP5ddNJHctH3cJqgy0vtliPl8vHM=; b=c5jKhTb1c/6MhNBwVJtdnVWiXQPK/Arb4ZSE/gw0UzVCkTSbUKahDe8CiIV0cqTM73 FZEFQS5/BsOex/dkH1PheEBbXAR434yAb6x1MzOidEioSEYGcPn9SjCUf5aX4lPH+XHA dKenDgO+lm4LOR+KhMVTCZiyQ8cTeoKcb+92bHztjsqTY1XJtCpP2skDmdCLHFcEET40 7wIuy9jDVHTfQiSXVdlnMJ8l086qEeqTgS0zrww3Wb4bU/kKplFSOd/9TK4fc7j7U/9s gcA9ikco/3bNBInZaYK2qVCqIqaqti2ape6d5ISftxnzNCfDlIZF2AkJcWwHFqgeuj4y YMuQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id w27-20020a63af1b000000b00543cac2783fsi918287pge.485.2023.06.08.05.46.45; Thu, 08 Jun 2023 05:46:57 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235911AbjFHMku (ORCPT + 99 others); Thu, 8 Jun 2023 08:40:50 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56446 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231626AbjFHMkt (ORCPT ); Thu, 8 Jun 2023 08:40:49 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id B61A726B3; Thu, 8 Jun 2023 05:40:46 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id AF8F31042; Thu, 8 Jun 2023 05:41:31 -0700 (PDT) Received: from FVFF77S0Q05N (unknown [10.57.24.103]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 895CD3F663; Thu, 8 Jun 2023 05:40:44 -0700 (PDT) Date: Thu, 8 Jun 2023 13:40:39 +0100 From: Mark Rutland To: Junhao He , will@kernel.org Cc: jonathan.cameron@huawei.com, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, linux-doc@vger.kernel.org, linuxarm@huawei.com, yangyicong@huawei.com, shenyang39@huawei.com, prime.zeng@hisilicon.com Subject: Re: [PATCH] drivers/perf: hisi: Don't migrate perf to the CPU going to teardown Message-ID: References: <20230608114326.27649-1-hejunhao3@huawei.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20230608114326.27649-1-hejunhao3@huawei.com> X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 08, 2023 at 07:43:26PM +0800, Junhao He wrote: > The driver needs to migrate the perf context if the current using CPU going > to teardown. By the time calling the cpuhp::teardown() callback the > cpu_online_mask() hasn't updated yet and still includes the CPU going to > teardown. In current driver's implementation we may migrate the context > to the teardown CPU and leads to the below calltrace: > > ... > [ 368.104662][ T932] task:cpuhp/0 state:D stack: 0 pid: 15 ppid: 2 flags:0x00000008 > [ 368.113699][ T932] Call trace: > [ 368.116834][ T932] __switch_to+0x7c/0xbc > [ 368.120924][ T932] __schedule+0x338/0x6f0 > [ 368.125098][ T932] schedule+0x50/0xe0 > [ 368.128926][ T932] schedule_preempt_disabled+0x18/0x24 > [ 368.134229][ T932] __mutex_lock.constprop.0+0x1d4/0x5dc > [ 368.139617][ T932] __mutex_lock_slowpath+0x1c/0x30 > [ 368.144573][ T932] mutex_lock+0x50/0x60 > [ 368.148579][ T932] perf_pmu_migrate_context+0x84/0x2b0 > [ 368.153884][ T932] hisi_pcie_pmu_offline_cpu+0x90/0xe0 [hisi_pcie_pmu] > [ 368.160579][ T932] cpuhp_invoke_callback+0x2a0/0x650 > [ 368.165707][ T932] cpuhp_thread_fun+0xe4/0x190 > [ 368.170316][ T932] smpboot_thread_fn+0x15c/0x1a0 > [ 368.175099][ T932] kthread+0x108/0x13c > [ 368.179012][ T932] ret_from_fork+0x10/0x18 > ... > > Use function cpumask_any_but() to find one correct active cpu to fixes > this issue. > > Fixes: 8404b0fbc7fb ("drivers/perf: hisi: Add driver for HiSilicon PCIe PMU") > Signed-off-by: Junhao He Acked-by: Mark Rutland I assume that Will can pick this up. I did a quick check, and all other perf drivers seem to do the right thing here, either using cpumask_any_but(), or generating a temporary mask with the cpu being offlined removed. Mark. > --- > drivers/perf/hisilicon/hisi_pcie_pmu.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/drivers/perf/hisilicon/hisi_pcie_pmu.c b/drivers/perf/hisilicon/hisi_pcie_pmu.c > index 0bc8dc36aff5..14f8b4b03337 100644 > --- a/drivers/perf/hisilicon/hisi_pcie_pmu.c > +++ b/drivers/perf/hisilicon/hisi_pcie_pmu.c > @@ -683,7 +683,7 @@ static int hisi_pcie_pmu_offline_cpu(unsigned int cpu, struct hlist_node *node) > > pcie_pmu->on_cpu = -1; > /* Choose a new CPU from all online cpus. */ > - target = cpumask_first(cpu_online_mask); > + target = cpumask_any_but(cpu_online_mask, cpu); > if (target >= nr_cpu_ids) { > pci_err(pcie_pmu->pdev, "There is no CPU to set\n"); > return 0; > -- > 2.30.0 >