Received: by 2002:a05:6a10:2726:0:0:0:0 with SMTP id ib38csp693166pxb; Tue, 5 Apr 2022 19:02:52 -0700 (PDT) X-Google-Smtp-Source: ABdhPJx32n2Hldz/FDtbqjDpFzP5eCGiJcJg7WDU9x+UbhNwoRyJlitC7EZ5yqk0aC4E3ZiH5Wbn X-Received: by 2002:a17:903:2283:b0:154:613:eb14 with SMTP id b3-20020a170903228300b001540613eb14mr6320959plh.32.1649210572635; Tue, 05 Apr 2022 19:02:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1649210572; cv=none; d=google.com; s=arc-20160816; b=UIAW0dYhGZS5zxlU8zS9YpIh48wRUzCuSpGD1+VwXgxYu6eL+PdcuqLVFHDVEQtSd2 n5L9KZpjRapDju0HTQBX1fvgs43w4eJG/XPM9hHwRGReFpuyF7/OFnH0oFAW/apJESZP hMg8kBB5W2vA/WxKKN/TXPA6Tp+qJhqeDeNMGmu8jMTpGiWK1xxSSJ1PCVjCEy99Z+Ne GeJwccpLmx5hupVCoMyFYQI0sx0gNV5JNlHXahBghA2DaRr0C8O/klrZsOrRsWcdJdYg c09I7YCNKhut30u1hgOhbVJeSDAuLQ7W911CVwj5ndWBWWnlFONKBWrvgsXBgH0NF/O6 41ew== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=+tyEDbz84DInPDAz28FniCUoVJfTWROvhKa7L49cjAs=; b=VYpHFApe4De2ZepaOZkYDIIgNIr2rup86UQEeIqu3dm+DaUwtv6eiah/sg3U9z5JTK HLGLtiJ49qJDXfT0sGYckau4hp9jsieWHf8gDRpJLFnPrqx8ZeHxDCUJpkZo7DBHzqzV ML0UAeVs69lGoEJfrqj6sKpCzq5HHo+jO0MyiMo0w4wF8L4G2r4zpkuFLgCdqkSbPkQ7 9hxbbqXNPJfF6iu881qmR2dqS+JTV4doihgwie0Ht1JJen+btm3s21w0uT9b6LpQfDwb fgFH2cSrTAJ+SVI/Ud2eCbMHCYx3nm8gDFww+OnQcKvrTLIBb++F4OYlKzqK4YefqgWV 7gKA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b="JcXB/rHg"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id v17-20020a056a00149100b004fa6cbcbbf0si15468311pfu.235.2022.04.05.19.02.36; Tue, 05 Apr 2022 19:02:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linuxfoundation.org header.s=korg header.b="JcXB/rHg"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linuxfoundation.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1357786AbiDENI5 (ORCPT + 99 others); Tue, 5 Apr 2022 09:08:57 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60606 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344119AbiDEJSS (ORCPT ); Tue, 5 Apr 2022 05:18:18 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [139.178.84.217]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D7A4137A3E; Tue, 5 Apr 2022 02:04:14 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 692A961564; Tue, 5 Apr 2022 09:04:14 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 42FDDC385A0; Tue, 5 Apr 2022 09:04:13 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=linuxfoundation.org; s=korg; t=1649149453; bh=Kez2sL3RMdmqfeeAkd8I0EFnzgYorCR1t5sLLVWdCdc=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=JcXB/rHgU7sSNrJUWvozvhZqQWwVpfeiHW5DgtCiDXVpN12SvwnI2/aIwKbehUV0B M+m5DO4rpuB/jiLJ3OXSYPjnQnZYYoNaq3EyB9wERrOPo7CGpCP5A2gG8rSNJyvx97 RRzSJuyF/GTfZhOXBdjBTQrqYyH7QCgXTksyWGa0= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Thomas Richter , Jiri Olsa , Namhyung Kim , Sumanth Korikkar , Heiko Carstens , Sven Schnelle , Vasily Gorbik , Arnaldo Carvalho de Melo , Sasha Levin Subject: [PATCH 5.16 0721/1017] perf stat: Fix forked applications enablement of counters Date: Tue, 5 Apr 2022 09:27:14 +0200 Message-Id: <20220405070415.668585336@linuxfoundation.org> X-Mailer: git-send-email 2.35.1 In-Reply-To: <20220405070354.155796697@linuxfoundation.org> References: <20220405070354.155796697@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Thomas Richter [ Upstream commit d0a0a511493d269514fcbd852481cdca32c95350 ] I have run into the following issue: # perf stat -a -e new_pmu/INSTRUCTION_7/ -- mytest -c1 7 Performance counter stats for 'system wide': 0 new_pmu/INSTRUCTION_7/ 0.000366428 seconds time elapsed # The new PMU for s390 counts the execution of certain CPU instructions. The root cause is the extremely small run time of the mytest program. It just executes some assembly instructions and then exits. In above invocation the instruction is executed exactly one time (-c1 option). The PMU is expected to report this one time execution by a counter value of one, but fails to do so in some cases, not all. Debugging reveals the invocation of the child process is done *before* the counter events are installed and enabled. Tracing reveals that sometimes the child process starts and exits before the event is installed on all CPUs. The more CPUs the machine has, the more often this miscount happens. Fix this by reversing the start of the work load after the events have been installed on the specified CPUs. Now the comment also matches the code. Output after: # perf stat -a -e new_pmu/INSTRUCTION_7/ -- mytest -c1 7 Performance counter stats for 'system wide': 1 new_pmu/INSTRUCTION_7/ 0.000366428 seconds time elapsed # Now the correct result is reported rock solid all the time regardless how many CPUs are online. Reviewers notes: Jiri: Right, without -a the event has enable_on_exec so the race does not matter, but it's a problem for system wide with fork. Namhyung: Agreed. Also we may move the enable_counters() and the clock code out of the if block to be shared with the else block. Fixes: acf2892270dcc428 ("perf stat: Use perf_evlist__prepare/start_workload()") Signed-off-by: Thomas Richter Acked-by: Jiri Olsa Acked-by: Namhyung Kim Acked-by: Sumanth Korikkar Cc: Heiko Carstens Cc: Sven Schnelle Cc: Vasily Gorbik Link: https://lore.kernel.org/r/20220317155346.577384-1-tmricht@linux.ibm.com Signed-off-by: Arnaldo Carvalho de Melo Signed-off-by: Sasha Levin --- tools/perf/builtin-stat.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/tools/perf/builtin-stat.c b/tools/perf/builtin-stat.c index 7974933dbc77..08c024038ca7 100644 --- a/tools/perf/builtin-stat.c +++ b/tools/perf/builtin-stat.c @@ -956,10 +956,10 @@ static int __run_perf_stat(int argc, const char **argv, int run_idx) * Enable counters and exec the command: */ if (forks) { - evlist__start_workload(evsel_list); err = enable_counters(); if (err) return -1; + evlist__start_workload(evsel_list); t0 = rdclock(); clock_gettime(CLOCK_MONOTONIC, &ref_time); -- 2.34.1