From: Andi Kleen
To: acme@kernel.org
Cc: jolsa@kernel.org, linux-kernel@vger.kernel.org
Subject: Optimize perf stat for large number of events/cpus
Date: Wed, 20 Nov 2019 16:15:10 -0800
Message-Id: <20191121001522.180827-1-andi@firstfloor.org>

[v8: Address review feedback. Only changes one patch.]

This patch kit optimizes perf stat for a large number of events
on systems with many CPUs and PMUs.

Some profiling shows that most of the overhead is in doing IPIs to
all the target CPUs. We can optimize this by using sched_setaffinity
to set the affinity to a target CPU once and then doing
the perf operation for all events on that CPU. This requires
some restructuring, but cuts the setup time quite a bit.

In theory we could go further by parallelizing these setups
too, but that would be much more complicated, and for now just batching it
per CPU seems to be sufficient. At some point, with many more cores,
parallelization or a better bulk perf setup API might be needed though.

In addition, perf does a lot of redundant /sys accesses with
many PMUs, which can also be expensive. This is also optimized.

On a large test case (>700 events with many weak groups) on a 94 CPU
system I go from

real    0m8.607s
user    0m0.550s
sys     0m8.041s

to

real    0m3.269s
user    0m0.760s
sys     0m1.694s

so shaving ~6 seconds of system time, at slightly more cost
in perf stat itself. On a 4 socket system the savings
are more dramatic:

real    0m15.641s
user    0m0.873s
sys     0m14.729s

to

real    0m4.493s
user    0m1.578s
sys     0m2.444s

so 11s difference in the user visible setup time.

Also available in

git://git.kernel.org/pub/scm/linux/kernel/git/ak/linux-misc perf/stat-scale-11

v1: Initial post.
v2: Rebase. Fix some minor issues.
v3: Rebase. Address review feedback. Fix one minor issue.
v4: Modified based on review feedback. Now it maintains
    all_cpus per evlist. There is still a need for cpu_index iteration
    to get the correct index for indexing the file descriptors.
    Fix bug with unsorted cpu maps, now they are always sorted.
    Some cleanups and refactoring.
v5: Split patches. Redo loop iteration again. Fix cpu map
    merging for uncore. Remove duplicates from cpumaps. Add unit
    tests.
v6: Address review feedback. Fix some bugs. Add more comments.
    Merge one invalid patch split.
v7: Address review feedback. Fix python scripting (thanks 0day).
    Minor updates.
v8: Address review feedback.

-Andi
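
For readers who want the per-CPU batching idea from the cover letter in
concrete form, here is a minimal sketch in Python. It is not the code from
the patches: open_all_batched, pin_to_cpu, open_fn and pin_fn are
hypothetical names, and open_fn merely stands in for whatever per-event
setup call is made on each CPU. The only real API used is
os.sched_setaffinity, which is Linux only.

```python
import os


def pin_to_cpu(cpu):
    """Pin the calling process to a single CPU (Linux only)."""
    os.sched_setaffinity(0, {cpu})


def open_all_batched(events, cpus, open_fn, pin_fn=pin_to_cpu):
    """Set up every (event, cpu) pair in CPU-major order.

    The affinity is changed once per CPU, and all events are then
    set up while running on that CPU. open_fn(event, cpu) is a
    hypothetical stand-in for the real per-event setup call.
    """
    handles = {}
    # Sort and de-duplicate the CPU list so each CPU is visited once.
    for cpu in sorted(set(cpus)):
        pin_fn(cpu)  # one affinity change per CPU
        for event in events:
            handles[(event, cpu)] = open_fn(event, cpu)
    return handles
```

With E events and C distinct CPUs, this does C affinity changes and E * C
setup calls, each issued from the CPU it targets. A caller would normally
save os.sched_getaffinity(0) beforehand and restore it afterwards.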