Received: by 2002:a05:6358:11c7:b0:104:8066:f915 with SMTP id i7csp2525457rwl; Thu, 13 Apr 2023 07:35:26 -0700 (PDT) X-Google-Smtp-Source: AKy350be3MSzdf691dL8QDCGAntAP8+8gb+7kL7jx2DmQ2T6aGy5B0qs4CfXoZIxGlCmOoMCqDMc X-Received: by 2002:a17:907:590:b0:94e:c306:f42f with SMTP id vw16-20020a170907059000b0094ec306f42fmr688327ejb.16.1681396526115; Thu, 13 Apr 2023 07:35:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1681396526; cv=none; d=google.com; s=arc-20160816; b=GuBQlfO3aHH620RGZYN6oIS9Z2JbJbretlWTGx5/peEKwvKi5b7NhMUpZJNh7tQ8uw Cy9+7sYqtJWuNfUSIwJ/416zJm9v36tzIB4WUe73k1XRbJ/zLgA+9A5BC50pZ9jvE8jD eeBK+pWRqAxpweqjfIRR22Iav7QRfq4ntZqm/UByNte9B5fqRy9DnDQbBa4k3Ckudxk7 lINE4ZbBgl6TF4Iby0XZYUywk0Mixq+Ng3IjejZG4Bpg0pws41LDRd8ZP2lSyeGFSlFm gHwmwHzUzyrE11ayeratwLhI54fk2kj2ssxhkuFyWl6+SwRs5fpDs+t8m6ml8whWLio+ 10mg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=6GTRxEV3PlemGSNWeoIbhEeBeXMH9r7Pkx00KqKxH/A=; b=jpzFQy9fV+ltpOXyC4WLgnmdv/jCD8XXkUrP7LIu+s6eWLDKNnyjaOzO/AkDBFW6Z3 sKwjeW4WxrqlW3oX4ZTQAusy6NYlcbZKeGpBJjcRbTrkPyboidzIj2WSRetlzlXsmToe b3n43am+VNP3pdXv+7NN2OUlCb5GribLuStiA+MDlLePQlnMpM3FItxqW6S6OWm+LiBd JoOYNlfbb+/TszNJo4n3c2SGbizLtLad4Az2jUDi2kVn7zI68JRCxYxeVXSea6VgLUu3 CFkWQnf66oz+hd2VKMhdO4wyc6y5zOxAw6Z3T2xarljv0AwOkuNGvMblNFF3+WFmr9Fn 4KEw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id jx21-20020a170906ca5500b0092fb40179efsi2174979ejb.733.2023.04.13.07.34.52; Thu, 13 Apr 2023 07:35:26 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231414AbjDMO0X (ORCPT + 99 others); Thu, 13 Apr 2023 10:26:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:55798 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231217AbjDMO0W (ORCPT ); Thu, 13 Apr 2023 10:26:22 -0400 Received: from frasgout.his.huawei.com (frasgout.his.huawei.com [185.176.79.56]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 07AEA9EE0; Thu, 13 Apr 2023 07:26:19 -0700 (PDT) Received: from lhrpeml500005.china.huawei.com (unknown [172.18.147.201]) by frasgout.his.huawei.com (SkyGuard) with ESMTP id 4Py1ww03HTz67njv; Thu, 13 Apr 2023 22:21:51 +0800 (CST) Received: from SecurePC-101-06.china.huawei.com (10.122.247.231) by lhrpeml500005.china.huawei.com (7.191.163.240) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2507.23; Thu, 13 Apr 2023 15:26:16 +0100 From: Jonathan Cameron To: Liang Kan , , , , CC: , , , , , , Davidlohr Bueso , Dave Jiang Subject: [PATCH v6 0/5] perf: CXL 3.0 Performance Monitoring Unit support Date: Thu, 13 Apr 2023 15:26:12 +0100 Message-ID: <20230413142617.15995-1-Jonathan.Cameron@huawei.com> X-Mailer: git-send-email 2.37.2 MIME-Version: 1.0 Content-Transfer-Encoding: 7BIT Content-Type: text/plain; charset=US-ASCII X-Originating-IP: [10.122.247.231] X-ClientProxiedBy: lhrpeml500002.china.huawei.com (7.191.160.78) To lhrpeml500005.china.huawei.com (7.191.163.240) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_MED, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Was CXL 3.0 Performance Monitoring Unit support. Renamed to highlight the driver has moved to drivers/perf Dan Williams has expressed that he is happy to take this through the CXL tree once perf reviewers are happy with the perf aspects. Thanks to Kan and Dan for reviews. Changes since v5: - Fixed outdated example in patch 4 description. - Gathered tags. Major changes since v4: (Smaller changes described in individual patches) - Move driver from drivers/cxl/cpmu.c to drivers/perf/cxl_pmu.c - Rename devices to associate with cxl/devices/memX etc. Scheme extends to CPMU instances on Upstream and Downstream CXL ports. - Rename CXL core files to simply pmu.* and simplify code structure naming and similar to cxl_pmu* CXL_PMU* etc Patch 1/5 is also present in the series: [PATCH 00/32] Add parents to struct pmu -> dev which may merge first. Update introduction. The CXL rev 3.0 specification introduces a CXL Performance Monitoring Unit definition. CXL components may have any number of these blocks. The definition is highly flexible, but that does bring complexity in the driver. All instances are self describing, though for vendor defined events we expect mapping from numeric values to counter names to be performed in userspace tooling. In common with many CXL features, this driver precedes any announced hardware (that I'm aware of anyway!). Supported features are: - Devices that allow counters to be written when frozen (allows a single register write to start / stop all counters). - Fixed purpose counters - Configurable counters - CXL specification defined events + HDM filters. This initial series covers a useful subset of functionality and is expected to be followed by patch series addressing: * Discoverability beyond fine grained events. This will cover both vendor defined events and providing perf tool with the visibility to be able construct 'summed' events. For example if a single counter can cover all host to device read traffic. Perf tool patches will use this information and appropriate schema to present a richer set of countable events. * CXL PMU instances on Upstream and Downstream CXL switch ports and root ports. * Free running counters. Often used for vendor defined debug type events and error counters. Likely to appear on real devices. * Devices without interrupt support for overflow. * Devices that don't support freeze (counters need to be enabled individually). Exact priority order for the above features will depend on early hardware though (a) will definitely be top of the list as any likely hardware will be able to take advantage of that feature. Notes. 1) The QEMU model against which this was developed needs tidying up. Latest tree at https://gitlab.com/jic23/qemu branch cxl-2023-02-28 It's backed up behind other series that I plan to upstream first. 2) V1 led to a discussion of how to handle the self describing and extensible nature of the CPMU counters. That is likely to involve a description in the "caps" sysfs directory and perf tool code that is aware of the different event combinations that make sense and can establish which sets are available on a given device. That is likely to take a while to converge on, so as the driver is useful in the current state, I'm looking to upstream this first and deal with the more complex handling later. 3) There is a lot of other functionality that can be added in future include allowing for simpler hardware implementations that may not support the minimum level of features currently required by the driver (freeze, overflow interrupts etc). 4) Adding support for ports will require solving msi/msix vector requests from portdrv and how to pass those to the CXL port drivers. (or some other way to instantiate the CXL PMU devices.) RFC on that to follow. CXL rev 3.0 specification available from https://www.computeexpresslink.org Jonathan Cameron (5): perf: Allow a PMU to have a parent cxl: Add functions to get an instance of / count regblocks of a given type cxl/pci: Find and register CXL PMU devices perf: CXL Performance Monitoring Unit driver docs: perf: Minimal introduction the the CXL PMU device and driver Documentation/admin-guide/perf/cxl.rst | 68 ++ Documentation/admin-guide/perf/index.rst | 1 + MAINTAINERS | 7 + drivers/cxl/Kconfig | 13 + drivers/cxl/core/Makefile | 1 + drivers/cxl/core/core.h | 1 + drivers/cxl/core/pmu.c | 69 ++ drivers/cxl/core/port.c | 2 + drivers/cxl/core/regs.c | 75 +- drivers/cxl/cxl.h | 16 + drivers/cxl/cxlpci.h | 1 + drivers/cxl/pci.c | 26 +- drivers/cxl/pmu.h | 28 + drivers/perf/Kconfig | 13 + drivers/perf/Makefile | 1 + drivers/perf/cxl_pmu.c | 984 +++++++++++++++++++++++ include/linux/perf_event.h | 1 + kernel/events/core.c | 1 + 18 files changed, 1301 insertions(+), 7 deletions(-) create mode 100644 Documentation/admin-guide/perf/cxl.rst create mode 100644 drivers/cxl/core/pmu.c create mode 100644 drivers/cxl/pmu.h create mode 100644 drivers/perf/cxl_pmu.c -- 2.37.2