Received: by 2002:a05:6a10:5bc5:0:0:0:0 with SMTP id os5csp124866pxb; Mon, 18 Oct 2021 22:28:33 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzWCemAHbWfFML7rC27Rw8vDVYlY9lZUhprQOFKT+XkiiWhM+TO0y3QaalOY9zH3RGCRF5r X-Received: by 2002:a05:6402:268c:: with SMTP id w12mr51545514edd.376.1634621313620; Mon, 18 Oct 2021 22:28:33 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1634621313; cv=none; d=google.com; s=arc-20160816; b=W0oDnoQt3cIWewPthQbrK8A8KeR7hZXoA26kBNVuxd7EMdGA23SaxLcdGupDag1Crm OT6siCTCPU/1T9IV6uOMP+tPEkw3JJ7XA64VjsmU6ZeMD8OQClFZAOo/Mb4fMUT0EQx3 VQA3De6oWt9R/J8rKMnx+lvGXsGEgq2RD/3aGq2R+D1sgJLXrmMtDD/p1qEE5WtCqBvM UMx1lepRWrF8oN9b0Dsph1g+0RT/vLvCTXulkRcvVK+vKKkYUx7xjdwuwdPjxJgZenno N8sFLDWJRkcZEd4/4EMVGULQyAO0yiUgfGa1BnAZpVn1JAF9JqRpa5k8Oux7v93oZ7AJ uKkg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=WiAykbtBa9LXYFSvBrcDe+KqpSNqMPt/9aRZ6O2QlRw=; b=BxHn15rBquR310jQHOy0QK0OUuUQFUCt1vEBxd9V+QUnvRnZ+0y8HpZaVRL6udkerL XhOCyrAsKS8TD0ds80fxqmVYGLAUgQhxJzGLR9Pruko/gUU80JEGLvHPbl9eCCkM9wDz d9379LkU1qvdNLE8JC2/1yeh9Lypnoxu/jwHMuVAax7DJprpQ0FHXcDH1yTbkFP9VNgm SWliB8ir13BkZExy3uHXRxQ4DCf412AdOh4apchEwt/NXDxbOn0f3/pfW9M40Als2Nak TZLq8h9TCMH0g2LkmYV99KeqOSIM12ZzsYEWQXuIDrBsfh7luqd2uKwOlxyyLk5NGnro +7xw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id 5si23026919eji.132.2021.10.18.22.28.09; Mon, 18 Oct 2021 22:28:33 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233119AbhJSF1x (ORCPT + 99 others); Tue, 19 Oct 2021 01:27:53 -0400 Received: from foss.arm.com ([217.140.110.172]:44548 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229521AbhJSF1w (ORCPT ); Tue, 19 Oct 2021 01:27:52 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 3D9432F; Mon, 18 Oct 2021 22:25:40 -0700 (PDT) Received: from [10.163.74.241] (unknown [10.163.74.241]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 597C33F73D; Mon, 18 Oct 2021 22:25:37 -0700 (PDT) Subject: Re: [PATCH v5 09/15] coresight: trbe: Add infrastructure for Errata handling To: Suzuki K Poulose , will@kernel.org, mathieu.poirier@linaro.org Cc: catalin.marinas@arm.com, mike.leach@linaro.org, leo.yan@linaro.org, maz@kernel.org, coresight@lists.linaro.org, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org References: <20211014223125.2605031-1-suzuki.poulose@arm.com> <20211014223125.2605031-10-suzuki.poulose@arm.com> From: Anshuman Khandual Message-ID: Date: Tue, 19 Oct 2021 10:55:36 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.10.0 MIME-Version: 1.0 In-Reply-To: <20211014223125.2605031-10-suzuki.poulose@arm.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/15/21 4:01 AM, Suzuki K Poulose wrote: > Add a minimal infrastructure to keep track of the errata > affecting the given TRBE instance. Given that we have > heterogeneous CPUs, we have to manage the list per-TRBE > instance to be able to apply the work around as needed. > Thus we will need to check if individual CPUs are affected > by the erratum. > > We rely on the arm64 errata framework for the actual > description and the discovery of a given erratum, to > keep the Erratum work around at a central place and > benefit from the code and the advertisement from the > kernel. Though we could reuse the "this_cpu_has_cap()" > to apply an erratum work around, it is a bit of a heavy > operation, as it must go through the "erratum" detection > check on the CPU every time it is called (e.g, scanning > through a table of affected MIDRs). Since we need > to do this check for every session, may be multiple > times (depending on the wrok around), we could save > the cycles by caching the affected errata per-CPU > instance in the per-CPU struct trbe_cpudata. > > Since we are only interested in the errata affecting > the TRBE driver, we only need to track a very few of them > per-CPU. Thus we use a local mapping of the CPUCAP for the > erratum to avoid bloating up a bitmap for trbe_cpudata. > > i.e, each arm64 TRBE erratum bit is assigned a "index" > within the driver to track. Each trbe instance updates > the list of affected erratum at probe time on the CPU. > This makes sure that we can easily access the list of > errata on a given TRBE instance without much overhead. > > Cc: Mathieu Poirier > Cc: Mike Leach > Cc: Leo Yan > Cc: Anshuman Khandual > Reviewed-by: Mathieu Poirier > Signed-off-by: Suzuki K Poulose > --- > Changes since v4: > - Ensure the arm_trbe_probe_cpu() is called from non preemptible > context for hotplugged CPUs This (both renaming and non preemptible context) makes sense. Reviewed-by: Anshuman Khandual A small nit though. Should the array sentinel value '-1' be bit formalized with a macro and check like this != instead. if (WARN_ON_ONCE(cap != TRBE_ERRATA_SENTINEL)) > Changes since v2: > - Automatically define TRBE_ERRATA_MAX > - Add some basic sanity check to make sure the new entries > are added in order. > - Describe the design choice of caching CPU local errata > in trbe_cpudata instead of using this_cpu_has_cap() > Changes since v1: > - Flip the order of args for trbe_has_erratum() > - Move erratum detection further down in the sequence > --- > drivers/hwtracing/coresight/coresight-trbe.c | 71 +++++++++++++++++++- > 1 file changed, 68 insertions(+), 3 deletions(-) > > diff --git a/drivers/hwtracing/coresight/coresight-trbe.c b/drivers/hwtracing/coresight/coresight-trbe.c > index f8c04c428780..314e5e7374c7 100644 > --- a/drivers/hwtracing/coresight/coresight-trbe.c > +++ b/drivers/hwtracing/coresight/coresight-trbe.c > @@ -16,6 +16,7 @@ > #define pr_fmt(fmt) DRVNAME ": " fmt > > #include > + > #include "coresight-self-hosted-trace.h" > #include "coresight-trbe.h" > > @@ -67,14 +68,43 @@ struct trbe_buf { > struct trbe_cpudata *cpudata; > }; > > +/* > + * TRBE erratum list > + * > + * The errata are defined in arm64 generic cpu_errata framework. > + * Since the errata work arounds could be applied individually > + * to the affected CPUs inside the TRBE driver, we need to know if > + * a given CPU is affected by the erratum. Unlike the other erratum > + * work arounds, TRBE driver needs to check multiple times during > + * a trace session. Thus we need a quicker access to per-CPU > + * errata and not issue costly this_cpu_has_cap() everytime. > + * We keep a set of the affected errata in trbe_cpudata, per TRBE. > + * > + * We rely on the corresponding cpucaps to be defined for a given > + * TRBE erratum. We map the given cpucap into a TRBE internal number > + * to make the tracking of the errata lean. > + * > + * This helps in : > + * - Not duplicating the detection logic > + * - Streamlined detection of erratum across the system > + */ > + > +static int trbe_errata_cpucaps[] = { > + -1, /* Sentinel, must be the last entry */ > +}; > + > +/* The total number of listed errata in trbe_errata_cpucaps */ > +#define TRBE_ERRATA_MAX (ARRAY_SIZE(trbe_errata_cpucaps) - 1) > + > /* > * struct trbe_cpudata: TRBE instance specific data > * @trbe_flag - TRBE dirty/access flag support > * @trbe_hw_align - Actual TRBE alignment required for TRBPTR_EL1. > - * @trbe_align - Software alignment used for the TRBPTR_EL1, > + * @trbe_align - Software alignment used for the TRBPTR_EL1 > * @cpu - CPU this TRBE belongs to. > * @mode - Mode of current operation. (perf/disabled) > * @drvdata - TRBE specific drvdata > + * @errata - Bit map for the errata on this TRBE. > */ > struct trbe_cpudata { > bool trbe_flag; > @@ -84,6 +114,7 @@ struct trbe_cpudata { > enum cs_mode mode; > struct trbe_buf *buf; > struct trbe_drvdata *drvdata; > + DECLARE_BITMAP(errata, TRBE_ERRATA_MAX); > }; > > struct trbe_drvdata { > @@ -96,6 +127,25 @@ struct trbe_drvdata { > struct platform_device *pdev; > }; > > +static void trbe_check_errata(struct trbe_cpudata *cpudata) > +{ > + int i; > + > + for (i = 0; i < TRBE_ERRATA_MAX; i++) { > + int cap = trbe_errata_cpucaps[i]; > + > + if (WARN_ON_ONCE(cap < 0)) > + return; > + if (this_cpu_has_cap(cap)) > + set_bit(i, cpudata->errata); > + } > +} > + > +static inline bool trbe_has_erratum(struct trbe_cpudata *cpudata, int i) > +{ > + return (i < TRBE_ERRATA_MAX) && test_bit(i, cpudata->errata); > +} > + > static int trbe_alloc_node(struct perf_event *event) > { > if (event->cpu == -1) > @@ -952,6 +1002,9 @@ static void arm_trbe_register_coresight_cpu(struct trbe_drvdata *drvdata, int cp > cpumask_clear_cpu(cpu, &drvdata->supported_cpus); > } > > +/* > + * Must be called with preemption disabled, for trbe_check_errata(). > + */ > static void arm_trbe_probe_cpu(void *info) > { > struct trbe_drvdata *drvdata = info; > @@ -979,6 +1032,12 @@ static void arm_trbe_probe_cpu(void *info) > goto cpu_clear; > } > > + /* > + * Run the TRBE erratum checks, now that we know > + * this instance is about to be registered. > + */ > + trbe_check_errata(cpudata); > + > cpudata->trbe_align = cpudata->trbe_hw_align; > cpudata->trbe_flag = get_trbe_flag_update(trbidr); > cpudata->cpu = cpu; > @@ -1032,18 +1091,24 @@ static int arm_trbe_remove_coresight(struct trbe_drvdata *drvdata) > return 0; > } > > +static void arm_trbe_probe_hotplugged_cpu(struct trbe_drvdata *drvdata) > +{ > + preempt_disable(); > + arm_trbe_probe_cpu(drvdata); > + preempt_enable(); > +} > + > static int arm_trbe_cpu_startup(unsigned int cpu, struct hlist_node *node) > { > struct trbe_drvdata *drvdata = hlist_entry_safe(node, struct trbe_drvdata, hotplug_node); > > if (cpumask_test_cpu(cpu, &drvdata->supported_cpus)) { > - > /* > * If this CPU was not probed for TRBE, > * initialize it now. > */ > if (!coresight_get_percpu_sink(cpu)) { > - arm_trbe_probe_cpu(drvdata); > + arm_trbe_probe_hotplugged_cpu(drvdata); > if (cpumask_test_cpu(cpu, &drvdata->supported_cpus)) > arm_trbe_register_coresight_cpu(drvdata, cpu); > if (cpumask_test_cpu(cpu, &drvdata->supported_cpus)) >