Received: by 2002:a05:6a11:4021:0:0:0:0 with SMTP id ky33csp2872444pxb; Tue, 21 Sep 2021 09:25:48 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxGe81JhO0SEueI1rOZCx3KF3mFZ0ZrO2f8kAjw10FFGQmMWSHtt/JYmV5XUn4Jgjexz+BL X-Received: by 2002:a02:908a:: with SMTP id x10mr776303jaf.30.1632241548165; Tue, 21 Sep 2021 09:25:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1632241548; cv=none; d=google.com; s=arc-20160816; b=r6IzeGrgiQiQFWfc+NDZsiOss2U1tgkhbW/d+glzgn3BgHo6aFnYUo+N7RxK41zvtY U9rKYc91gLyQ74Qfeuv1LVO1pl5o3etGjBP3di59wReyenhkElq7Le3dVe0uPMP7TgNq 7PBV/3BTvWXoJYLDiqaPL0RZroCs60KK4p6hNXVhmqiUZFSkKyhsoOPOmKLdWlee2on1 hmNniQF8Eu9/QdAepPHLq964Rrq0ttwbX4Lc66CD16sWIkCi0gGswXWZUucid4mNT5PV M0G4U4yApmzAAkWSmDs9DgN8UOWQVrpLlJ2Kki/W8e4044JL77z3w1SVDKmy8ot0xjS4 BvbQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=HF9+AbBQkCCON1TZJb53hdoYpaedJ50CjP6kHTmBfZA=; b=g+0XT1qMb/uQBNUP/MgU3oV8DWciwr6BM/OgojDkyI+O0A4e7xP6HOwtXY027nRhvj RwxxHVvYgK1L/JQVxpilzxSIMcIXFfiyDCe/id1eNkN6+spOCq8R50H4EnYo/gGnF9fu wcgHBYErNTnM3+7iCSuimD7DWCqWOPrwlzUw65JPr55RXbYHq3fkw4vLdw8zuXNeKgue iNllZpSYS5pFFLY+J4tEDGJPFQ1BwiMxZZlMtA5TbtPegF4HL0e0lr23jId+Ju6StszF JsFjmh5r6OfpXzl0KuYypMrr/o1YwW2W48KNENB/95AWoPZYd5DuI9ZrmuHdAfwO5fqh z7gg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i12si16252152jac.26.2021.09.21.09.25.35; Tue, 21 Sep 2021 09:25:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234013AbhIUQXz (ORCPT + 99 others); Tue, 21 Sep 2021 12:23:55 -0400 Received: from foss.arm.com ([217.140.110.172]:36116 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233948AbhIUQXz (ORCPT ); Tue, 21 Sep 2021 12:23:55 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 52BD4113E; Tue, 21 Sep 2021 09:22:26 -0700 (PDT) Received: from [10.57.95.67] (unknown [10.57.95.67]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 34BBC3F718; Tue, 21 Sep 2021 09:22:25 -0700 (PDT) Subject: Re: [PATCH] coresight: Don't immediately close events that are run on invalid CPU/sink combos To: Mathieu Poirier , James Clark Cc: coresight@lists.linaro.org, Mike Leach , Leo Yan , linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org References: <20210921130231.386095-1-james.clark@arm.com> <20210921151721.GA2059841@p14s> From: Suzuki K Poulose Message-ID: <2d1326ea-a60c-8723-28a4-891a5478846f@arm.com> Date: Tue, 21 Sep 2021 17:22:23 +0100 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:78.0) Gecko/20100101 Thunderbird/78.14.0 MIME-Version: 1.0 In-Reply-To: <20210921151721.GA2059841@p14s> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-GB Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21/09/2021 16:17, Mathieu Poirier wrote: > On Tue, Sep 21, 2021 at 02:02:31PM +0100, James Clark wrote: >> When a traced process runs on a CPU that can't reach the selected sink, >> the event will be stopped with PERF_HES_STOPPED. This means that even if >> the process migrates to a valid CPU, tracing will not resume. >> >> This can be reproduced (on N1SDP) by using taskset to start the process >> on CPU 0, and then switching it to CPU 2 (ETF 1 is only reachable from >> CPU 2): >> >> taskset --cpu-list 0 ./perf record -e cs_etm/@tmc_etf1/ --per-thread -- taskset --cpu-list 2 ls >> >> This produces a single 0 length AUX record, and then no more trace: >> >> 0x3c8 [0x30]: PERF_RECORD_AUX offset: 0 size: 0 flags: 0x1 [T] >> >> After the fix, the same command produces normal AUX records. The perf >> self test "89: Check Arm CoreSight trace data recording and synthesized >> samples" no longer fails intermittently. This was because the taskset in >> the test is after the fork, so there is a period where the task is >> scheduled on a random CPU rather than forced to a valid one. >> >> Specifically selecting an invalid CPU will still result in a failure to >> open the event because it will never produce trace: >> >> ./perf record -C 2 -e cs_etm/@tmc_etf0/ >> failed to mmap with 12 (Cannot allocate memory) >> >> The only scenario that has changed is if the CPU mask has a valid CPU >> sink combo in it. >> >> Testing >> ======= >> >> * Coresight self test passes consistently: >> ./perf test Coresight >> >> * CPU wide mode still produces trace: >> ./perf record -e cs_etm// -a >> >> * Invalid -C options still fail to open: >> ./perf record -C 2,3 -e cs_etm/@tmc_etf0/ >> failed to mmap with 12 (Cannot allocate memory) >> >> * Migrating a task to a valid sink/CPU now produces trace: >> taskset --cpu-list 0 ./perf record -e cs_etm/@tmc_etf1/ --per-thread -- taskset --cpu-list 2 ls >> >> * If the task remains on an invalid CPU, no trace is emitted: >> taskset --cpu-list 0 ./perf record -e cs_etm/@tmc_etf1/ --per-thread -- ls >> >> Signed-off-by: James Clark >> --- >> .../hwtracing/coresight/coresight-etm-perf.c | 27 +++++++++++++++---- >> 1 file changed, 22 insertions(+), 5 deletions(-) > > Very interesting corner case - and I like your solution. Arnaldo, please > consider. > > Reviewed-by: Mathieu Poirier > PS: This is for coresight driver, I can pick this up. Otherwise, Reviewed-by: Suzuki K Poulose >> >> diff --git a/drivers/hwtracing/coresight/coresight-etm-perf.c b/drivers/hwtracing/coresight/coresight-etm-perf.c >> index 8ebd728d3a80..79346f0f0e0b 100644 >> --- a/drivers/hwtracing/coresight/coresight-etm-perf.c >> +++ b/drivers/hwtracing/coresight/coresight-etm-perf.c >> @@ -452,9 +452,14 @@ static void etm_event_start(struct perf_event *event, int flags) >> * sink from this ETM. We can't do much in this case if >> * the sink was specified or hinted to the driver. For >> * now, simply don't record anything on this ETM. >> + * >> + * As such we pretend that everything is fine, and let >> + * it continue without actually tracing. The event could >> + * continue tracing when it moves to a CPU where it is >> + * reachable to a sink. >> */ >> if (!cpumask_test_cpu(cpu, &event_data->mask)) >> - goto fail_end_stop; >> + goto out; >> >> path = etm_event_cpu_path(event_data, cpu); >> /* We need a sink, no need to continue without one */ >> @@ -466,16 +471,15 @@ static void etm_event_start(struct perf_event *event, int flags) >> if (coresight_enable_path(path, CS_MODE_PERF, handle)) >> goto fail_end_stop; >> >> - /* Tell the perf core the event is alive */ >> - event->hw.state = 0; >> - >> /* Finally enable the tracer */ >> if (source_ops(csdev)->enable(csdev, event, CS_MODE_PERF)) >> goto fail_disable_path; >> >> +out: >> + /* Tell the perf core the event is alive */ >> + event->hw.state = 0; >> /* Save the event_data for this ETM */ >> ctxt->event_data = event_data; >> -out: >> return; >> >> fail_disable_path: >> @@ -517,6 +521,19 @@ static void etm_event_stop(struct perf_event *event, int mode) >> if (WARN_ON(!event_data)) >> return; >> >> + /* >> + * Check if this ETM was allowed to trace, as decided at >> + * etm_setup_aux(). If it wasn't allowed to trace, then >> + * nothing needs to be torn down other than outputting a >> + * zero sized record. >> + */ >> + if (handle->event && (mode & PERF_EF_UPDATE) && >> + !cpumask_test_cpu(cpu, &event_data->mask)) { >> + event->hw.state = PERF_HES_STOPPED; >> + perf_aux_output_end(handle, 0); >> + return; >> + } >> + >> if (!csdev) >> return; >> >> -- >> 2.28.0 >>