Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp2492051ybc; Sun, 17 Nov 2019 23:07:34 -0800 (PST) X-Google-Smtp-Source: APXvYqznBaI+7B/hJEZ5X4i8qwcEYPfC1najP1r2HuysBvmmbjSLIGH19myawL0wHr6raXlCnGxl X-Received: by 2002:a17:906:fcdb:: with SMTP id qx27mr24367987ejb.255.1574060854869; Sun, 17 Nov 2019 23:07:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1574060854; cv=none; d=google.com; s=arc-20160816; b=Waaxtugnb8g6DT8BI2TtnQjJ2IaF1jqYErWZGUnUq3v0Jfw355JRbSjwSg7h6ORtQU G9awpo4uWImUTY7q6A4FoYlDRt1i08qMzDc6eiUREVHZ8dWjMjKVvwrPk3ZKewC7AfG1 kqWkINlp2dFALuG2Ad5d//30vFutiEyIXnXZQx7TdVIZTDSvh2GGXlwVygabL1w7yt97 bXEWwzp2IS9sQwj5uYKZuEyMWOF4kCiuQB/Htlc6yW9N3P4GP/LSnxav3y3FiblOQCmK QRFOfi+ww4lIira2rdVL3THnGTOthRrZEwTDOs0YInaOQl270TMZMTkZWUcnltyMOxL0 8jBw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=VMbRkMv6lpqYxCRN/Iq3NzPU3YFn64D95ReaYjnSaco=; b=A4YcVoyhvEHb0A7NAldmDy8QcrtpwEmdfu+XGqDG1kZxwYtqGB86vNcazkkpXkSOk/ 5fxoG1elIZKfo1Row9zL0JWKkhW34PhBxc2C93sGCCgBKm6erhSxs/aDRhDmpzYnnoH7 dTCe7nJFotHTgQOKe8Df0iSmM0xtPJI7d0jaMEp880IpePfzBE+gaaDjWjpRxrJnUpk4 VROMgRpWcG4NrujwhOTSPCa9IjzFBnckJ9IDYXqvwNeeZXKkVpWGQjCE44F+Znbz9pNz Np9wZvxzDGIEPTXe3LImc4q42gWG9fbB4sXtVucqZA85Z3mpaKQ7FuwEsOx5R5nDezcS 43Qw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n15si10297182ejj.68.2019.11.17.23.07.09; Sun, 17 Nov 2019 23:07:34 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726712AbfKRHGH (ORCPT + 99 others); Mon, 18 Nov 2019 02:06:07 -0500 Received: from szxga05-in.huawei.com ([45.249.212.191]:6246 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726483AbfKRHGH (ORCPT ); Mon, 18 Nov 2019 02:06:07 -0500 Received: from DGGEMS403-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 6683AFF17CA0DF1A3074; Mon, 18 Nov 2019 15:06:03 +0800 (CST) Received: from [127.0.0.1] (10.133.215.182) by DGGEMS403-HUB.china.huawei.com (10.3.19.203) with Microsoft SMTP Server id 14.3.439.0; Mon, 18 Nov 2019 15:05:59 +0800 Subject: Re: [RFC v2 4/4] perf tools: Support "branch-misses:pp" on arm64 To: James Clark CC: "linux-kernel@vger.kernel.org" , "linux-perf-users@vger.kernel.org" , nd References: <20191024144830.16534-1-tanxiaojun@huawei.com> <20191024144830.16534-5-tanxiaojun@huawei.com> <38c18a3e-1b9a-05fe-63f6-920af2f53fc7@huawei.com> <609eb078-7998-9e4a-ca04-6c40a8a47f84@arm.com> <7137fecb-a0bd-6dee-14c9-5753e56d39a1@huawei.com> From: Tan Xiaojun Message-ID: <41943f9a-9a52-43b4-67aa-1c669703dfc9@huawei.com> Date: Mon, 18 Nov 2019 15:05:58 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.7.1 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 8bit X-Originating-IP: [10.133.215.182] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/11/15 19:37, James Clark wrote: > Hi Xiaojun, > > If the difference is not noticeable I think it would be better to leave it disabled. Presumably if the user > supplies the ":p" argument they are interested in the data being as precise as possible. > > If they want to enable jitter, then can always configure the SPE event manually. > Hi,++ James, OK. Agree. > I have a question about what kind of approach you think we should take for multiple events that are provided with :p. > For example "perf record -e branch-misses:p -e cache-misses:p ...". In your current implementation this will > give the error "There may be only one SPE event". I think this is fine for a first implementation. But I wonder if there > is a way of supporting multiple SPE events? > > From the documentation it seems like the filter events are ANDed together: > > PMSEVFR_EL1. > Controls sample filtering by events. The overall filter is the logical AND of these filters. For example, if E[3] and E[5] are both set to 1, > only samples that have both event 3 (Level 1 unified or data cache refill) and event 5 set (TLB walk) are recorded > > Which means that if we kept adding filters for new event types, there would be no events received because they wouldn't satisfy the filter requirements > of being caused by a branch miss AND a cache miss for example. I have asked internally about whether this is a mistake or not. > Yes, this is a problem, and you mentioned that we have to define several new spe events for this, and I am still considering how to add them.(I originally wanted to add them to the spe driver, but this will not avoid the problem of multiple spe events you mentioned above.) Based on scenarios where no new spe events are added, if the user specifies multiple spe events, I think we need to analyze these events and classify the spe events. If there are multiple, we need to keep only one (but record all spe events name or alias like "branch-misses, cache-misses" in some variables) and disable event_filter. Then "perf report" can create new selectors for these synthetic events and output them. Thanks. Xiaojun. > > Thanks > James > > On 15/11/2019 02:59, Tan Xiaojun wrote: >> On 2019/11/13 22:47, James Clark wrote: >>> Hi Xiaojun, >>> >>>> I can't reproduce this problem. If the current system doesn't support spe, it shouldn't report an error. I use the latest codes of the mainline: >>> >>> I think the problem is related to the 'type' attribute of the event. To open the SPE PMU the event type on the platform I'm using is '7'. If I change >>> the code like this, the problem is fixed: >>> >>> @@ -914,13 +914,27 @@ void arm_spe_precise_ip_support(struct evlist *evlist, struct evsel *evsel) >>> pmu = perf_pmu__find("arm_spe_0"); >>> if (pmu) { >>> evsel->pmu_name = pmu->name; >>> - evsel->core.attr.type = PERF_RECORD_AUXTRACE; >>> - evsel->core.attr.config = SPE_ATTR_TS_ENABLE >>> - | SPE_ATTR_PA_ENABLE >>> - | SPE_ATTR_JITTER >>> + evsel->core.attr.type = pmu->type; >>> + evsel->core.attr.config |= SPE_ATTR_TS_ENABLE >>> | SPE_ATTR_BRANCH_FILTER; >>> >> >> Hi, James, >> OK. Thank you for your fix. >> >>> Also do you think jitter should be enabled by default? I thought that it might make the data less precise, so I removed it here. >> >> Since the interval for sampling without "jitter" is fixed (default 1024 on our server), I was worried that not adding it would result in the same result for each record, and some instructions could not be collected each time. >> >> However, after many tests, it is not clear from the results that there is a significant difference between them (enable it or not). >> >> So I am confused, whether to enable it or not. >> >> Thanks. >> Xiaojun. >> >>> >>> -James >>> >>>> >>>> commit f116b96685a046a89c25d4a6ba2da489145c8888 (mainline/master) >>>> Merge: f632bfaa33ed 603d9299da32 >>>> Author: Linus Torvalds >>>> Date: Thu Oct 24 06:13:45 2019 -0400 >>>> >>>> Merge tag 'mfd-fixes-5.4' of git://git.kernel.org/pub/scm/linux/kernel/git/lee/mfd >>>> >>>> I will go and see why this will be reported. >>>> >>>>> >>>>> >>>>> I would have expected to use the event name that is listed in the SPE documentation for branch misses which is br_mis_pred or br_mis_pred_retired: >>>>> >>>>>     E[7], byte 0 bit [7] >>>>>     Mispredicted. The defined values of this bit are: >>>>>     0 Did not cause correction to the predicted program flow. >>>>>     1 A branch that caused a correction to the predicted program flow. >>>>> >>>>>     If PMUv3 is implemented this Event is required to be implemented consistently with either BR_MIS_PRED or BR_MIS_PRED_RETIRED. >>>>> >>>> >>>> Do you mean that I can add these as new events to perf? If we think of them as new events, what should we do if the user does not add :pp for them? >>>> (Or for these events, users can only add :pp to use them?) >>>> >>>>> >>>>> +       if (!strcmp(perf_env__arch(evlist->env), "arm64") >>>>> +                       && evsel->core.attr.config == PERF_COUNT_HW_BRANCH_MISSES >>>>> +                       && evsel->core.attr.precise_ip) { >>>>> >>>>> As I mentioned above PERF_COUNT_HW_BRANCH_MISSESdoesn't seem to match up with the actual event counter that is associated with this SPE event (BR_MIS_PRED). The fix for this is probably as simple as adding an OR for the other aliases for branch mispredicts. >>>> >>>> What you mean is that we can filter with spe events(like BR_MIS_PRED) first, and if we have other events that are exactly the same(no more for now), then we can handle them by adding OR in the future? >>>> >>>>> >>>>> +               pmu = perf_pmu__find("arm_spe_0"); >>>>> +               if (pmu) { >>>>> +                       evsel->pmu_name = pmu->name; >>>>> +                       evsel->core.attr.type = PERF_RECORD_AUXTRACE; >>>>> +                       evsel->core.attr.config = SPE_ATTR_TS_ENABLE >>>>> +                                               | SPE_ATTR_PA_ENABLE >>>>> >>>>> I wouldn't set physical addresses by default as this requires root. I would leave that to the user if they want to manually configure SPE. >>>> >>>> Yes. You are right. I got a error for this case. I will fix it. >>>> >>>> ------------------ >>>> ./perf record -e branch-misses:p ls >>>> Error: >>>> You may not have permission to collect stats. >>>> ... >>>> ------------------ >>>> >>>> Thanks. >>>> Xiaojun. >>>> >>>>> >>>>> I have only looked briefly and I will do some more testing. >>>>> >>>>> >>>>> Thanks >>>>> James >>>>> >>>>> >>>> >>>> >> >>