Received: by 2002:a05:6a10:c7c6:0:0:0:0 with SMTP id h6csp1726034pxy; Mon, 2 Aug 2021 08:46:13 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyKNuciPFIr1FZFDVmCJuAfL18BGsJYka3qmTgqJUJzGWs1ACYnxrmrUZcWSwpqd1eLUiRO X-Received: by 2002:a17:906:fa10:: with SMTP id lo16mr16243393ejb.154.1627919173605; Mon, 02 Aug 2021 08:46:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1627919173; cv=none; d=google.com; s=arc-20160816; b=b87R8odjLM+uhu17dVom0f9oR2yso3zSHN4dQ9g9GqMfjHmnycxkRx96vBSmzDDDA9 2HgvL4S+ApQBiUMgXJIMIOGb42+jVuldztihQUU681xI4nGRXVy63j12q4xRuTEKzBhA j4XYTRdjs8zC6y3YpxXqXP549uVS/XX/wO6GMmQdsMb4nVszyL/IwcwrD9NQKCk0d+Uc hObafSbEHgPJB0jAyqCpw7bsUfwJqAh+7Gxm4FqT2MIHMDvJ+A0p24FJogC5nNOK1fqE 0UA1lE9oiPWqrsP1FprqP/74PQQPT9kNoIj3KH+yYvCunBhUNl5EN2Jwgice8aF4USry W0CA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=zAWuQ/JvoX+qW03VDGPJ9Ofp3oF3NSH0evX+ItLfT0M=; b=FUJXrRBxqvZ6ugrnaVoYsCwdB0UnDkiI1+0KR5pVzeeZlBlQCEmzIAcuIn/d8NDeNx iqJfo5iqlk5npUTYwgnFk/Q5xvkMAf5D0Uh8LzsEK88x2MC8NsyBRUYhYRSE6IO8yY9n Em2LcrH8zgMmyzeSxZZqNHO7zbG6aIJslXUYERDIhrRqy+Lpcp1bOhvKng12e9FJAakN kLH1bgi9fZYuI9DpFYA547yjgc/D2E+t8pk47F+8vFD5b+6HMEPS7OL0Z4zvxEuUvqYl iJxpwjGWjZpUw4fXHYvNOsMfGvtxltGpYrEeTyN4X6JGIoc17ouuqBQyLP8fCJy7ce4F IPkA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=OY1dHlSB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id lv19si12111134ejb.527.2021.08.02.08.45.42; Mon, 02 Aug 2021 08:46:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=OY1dHlSB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235243AbhHBPoX (ORCPT + 99 others); Mon, 2 Aug 2021 11:44:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39380 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234551AbhHBPoW (ORCPT ); Mon, 2 Aug 2021 11:44:22 -0400 Received: from mail-wm1-x32c.google.com (mail-wm1-x32c.google.com [IPv6:2a00:1450:4864:20::32c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 062F0C06175F for ; Mon, 2 Aug 2021 08:44:12 -0700 (PDT) Received: by mail-wm1-x32c.google.com with SMTP id n12-20020a05600c3b8cb029025a67bbd40aso3915535wms.0 for ; Mon, 02 Aug 2021 08:44:11 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=zAWuQ/JvoX+qW03VDGPJ9Ofp3oF3NSH0evX+ItLfT0M=; b=OY1dHlSBGhnSjUiSRde6FOpi/D/J7vDN3qiynTmUlf6bnERXdwDmKC+7PzgqdLk2VQ 6k3GrRyW36WgMN0o5BCwydoFGH4LJ3/Td+yTEL2V0YgpV6njdbtogAPjJdjk3Zbc5vED uxDQrZlBUxtd99jT+x1vQWolvyWpM4AFa8GYKJBYNk9U6K9uhIivv3UR8tFoz+b2YCyd 3+K4ynU4wb5PxA9Sg88UzIdL+uzuohxlpQG0/uGCx/97aBGwwyM9AMyl7xf+OhyBmuOK qWl1yv2dQPqzDtWlA5msz53nVoSicHeDAYeGDn/2tdS/3UYQQB+gKVYy19uWXoVYrYMX vYKw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=zAWuQ/JvoX+qW03VDGPJ9Ofp3oF3NSH0evX+ItLfT0M=; b=EGJazFyqQ6prnhyODjGLkOMpIQwji0Ej/v4xwL/3VFG+4BFqqcUFJK8farXM5zz/0W ZZm7pf9viXEMevlf0eIwqnq9LF4kG968zFo7f+RBSZlJOGKTpa/bGvLmV6klszpCZwvX YKQs9gzgcl2csnnjRrI4jaEqX/3yM8mxELLOUL3wX2njKNDK62pff+IjeW90m6hD16Sp VpAkUqnsTsoSLTM/3OAeo4onAYbBHPCSD7hMgppGhaP4FfI3hCbO9cxpx+C2sT9y/TCs g38pPA1MaYtsr1CHGJ3a5CwWPQn7kiwlMKb3lwg5HsCJ2gARgWzH86WHre7Sm3y4Y8Sq pqIQ== X-Gm-Message-State: AOAM531XKZap4qkhPr00q+Vk5CUL5RY8+uZZ4VPyYAMzMLb857kCSDw0 VfqkM4aoiizmepszfW+/FT5CU5UlEauOd4zvv6cjdg== X-Received: by 2002:a7b:c0cc:: with SMTP id s12mr49605wmh.0.1627919050360; Mon, 02 Aug 2021 08:44:10 -0700 (PDT) MIME-Version: 1.0 References: <20210721090706.21523-1-james.clark@arm.com> <20210721090706.21523-3-james.clark@arm.com> <20210731060312.GB7437@leoy-ThinkPad-X240s> <20210802150358.GA148327@leoy-ThinkPad-X240s> In-Reply-To: <20210802150358.GA148327@leoy-ThinkPad-X240s> From: Mike Leach Date: Mon, 2 Aug 2021 16:43:59 +0100 Message-ID: Subject: Re: [PATCH 2/6] perf cs-etm: Initialise architecture based on TRCIDR1 To: Leo Yan Cc: James Clark , Arnaldo Carvalho de Melo , Mathieu Poirier , Coresight ML , Al Grant , "Suzuki K. Poulose" , Anshuman Khandual , John Garry , Will Deacon , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-arm-kernel , Linux Kernel Mailing List , linux-perf-users@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Leo, On Mon, 2 Aug 2021 at 16:04, Leo Yan wrote: > > Hi Mike, > > On Mon, Aug 02, 2021 at 03:04:14PM +0100, Mike Leach wrote: > > [...] > > > > > > +#define TRCIDR1_TRCARCHMIN_SHIFT 4 > > > > > +#define TRCIDR1_TRCARCHMIN_MASK GENMASK(7, 4) > > > > > +#define TRCIDR1_TRCARCHMIN(x) (((x) & TRCIDR1_TRCARCHMIN_MASK) >> TRCIDR1_TRCARCHMIN_SHIFT) > > > > > +static enum _ocsd_arch_version cs_etm_decoder__get_arch_ver(u32 reg_idr1) > > > > > +{ > > > > > + /* > > > > > + * If the ETM trace minor version is 4 or more then we can assume > > > > > + * the architecture is ARCH_AA64 rather than just V8 > > > > > + */ > > > > > + return TRCIDR1_TRCARCHMIN(reg_idr1) >= 4 ? ARCH_AA64 : ARCH_V8; > > > > > +} > > > > > > > > This is true for ETM4.x & ETE 1.x (arch 5.x) but not ETM 3.x > > > > Probably need to beef up this comment or the function name to emphasise this. > > > > > > Yeah, I think it's good to change the function name. Eventually, this > > > function should only be used for ETM4.x and ETE. > > > > > > Another minor comment is: can we refine the arch version number, e.g. > > > change the OpenCSD's macro "ARCH_AA64" to "ARCH_V8R4", (or > > > "ARCH_V8R3_AA64"), this can give more clear clue what's the ETM version. > > > > > > > The purpose of these macros is to inform the decoder of the > > architecture of the PE - not the version of the ETM. > > > > These OpenCSD macros are defined by the library headers > > (ocsd_if_types.h) and not the perf headers. > > These have been published as the API / ABI for OpenCSD and as such > > changing them affects all OpenCSD clients, not just perf. > > I understand these macros are defined in OpenCSD lib as APIs, since I > saw these macros have not been widely used in perf tool (e.g. > ARCH_AA64), so this is why I think it's good to take chance to refine > the naming conventions. > The macros are used in other tools - so changing now affects those too. Not something I am prepared to do without good reason. > > This PE architecture version is used along with the core profile to > > determine which instructions are valid waypoint instructions to > > associate with atom elements when walking the program image during > > trace decode. > > > > From v8.3 onwards we moved away from filtering on specific > > architecture versions. This was due to two factors:- > > 1. The architectural rules now allow architectural features for one > > increment e.g. Arch 8.4, to be backported into the previous increment > > - e,g, 8.3, which made this filtering more difficult to track. > > 2. After discussion with the PE architects it was clear that > > instructions in a later architect version would not re-use older > > opcodes from a previous one and be nop / invalid in the earlier > > architectures. (certainly in the scope of AA64). Therefore > > the policy in the decoder is to check for all the instructions we know > > about for the latest version of architecture, even if we could be > > decoding an earlier architecture version. This means we may check for > > a few more opcodes than necessary for earlier version of the > > architecture, but the overall decode is more robust and easier to > > maintain. > > > > Therefore for any AA64 core beyond v8.3 - it is safe to use the > > ARCH_AA64 PE architecture version and the decoder will handle it. > > I have no objection for current approach; but two things can cause > confusions and it might be difficult for maintenance: > > - The first thing is now we base on the bit fields TRCIDR1::TRCARCHMIN > to decide the PE architecture version. In the ETMv4 spec, > TRCIDR1::TRCARCHMIN is defined as the trace unit minor version, > so essentially it's a minor version number for tracer (ETM) but not > the PE architecture number. But now we are using it to decide the > PE architecture number (8.3, 8.4, etc...). > This is a slight weakness in the implementation of perf. Ideally one does need to establish the architecture version of the PE - but perf /cs-etm is using an assumption regarding the profile and version of the core, according to the ETM / ETE versiom. That said - the ETM / ETE version numbers do have a strong relationship with PE architecture version numbers, so this assumption holds for the current supported devices. > - The second thing is the macros' naming convention. > E.g. "AA64" gives me an impression it is a general naming "Arm Arch 64" > for all Arm 64-bit CPUs, it's something like an abbreviation for > "aarch64"; so seems to me it doesn't show any meaningful info for PE's > architecture version number. This is why I proposed to use more > explict macro definition for architectures (e.g. ARCH_V8R3, ARCH_V8R4, > ARCH_V9R0, etc). > For modern cores it is sufficient for the decoder to know the profile and that it is aarch 64 - so yes the macro is simply saying this a general AA64 core. The macros for earlier versions are a little more specific as certain filtering is used according to the version of the PE. ARCH_V8R4, ARCH_V9R0 etc would have no significance to the decoder and would not be useful. If we get to the stage where we need more specific PE architecture versions - then these can be added as required. Using the ARCH_AA64 macro means that we do not have to update the API for every version update of the architecture, and there are no changes required to the perf / cs-etm handling. > If we really want to use ARCH_AA64, it's better to give some comments in > the code. > There are comments in the OpenCSD headers, though additional ones in the perf / cs-etm handling soruce code could be added. Regards Mike > Thanks a lot for shared the background info. > > Leo -- Mike Leach Principal Engineer, ARM Ltd. Manchester Design Centre. UK