Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp246167pxm; Wed, 2 Mar 2022 14:29:51 -0800 (PST) X-Google-Smtp-Source: ABdhPJwuci2ayfktSJCxe7wyz/hZNpsXUBVux00F8OEBv1JutCh00fSLouPcW7WmDvg07pfsE9/h X-Received: by 2002:a63:fb46:0:b0:372:a1d6:45ea with SMTP id w6-20020a63fb46000000b00372a1d645eamr28048793pgj.549.1646260191179; Wed, 02 Mar 2022 14:29:51 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1646260191; cv=none; d=google.com; s=arc-20160816; b=eRf0hLitUG57dyzrpfmZeCO3qNlbla9Cb+LvS6UiXC0WmIKXWjJWOdFjKXrb2mdfI5 AJD+1eifk5BDDi0nj2roFkENMEVQA50IFNcV4POlZwffiVDRZhswVy1htGb0hfrVVD4J ryVwVQIg0HnA414dyLL2c+z47LsL2X7/BPsiwRB9gha+6AZ7B9FWZ9JX6bVaNhFMDXlM 7mfFvSXZegzPWhRHNGFKJTjRF+OGMLTkEQ4h5Sw57DBgZLQjljEQUGXKJww9jrss/ul1 jnLsvTGPV1ma+50cmZBuaYpM++jP3o1OUqDK0BCCpx+Q3OqCncD3zjAWjXO0v4nMqTXW qgmA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-language:content-transfer-encoding :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject; bh=q/YudClJe2EzvSTQ01YuxNqKE5sIH9/cUtCcn8UdIk0=; b=HJX5p7unVQ6HVqnZCUQSzseKLgbZx1fy8iu5BXqD2ZNezrsI/w1ZFgVkku04DUQMM9 jFp75y8lZVrsbz3q65V/dD6SyqmrNB0/dRXybVfe3oPGmZ28yYiZVs6jMN+fmirZOTyH 06qfKjWXxe2owbrvbnSL05bJnsSYwJ+3oXOwOS1R7/ASR2Pk41EE8OfUDBPlhpVWSGf0 kwAu1eg/N3Du814KhQritVMJiF9f6LrFnzc7S/BleYPzSB/p2yEj1ocf8ueiFM99MV5t 0KVIPormBfapKs9FT5pbNrScH2hT1T8cJJhb+XRVbKhelWJ6Fn43VMrbRpoPFJNcIUfi fHFw== ARC-Authentication-Results: i=1; mx.google.com; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id m19-20020a056a00165300b004f6664623casi321509pfc.214.2022.03.02.14.29.50 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 02 Mar 2022 14:29:51 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id D6CE6E6C23; Wed, 2 Mar 2022 14:27:50 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236372AbiCBPks (ORCPT + 99 others); Wed, 2 Mar 2022 10:40:48 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60338 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233923AbiCBPkq (ORCPT ); Wed, 2 Mar 2022 10:40:46 -0500 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 97CC0CB640; Wed, 2 Mar 2022 07:40:02 -0800 (PST) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 4857A13D5; Wed, 2 Mar 2022 07:40:02 -0800 (PST) Received: from [10.1.26.154] (e127744.cambridge.arm.com [10.1.26.154]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 6ACAE3F73D; Wed, 2 Mar 2022 07:39:58 -0800 (PST) Subject: Re: [PATCH v2 2/2] perf mem: Support HITM for when mem_lvl_num is used To: Ali Saidi , linux-kernel@vger.kernel.org, linux-perf-users@vger.kernel.org, linux-arm-kernel@lists.infradead.org, leo.yan@linaro.org Cc: benh@kernel.crashing.org, Peter Zijlstra , Ingo Molnar , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , John Garry , Will Deacon , Mathieu Poirier , James Clark , Andrew Kilroy , Jin Yao , Kajol Jain , Li Huafei References: <20220221224807.18172-1-alisaidi@amazon.com> <20220221224807.18172-2-alisaidi@amazon.com> From: German Gomez Message-ID: Date: Wed, 2 Mar 2022 15:39:04 +0000 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.14.0 MIME-Version: 1.0 In-Reply-To: <20220221224807.18172-2-alisaidi@amazon.com> Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Content-Language: en-US X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,NICE_REPLY_A, RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21/02/2022 22:48, Ali Saidi wrote: > Current code only support HITM statistics for last level cache (LLC) > when mem_lvl encodes the level. On existing Arm64 machines there are as > many as four levels cache and this change supports decoding l1, l2, and > llc hits from the mem_lvl_num data. Given that the mem_lvl namespace is > being deprecated take this opportunity to encode the neoverse data into > mem_lvl_num. Since Neoverse is mentioned in the commit message, I think there should be a comment somewhere in the code as well. > > For loads that hit in a the LLC snoop filter and are fullfilled from a > higher level cache, it's not usually clear what the true level of the > cache the data came from (i.e. a transfer from a core could come from > it's L1 or L2). Instead of making an assumption of where the line came > from, add support for incrementing HITM if the source is CACHE_ANY. > > Since other architectures don't seem to populate the mem_lvl_num field > here there shouldn't be a change in functionality. > > Signed-off-by: Ali Saidi > --- > tools/perf/util/mem-events.c | 14 ++++++++++---- > 1 file changed, 10 insertions(+), 4 deletions(-) > > diff --git a/tools/perf/util/mem-events.c b/tools/perf/util/mem-events.c > index ed0ab838bcc5..6c3fd4aac7ae 100644 > --- a/tools/perf/util/mem-events.c > +++ b/tools/perf/util/mem-events.c > @@ -485,6 +485,7 @@ int c2c_decode_stats(struct c2c_stats *stats, struct mem_info *mi) > u64 daddr = mi->daddr.addr; > u64 op = data_src->mem_op; > u64 lvl = data_src->mem_lvl; > + u64 lnum = data_src->mem_lvl_num; > u64 snoop = data_src->mem_snoop; > u64 lock = data_src->mem_lock; > u64 blk = data_src->mem_blk; > @@ -527,16 +528,18 @@ do { \ > if (lvl & P(LVL, UNC)) stats->ld_uncache++; > if (lvl & P(LVL, IO)) stats->ld_io++; > if (lvl & P(LVL, LFB)) stats->ld_fbhit++; > - if (lvl & P(LVL, L1 )) stats->ld_l1hit++; > - if (lvl & P(LVL, L2 )) stats->ld_l2hit++; > - if (lvl & P(LVL, L3 )) { > + if (lvl & P(LVL, L1) || lnum == P(LVLNUM, L1)) > + stats->ld_l1hit++; > + if (lvl & P(LVL, L2) || lnum == P(LVLNUM, L2)) > + stats->ld_l2hit++; > + if (lvl & P(LVL, L3) || lnum == P(LVLNUM, L4)) { According to a comment in the previous patch, using L4 is specific to Neoverse, right? Maybe we need to distinguish the Neoverse case from the generic one here as well if (is_neoverse) // treat L4 as llc else // treat L3 as llc > if (snoop & P(SNOOP, HITM)) > HITM_INC(lcl_hitm); > else > stats->ld_llchit++; > } > > - if (lvl & P(LVL, LOC_RAM)) { > + if (lvl & P(LVL, LOC_RAM) || lnum == P(LVLNUM, RAM)) { > stats->lcl_dram++; > if (snoop & P(SNOOP, HIT)) > stats->ld_shared++; > @@ -564,6 +567,9 @@ do { \ > HITM_INC(rmt_hitm); > } > > + if (lnum == P(LVLNUM, ANY_CACHE) && snoop & P(SNOOP, HITM)) > + HITM_INC(lcl_hitm); > + > if ((lvl & P(LVL, MISS))) > stats->ld_miss++; >