Date: Tue, 26 Apr 2022 12:07:07 +0200
From: Peter Zijlstra
To: Ravi Bangoria
Cc: acme@kernel.org, mingo@redhat.com, mark.rutland@arm.com,
	jolsa@kernel.org, namhyung@kernel.org, tglx@linutronix.de,
	bp@alien8.de, irogers@google.com, yao.jin@linux.intel.com,
	james.clark@arm.com, leo.yan@linaro.org, kan.liang@linux.intel.com,
	ak@linux.intel.com, eranian@google.com, like.xu.linux@gmail.com,
	x86@kernel.org, linux-perf-users@vger.kernel.org,
	linux-kernel@vger.kernel.org, sandipan.das@amd.com,
	ananth.narayan@amd.com, kim.phillips@amd.com, rrichter@amd.com,
	santosh.shukla@amd.com
Subject: Re: [PATCH 1/6] perf/amd/ibs: Add support for L3 miss filtering
Message-ID: <20220426100707.GP2731@worktop.programming.kicks-ass.net>
References: <20220425044323.2830-1-ravi.bangoria@amd.com>
	<20220425044323.2830-2-ravi.bangoria@amd.com>
In-Reply-To: <20220425044323.2830-2-ravi.bangoria@amd.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Apr 25, 2022 at 10:13:18AM +0530, Ravi Bangoria wrote:
> IBS L3 miss filtering works by tagging an instruction on IBS counter
> overflow and generating an NMI if the tagged instruction causes an L3
> miss. Samples without an L3 miss are discarded and counter is reset
> with random value (between 1-15 for fetch pmu and 1-127 for op pmu).
> This helps in reducing sampling overhead when user is interested only
> in such samples. One of the use case of such filtered samples is to
> feed data to page-migration daemon in tiered memory systems.
>
> Add support for L3 miss filtering in IBS driver via new pmu attribute
> "l3missonly". Example usage:
>
>   # perf record -a -e ibs_op/l3missonly=1/ --raw-samples sleep 5
>
> Signed-off-by: Ravi Bangoria
> ---
>  arch/x86/events/amd/ibs.c         | 42 ++++++++++++++++++++++---------
>  arch/x86/include/asm/perf_event.h |  3 +++
>  2 files changed, 33 insertions(+), 12 deletions(-)
>
> diff --git a/arch/x86/events/amd/ibs.c b/arch/x86/events/amd/ibs.c
> index 9739019d4b67..a5303d62060c 100644
> --- a/arch/x86/events/amd/ibs.c
> +++ b/arch/x86/events/amd/ibs.c
> @@ -520,16 +520,12 @@ static void perf_ibs_read(struct perf_event *event) { }
>
>  PMU_FORMAT_ATTR(rand_en, "config:57");
>  PMU_FORMAT_ATTR(cnt_ctl, "config:19");
> +PMU_EVENT_ATTR_STRING(l3missonly, fetch_l3missonly, "config:59");
> +PMU_EVENT_ATTR_STRING(l3missonly, op_l3missonly, "config:16");
>
> -static struct attribute *ibs_fetch_format_attrs[] = {
> -	&format_attr_rand_en.attr,
> -	NULL,
> -};
> -
> -static struct attribute *ibs_op_format_attrs[] = {
> -	NULL, /* &format_attr_cnt_ctl.attr if IBS_CAPS_OPCNT */
> -	NULL,
> -};
> +/* size = nr attrs plus NULL at the end */
> +static struct attribute *ibs_fetch_format_attrs[3];
> +static struct attribute *ibs_op_format_attrs[3];
>
>  static struct perf_ibs perf_ibs_fetch = {
>  	.pmu = {
> @@ -759,9 +755,9 @@ static __init int perf_ibs_pmu_init(struct perf_ibs *perf_ibs, char *name)
>  	return ret;
>  }
>
> -static __init void perf_event_ibs_init(void)
> +static __init void perf_ibs_fetch_prepare(void)
>  {
> -	struct attribute **attr = ibs_op_format_attrs;
> +	struct attribute **format_attrs = perf_ibs_fetch.format_attrs;
>
>  	/*
>  	 * Some chips fail to reset the fetch count when it is written; instead
> @@ -773,11 +769,22 @@ static __init void perf_event_ibs_init(void)
>  	if (boot_cpu_data.x86 == 0x19 && boot_cpu_data.x86_model < 0x10)
>  		perf_ibs_fetch.fetch_ignore_if_zero_rip = 1;
>
> +	*format_attrs++ = &format_attr_rand_en.attr;
> +	if (ibs_caps & IBS_CAPS_ZEN4IBSEXTENSIONS) {
> +		perf_ibs_fetch.config_mask |= IBS_FETCH_L3MISSONLY;
> +		*format_attrs++ = &fetch_l3missonly.attr.attr;
> +	}
> +
>  	perf_ibs_pmu_init(&perf_ibs_fetch, "ibs_fetch");
> +}
> +
> +static __init void perf_ibs_op_prepare(void)
> +{
> +	struct attribute **format_attrs = perf_ibs_op.format_attrs;
>
>  	if (ibs_caps & IBS_CAPS_OPCNT) {
>  		perf_ibs_op.config_mask |= IBS_OP_CNT_CTL;
> -		*attr++ = &format_attr_cnt_ctl.attr;
> +		*format_attrs++ = &format_attr_cnt_ctl.attr;
>  	}
>
>  	if (ibs_caps & IBS_CAPS_OPCNTEXT) {
> @@ -786,7 +793,18 @@ static __init void perf_event_ibs_init(void)
>  		perf_ibs_op.cnt_mask |= IBS_OP_MAX_CNT_EXT_MASK;
>  	}
>
> +	if (ibs_caps & IBS_CAPS_ZEN4IBSEXTENSIONS) {
> +		perf_ibs_op.config_mask |= IBS_OP_L3MISSONLY;
> +		*format_attrs++ = &op_l3missonly.attr.attr;
> +	}
> +
>  	perf_ibs_pmu_init(&perf_ibs_op, "ibs_op");
> +}

Right, so Greg told us to stop doing silly things like this and use
.is_visible, also see commits like:

  b7c9b3927337 ("perf/x86/intel: Use ->is_visible callback for default group")

There's quite a bit of that in the intel driver and some in the x86 core
code too. Please have a look.
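
To make the .is_visible suggestion concrete, here is a rough userspace
model of the pattern; it is a sketch, not kernel code and not what the
follow-up patch has to look like. Everything prefixed model_/MODEL_ is
made up for illustration, the capability bit values are placeholders,
and the callback signature is simplified (the in-kernel one takes a
struct kobject *, a struct attribute * and an index, and returns a
umode_t). The point is only that the attribute tables stay static and
const, and visibility is decided by a callback instead of patching the
arrays at init time.

```c
#include <stddef.h>
#include <stdio.h>

/* Placeholder capability bits for this model; NOT the real IBS_CAPS_* values. */
#define MODEL_CAPS_OPCNT	(1u << 0)
#define MODEL_CAPS_ZEN4_EXT	(1u << 1)

/* Stands in for the driver's ibs_caps variable. */
static unsigned int model_ibs_caps;

struct model_attr {
	const char *name;
	unsigned int mode;	/* permission bits, e.g. 0444 */
};

struct model_attr_group {
	const char *name;
	/* Return the mode to expose the attribute with, or 0 to hide it. */
	unsigned int (*is_visible)(const struct model_attr *attr, int idx);
	const struct model_attr *const *attrs;	/* NULL-terminated */
};

static const struct model_attr attr_cnt_ctl = { "cnt_ctl", 0444 };
static const struct model_attr attr_l3missonly = { "l3missonly", 0444 };

/* cnt_ctl is only shown when the (model) OPCNT capability is present. */
static unsigned int cnt_ctl_is_visible(const struct model_attr *attr, int idx)
{
	(void)idx;
	return (model_ibs_caps & MODEL_CAPS_OPCNT) ? attr->mode : 0;
}

/* l3missonly is only shown when the (model) Zen4 extensions are present. */
static unsigned int l3missonly_is_visible(const struct model_attr *attr, int idx)
{
	(void)idx;
	return (model_ibs_caps & MODEL_CAPS_ZEN4_EXT) ? attr->mode : 0;
}

static const struct model_attr *const cnt_ctl_attrs[] = { &attr_cnt_ctl, NULL };
static const struct model_attr *const l3missonly_attrs[] = { &attr_l3missonly, NULL };

static const struct model_attr_group group_cnt_ctl = {
	"format", cnt_ctl_is_visible, cnt_ctl_attrs,
};
static const struct model_attr_group group_l3missonly = {
	"format", l3missonly_is_visible, l3missonly_attrs,
};

/* One static, NULL-terminated list of groups; nothing is filled in at init. */
static const struct model_attr_group *const op_groups[] = {
	&group_cnt_ctl,
	&group_l3missonly,
	NULL,
};

/*
 * Model of what the core does when registering the groups: ask each
 * group's callback whether an attribute should be exposed. Prints the
 * visible attributes and returns how many there are.
 */
int count_visible(const struct model_attr_group *const *groups, unsigned int caps)
{
	int visible = 0;

	model_ibs_caps = caps;
	for (; *groups; groups++) {
		const struct model_attr_group *grp = *groups;

		for (int i = 0; grp->attrs[i]; i++) {
			if (!grp->is_visible || grp->is_visible(grp->attrs[i], i)) {
				printf("%s/%s\n", grp->name, grp->attrs[i]->name);
				visible++;
			}
		}
	}
	return visible;
}
```

Calling count_visible(op_groups, MODEL_CAPS_OPCNT) in this model lists
only cnt_ctl, while passing both bits lists cnt_ctl and l3missonly. In
the driver the capability check would move into the callback in the
same way, so the fixed-size ibs_*_format_attrs[3] arrays and the
*format_attrs++ bookkeeping would no longer be needed.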