Received: by 2002:a5d:925a:0:0:0:0:0 with SMTP id e26csp412283iol; Thu, 9 Jun 2022 06:24:40 -0700 (PDT) X-Google-Smtp-Source: ABdhPJymG2dwro2gIcuoe9OBzS36mToR7L4s8vysLIsBJxcfq7wuCQXnDM/MzPIJ/UszrpsBB4oN X-Received: by 2002:a17:903:1013:b0:163:e8b9:2429 with SMTP id a19-20020a170903101300b00163e8b92429mr39826452plb.74.1654781080594; Thu, 09 Jun 2022 06:24:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654781080; cv=none; d=google.com; s=arc-20160816; b=BctfgL3pOBBA41GivznYhI+O+UcC/NertN3B7vXuAFON0Pg3N7bbjZHWswzVC9mDt8 g0QTjPRN9N75QgDQ58lJQZd5+O6t1iRfWL5g7H2ubuEGCzJeK87brXt7ryleXQxu66Ru EX4Pu3CPB6qJSkwtqvsgxhbHpiypNYqVMCNf6n6W207h5dorKg+B6CkA0xY+6atfitrw +sAngkqbPzM9PqZlvpBmNJeZhKQ5qiWLKPyFGeIrdkxx+it4cc8Wozx6nugFNJmLNkdW m15LC2+aiKmUbRLR+iwKkWqfSyypZYZHOc+xSW4mP2F4CQa000mXDXLyEJWR5utKGqf0 ieGg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=llootBNvnKzlHSo7HBVwmmGvWmrZUKeALsskuPp9oY4=; b=SLWLWr3bsFb+qZhFLsqG/6L5OnGTMJrV/PD9MEoEMCcrnRnpZ4cw/JzTaYmOhuycXB suLY8OJe4Dez5waHmIhgA/wa9M1gh+Smuhb7KCMK2Idr1PcrAhuqlKpO+I838lholauF NWUiGMFdOFFSRLxsp6tPbWg+5hxk2OrfDl4yVW91Hpv+sUBKInic6F08l9JaIvEP0KaL xMPC/eN6MJznG/szIwua7xvja5HeelY9rPnMsxfWPj8CWGjoa0QgglRSZ3DoPVSrWTvZ Oxv7wYsyBj6UJaiQGxbhS9pYf5yda4EOW6RKeo8BX2gl4nXJE9FcmPnM78ndlqjSDSGS bI7Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=PFIHGY61; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l6-20020a170903244600b0016774f4edabsi18515564pls.78.2022.06.09.06.24.19; Thu, 09 Jun 2022 06:24:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=PFIHGY61; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236881AbiFIMX6 (ORCPT + 99 others); Thu, 9 Jun 2022 08:23:58 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:39292 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S245364AbiFIMXv (ORCPT ); Thu, 9 Jun 2022 08:23:51 -0400 Received: from mail-lf1-x12c.google.com (mail-lf1-x12c.google.com [IPv6:2a00:1450:4864:20::12c]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 83BA62F008 for ; Thu, 9 Jun 2022 05:23:49 -0700 (PDT) Received: by mail-lf1-x12c.google.com with SMTP id i29so20968606lfp.3 for ; Thu, 09 Jun 2022 05:23:49 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=llootBNvnKzlHSo7HBVwmmGvWmrZUKeALsskuPp9oY4=; b=PFIHGY61X8JnRhWtvwXV3fYxBQkbuMhPZHdKbAHE3ATwSdFXTvNH4TNE3BWhkVLrTA RnsJwyyJcVEHEP9Nw5e+K1ihE7iH0Xfit/DfK0NPMl9vttkwJvauVBNABZvteYYsIhEh yv7YZ//OcOg7qT5bGswpxjgt+Kfvd6LF7haJGNsBalV56ROiZBUrZdj6zL22RcMJkqIQ ZwIeIB/NPCZGcTqz0av+IrXgk4syOGa6CvDK6t6N2/4LjyIeWwkjxghkYiKyEOOhAR+U 5CXFKlE07dNhpOd0idZE212zOSd6eDxZHxCwrE/ZsdVg350SfgcYq6bDbib5CToscZpb i6Vg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=llootBNvnKzlHSo7HBVwmmGvWmrZUKeALsskuPp9oY4=; b=RAFAcfAQgTN1lhUeZMOjgpxvGazF6Iz1+MIL9LjojxLOkKrOfk0MJ3xj2WXuFUk3ey vHgxK+0ozz5PApKyhpRDXgK8qAOQVCDXxYmh8FVrGU8dNuliiuCPqfcEWZ0iF4ng6DtE mGvjhRMTEz+McSwdK9G9XYkh2YNikCvF0VfbVkDq5XIXmoYyByEo57LATXhzfNhXi1aA uatZAImhPXDd0L80YVD9GJl7aMXzduzUoQWTVEuIovdCnLpkrvDLloa/fixtzO2NGqye 6QW65nTJCQldjU+M93BxEYbGS8MFhQx2bohgy+sIpe8DjlKTkZ00NsHOS9t+jw6hBust 4R5Q== X-Gm-Message-State: AOAM530coKhY/GnuPIU1sMjfYOY1bvuOplKUgRXOXOIUVPpNoU9uh9DZ yon1UUv9KDnZ7wE7ZRRUH9Xsy3hOECva0MyfVXazGg== X-Received: by 2002:a05:6512:1085:b0:479:478b:d2cc with SMTP id j5-20020a056512108500b00479478bd2ccmr12747169lfg.540.1654777427515; Thu, 09 Jun 2022 05:23:47 -0700 (PDT) MIME-Version: 1.0 References: <20220609113046.780504-1-elver@google.com> <20220609113046.780504-5-elver@google.com> In-Reply-To: From: Dmitry Vyukov Date: Thu, 9 Jun 2022 14:23:36 +0200 Message-ID: Subject: Re: [PATCH 4/8] perf/hw_breakpoint: Make hw_breakpoint_weight() inlinable To: Marco Elver Cc: Peter Zijlstra , Frederic Weisbecker , Ingo Molnar , Thomas Gleixner , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , linux-perf-users@vger.kernel.org, x86@kernel.org, linux-sh@vger.kernel.org, kasan-dev@googlegroups.com, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, 9 Jun 2022 at 14:08, Marco Elver wrote: > > > Due to being a __weak function, hw_breakpoint_weight() will cause the > > > compiler to always emit a call to it. This generates unnecessarily bad > > > code (register spills etc.) for no good reason; in fact it appears in > > > profiles of `perf bench -r 100 breakpoint thread -b 4 -p 128 -t 512`: > > > > > > ... > > > 0.70% [kernel] [k] hw_breakpoint_weight > > > ... > > > > > > While a small percentage, no architecture defines its own > > > hw_breakpoint_weight() nor are there users outside hw_breakpoint.c, > > > which makes the fact it is currently __weak a poor choice. > > > > > > Change hw_breakpoint_weight()'s definition to follow a similar protocol > > > to hw_breakpoint_slots(), such that if defines > > > hw_breakpoint_weight(), we'll use it instead. > > > > > > The result is that it is inlined and no longer shows up in profiles. > > > > > > Signed-off-by: Marco Elver > > > --- > > > include/linux/hw_breakpoint.h | 1 - > > > kernel/events/hw_breakpoint.c | 4 +++- > > > 2 files changed, 3 insertions(+), 2 deletions(-) > > > > > > diff --git a/include/linux/hw_breakpoint.h b/include/linux/hw_breakpoint.h > > > index 78dd7035d1e5..9fa3547acd87 100644 > > > --- a/include/linux/hw_breakpoint.h > > > +++ b/include/linux/hw_breakpoint.h > > > @@ -79,7 +79,6 @@ extern int dbg_reserve_bp_slot(struct perf_event *bp); > > > extern int dbg_release_bp_slot(struct perf_event *bp); > > > extern int reserve_bp_slot(struct perf_event *bp); > > > extern void release_bp_slot(struct perf_event *bp); > > > -int hw_breakpoint_weight(struct perf_event *bp); > > > int arch_reserve_bp_slot(struct perf_event *bp); > > > void arch_release_bp_slot(struct perf_event *bp); > > > void arch_unregister_hw_breakpoint(struct perf_event *bp); > > > diff --git a/kernel/events/hw_breakpoint.c b/kernel/events/hw_breakpoint.c > > > index 8e939723f27d..5f40c8dfa042 100644 > > > --- a/kernel/events/hw_breakpoint.c > > > +++ b/kernel/events/hw_breakpoint.c > > > @@ -125,10 +125,12 @@ static __init int init_breakpoint_slots(void) > > > } > > > #endif > > > > > > -__weak int hw_breakpoint_weight(struct perf_event *bp) > > > > Humm... this was added in 2010 and never actually used to return > > anything other than 1 since then (?). Looks like over-design. Maybe we > > drop "#ifndef" and add a comment instead? > > Then there's little reason for the function either and we can just > directly increment/decrement 1 everywhere. If we drop the ability for > an arch to override, I feel that'd be cleaner. > > Either way, codegen won't change though. > > Preferences? I don't have strong preferences either way. Can also be: #define HW_BREAKPOINT_WEIGHT 1