Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp5607971imw; Wed, 20 Jul 2022 08:52:08 -0700 (PDT) X-Google-Smtp-Source: AGRyM1vBgiM2PywdYqUIpYb+C0v4DK91L7pwyrJXCTrNkdYhlgeDB/HtmnB2iF+mHqOf98kpNrVt X-Received: by 2002:a17:907:75d7:b0:72b:307d:fb60 with SMTP id jl23-20020a17090775d700b0072b307dfb60mr36326040ejc.22.1658332310248; Wed, 20 Jul 2022 08:51:50 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658332310; cv=none; d=google.com; s=arc-20160816; b=jxFLrD52I+J2LOu27CCSkjQ5+1xsFQnBIdrWvBgK+FdFRDW5G5TIPAm6VwRvKbErLB kh1a+q8ruePPJYabEMC/Px2rkKxH7WdBfr7IdF5gpWB02JoPMzoxxKG2ZUPUH4TrBUDa 0OVv3lxnACP+9NOZQilww4iPDgkHl0dtRlKi+gYWIyvZ163eKe612+adHNuyUTQPHVCp p/8WuUvEOU+FHRj2gdI6wMAF2wknB09gSwY3iS2QswP97IIgjtrpO+f4oSgHFfLURsU8 8xDm4ucOAS8KYSwrgNl8zwXCyPCYUWakM22tgdjJaYuoTS3NxtYp1Qg05XUDTrCleVHb 5nZg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=srzHwQjnGGGl03nd5STwWyJmD4z8xROiVMWo0kyoayo=; b=Me9G61rcSyh47gEPGkSBmzsiuRRVYDebY5yKKTeVLgF5Get3/plXHwgPNyOwTRlQby M7cf51vYZndK1JMY4n20WYWIpP4/736LTGtSLgO0WDZNhA1/CHC3raasavslxzMKjqAV ittqMDnaGy6PbFIDJS2D/9J05lfR0xXgDLPYgCGSgJfeCBgWD7eqQGR5Z31UbE9t0PsE wauwSWZQpP4D+1NAhQoQPurGFXTaIFtCgxoi7EGUeKJhT6eL+V2PzIjrNJWrl8CZiRvI gbVCmtgnLvge6q9FelZ6Qcpe+2ZdNhllrjXLfoKooIbHTqwRUhl0BqWc44myZoj7Z8lv /gYg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=f1+SWiW7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id u13-20020a50d50d000000b0043a1cbc52e4si24253507edi.497.2022.07.20.08.51.24; Wed, 20 Jul 2022 08:51:50 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20210112 header.b=f1+SWiW7; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230451AbiGTPfh (ORCPT + 99 others); Wed, 20 Jul 2022 11:35:37 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44660 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229677AbiGTPfg (ORCPT ); Wed, 20 Jul 2022 11:35:36 -0400 Received: from mail-wm1-x32b.google.com (mail-wm1-x32b.google.com [IPv6:2a00:1450:4864:20::32b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 588C025EBD for ; Wed, 20 Jul 2022 08:35:34 -0700 (PDT) Received: by mail-wm1-x32b.google.com with SMTP id f24-20020a1cc918000000b003a30178c022so1589641wmb.3 for ; Wed, 20 Jul 2022 08:35:34 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=srzHwQjnGGGl03nd5STwWyJmD4z8xROiVMWo0kyoayo=; b=f1+SWiW78i/6mjUfTRlRYvayvxkg7XVTB+MASYrMVvi98wATKmL6cn5wW2O0IzTMT8 /JBKPSBsA4j4IX4bVwXqUfD4ylkNbhd2WIa/kf8BWUe4wvV4OYJyn1oeoW/E/CHxOZjR /WgqDi/D4jET4n2/XIG2Ee44zcUAyHvyBbi9yf8pK8OE9J5pOLPoWHae3gO18zm8ty+q S8t5OzuVtRyLlaRGYebkxN1E9XC8kojcx5oL8heN8Z58YLRAR7pWiaNPS359vKhdsDE6 6i2/yiXqXoCV/ocImEiNN71XNSRfmCkBJ31Kzb3aGnvAvr6K/kmHNZ4g7N2YQ/XxGIqq xf3A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=srzHwQjnGGGl03nd5STwWyJmD4z8xROiVMWo0kyoayo=; b=Fe7FNw6y/J2eJBSo7NM6nHw4TVhEtX7ndY8NafhZBoux7jruYZliaLmViBR9+FPy07 HzrQGpmYE4aPfattolrx2tosfsXrHEj2Rx2nLJ/X6xe/u5XCHB/9vY0UOWUQv4Jd0TkY jJouMus9sUZ433EiBPlkNQl57qRbD1zXu4RrhEXwuIHXk/dtOTo1Q1nnQkbIPw18VtvS yxyulK69rMkWhXOMNiW7JVTYVi0Pfg46hIot8Og62qyqu9jGXmapgnIp0dY89VmRtIwv o6IPTe8ygXYKcYp9Mx5bezDXIysDFz1IFsZPwNW/I2+03i1+fQ7PJKloclNcBsXJMfV7 zV8w== X-Gm-Message-State: AJIora/xCtU4vdrid8BgqYzuO6puOZdK22QMyvlMEKkEkY/ce4gvtkMx LTpPnTeKFa58WDJR8PD0ZUU50yklYX4fAtSAQaEfLQ== X-Received: by 2002:a05:600c:19d2:b0:3a3:2cdb:cc02 with SMTP id u18-20020a05600c19d200b003a32cdbcc02mr1939558wmq.182.1658331332788; Wed, 20 Jul 2022 08:35:32 -0700 (PDT) MIME-Version: 1.0 References: <20220704150514.48816-1-elver@google.com> <20220704150514.48816-10-elver@google.com> In-Reply-To: <20220704150514.48816-10-elver@google.com> From: Ian Rogers Date: Wed, 20 Jul 2022 08:35:20 -0700 Message-ID: Subject: Re: [PATCH v3 09/14] powerpc/hw_breakpoint: Avoid relying on caller synchronization To: Marco Elver Cc: Peter Zijlstra , Frederic Weisbecker , Ingo Molnar , Thomas Gleixner , Arnaldo Carvalho de Melo , Mark Rutland , Alexander Shishkin , Jiri Olsa , Namhyung Kim , Dmitry Vyukov , Michael Ellerman , linuxppc-dev@lists.ozlabs.org, linux-perf-users@vger.kernel.org, x86@kernel.org, linux-sh@vger.kernel.org, kasan-dev@googlegroups.com, linux-kernel@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-17.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF, ENV_AND_HDR_SPF_MATCH,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS, USER_IN_DEF_DKIM_WL,USER_IN_DEF_SPF_WL autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Jul 4, 2022 at 8:07 AM Marco Elver wrote: > > Internal data structures (cpu_bps, task_bps) of powerpc's hw_breakpoint > implementation have relied on nr_bp_mutex serializing access to them. > > Before overhauling synchronization of kernel/events/hw_breakpoint.c, > introduce 2 spinlocks to synchronize cpu_bps and task_bps respectively, > thus avoiding reliance on callers synchronizing powerpc's hw_breakpoint. > > Reported-by: Dmitry Vyukov > Signed-off-by: Marco Elver > Acked-by: Dmitry Vyukov Acked-by: Ian Rogers Thanks, Ian > --- > v2: > * New patch. > --- > arch/powerpc/kernel/hw_breakpoint.c | 53 ++++++++++++++++++++++------- > 1 file changed, 40 insertions(+), 13 deletions(-) > > diff --git a/arch/powerpc/kernel/hw_breakpoint.c b/arch/powerpc/kernel/hw_breakpoint.c > index 2669f80b3a49..8db1a15d7acb 100644 > --- a/arch/powerpc/kernel/hw_breakpoint.c > +++ b/arch/powerpc/kernel/hw_breakpoint.c > @@ -15,6 +15,7 @@ > #include > #include > #include > +#include > #include > #include > > @@ -129,7 +130,14 @@ struct breakpoint { > bool ptrace_bp; > }; > > +/* > + * While kernel/events/hw_breakpoint.c does its own synchronization, we cannot > + * rely on it safely synchronizing internals here; however, we can rely on it > + * not requesting more breakpoints than available. > + */ > +static DEFINE_SPINLOCK(cpu_bps_lock); > static DEFINE_PER_CPU(struct breakpoint *, cpu_bps[HBP_NUM_MAX]); > +static DEFINE_SPINLOCK(task_bps_lock); > static LIST_HEAD(task_bps); > > static struct breakpoint *alloc_breakpoint(struct perf_event *bp) > @@ -174,7 +182,9 @@ static int task_bps_add(struct perf_event *bp) > if (IS_ERR(tmp)) > return PTR_ERR(tmp); > > + spin_lock(&task_bps_lock); > list_add(&tmp->list, &task_bps); > + spin_unlock(&task_bps_lock); > return 0; > } > > @@ -182,6 +192,7 @@ static void task_bps_remove(struct perf_event *bp) > { > struct list_head *pos, *q; > > + spin_lock(&task_bps_lock); > list_for_each_safe(pos, q, &task_bps) { > struct breakpoint *tmp = list_entry(pos, struct breakpoint, list); > > @@ -191,6 +202,7 @@ static void task_bps_remove(struct perf_event *bp) > break; > } > } > + spin_unlock(&task_bps_lock); > } > > /* > @@ -200,12 +212,17 @@ static void task_bps_remove(struct perf_event *bp) > static bool all_task_bps_check(struct perf_event *bp) > { > struct breakpoint *tmp; > + bool ret = false; > > + spin_lock(&task_bps_lock); > list_for_each_entry(tmp, &task_bps, list) { > - if (!can_co_exist(tmp, bp)) > - return true; > + if (!can_co_exist(tmp, bp)) { > + ret = true; > + break; > + } > } > - return false; > + spin_unlock(&task_bps_lock); > + return ret; > } > > /* > @@ -215,13 +232,18 @@ static bool all_task_bps_check(struct perf_event *bp) > static bool same_task_bps_check(struct perf_event *bp) > { > struct breakpoint *tmp; > + bool ret = false; > > + spin_lock(&task_bps_lock); > list_for_each_entry(tmp, &task_bps, list) { > if (tmp->bp->hw.target == bp->hw.target && > - !can_co_exist(tmp, bp)) > - return true; > + !can_co_exist(tmp, bp)) { > + ret = true; > + break; > + } > } > - return false; > + spin_unlock(&task_bps_lock); > + return ret; > } > > static int cpu_bps_add(struct perf_event *bp) > @@ -234,6 +256,7 @@ static int cpu_bps_add(struct perf_event *bp) > if (IS_ERR(tmp)) > return PTR_ERR(tmp); > > + spin_lock(&cpu_bps_lock); > cpu_bp = per_cpu_ptr(cpu_bps, bp->cpu); > for (i = 0; i < nr_wp_slots(); i++) { > if (!cpu_bp[i]) { > @@ -241,6 +264,7 @@ static int cpu_bps_add(struct perf_event *bp) > break; > } > } > + spin_unlock(&cpu_bps_lock); > return 0; > } > > @@ -249,6 +273,7 @@ static void cpu_bps_remove(struct perf_event *bp) > struct breakpoint **cpu_bp; > int i = 0; > > + spin_lock(&cpu_bps_lock); > cpu_bp = per_cpu_ptr(cpu_bps, bp->cpu); > for (i = 0; i < nr_wp_slots(); i++) { > if (!cpu_bp[i]) > @@ -260,19 +285,25 @@ static void cpu_bps_remove(struct perf_event *bp) > break; > } > } > + spin_unlock(&cpu_bps_lock); > } > > static bool cpu_bps_check(int cpu, struct perf_event *bp) > { > struct breakpoint **cpu_bp; > + bool ret = false; > int i; > > + spin_lock(&cpu_bps_lock); > cpu_bp = per_cpu_ptr(cpu_bps, cpu); > for (i = 0; i < nr_wp_slots(); i++) { > - if (cpu_bp[i] && !can_co_exist(cpu_bp[i], bp)) > - return true; > + if (cpu_bp[i] && !can_co_exist(cpu_bp[i], bp)) { > + ret = true; > + break; > + } > } > - return false; > + spin_unlock(&cpu_bps_lock); > + return ret; > } > > static bool all_cpu_bps_check(struct perf_event *bp) > @@ -286,10 +317,6 @@ static bool all_cpu_bps_check(struct perf_event *bp) > return false; > } > > -/* > - * We don't use any locks to serialize accesses to cpu_bps or task_bps > - * because are already inside nr_bp_mutex. > - */ > int arch_reserve_bp_slot(struct perf_event *bp) > { > int ret; > -- > 2.37.0.rc0.161.g10f37bed90-goog >