Received: by 2002:a05:6358:1087:b0:cb:c9d3:cd90 with SMTP id j7csp2652916rwi; Tue, 11 Oct 2022 11:25:06 -0700 (PDT) X-Google-Smtp-Source: AMsMyM4MDYjD8ncNY1znrDdmItHIK8iRGrBGS3ZEjYHopVP0Ufc8vuSQns8IJKE7OklJZ5nxibnb X-Received: by 2002:aa7:c157:0:b0:458:8ef7:eaff with SMTP id r23-20020aa7c157000000b004588ef7eaffmr23396642edp.82.1665512706153; Tue, 11 Oct 2022 11:25:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1665512706; cv=none; d=google.com; s=arc-20160816; b=MFLUaqFIAy6PX+bjkzPa6InBgIdyAAmbiNIC+hnVghSd9jRm+t6+nIpPbX2t/oCruz eYzu76hhE7bwJHOKEMuMwKdnUWFDNfja2sLJyF0eJLVs9MUPIbaiRceyu2opngzGvRAY dEFb4Z+GTauNsv9SwDwLBgF2vPsneMxjO/IhlIwmhXykKja+W7gbcTEuODyHWiSAAaJz O5LtgxB1aHN6u9jLkPm+rhaTxpKEbKe9/EDvvhLfglny9EcVCFH2xbO3+8WEvGl7ZkyW KuTAnDmVGnhaoLU21tF79VBfbuRzmuGH8ougkxCAyuNuqXxggufR3LurYTnmQNFOBtRr TFMg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=plappG9k77YdtIag7crhL391J4nyLb085zqdNeDp75Y=; b=NkZc7cPPcGPwBPoC7hI6wi6LX26HDLxmwpwDZBoc50JMLFOWkxxvTon3ZReXaF78ZO rg2YzcUnHH+z5t+8c2a67P3h1/v/gWltBpxjfaXDMao5g7H58aCY9jWiluRrQIUhZHET YGablPmBEvljFYntmMU2r2P2DeJMnSLPQRk7dsGPiCpapc9eEvNsureDYs5qIf2tzMOU rKls4U9RSARuXEiV+GHblWcljNpPkYrLeKyNoB8rhpsxpW35yYtORzGHlvvGnl+9CY1L OjU2f+SG6LrMZZWZLnvYpUTSdPbUOMo6AP3OEUyY+JQK+JsVrXQKhLW5CahxdbUDVCW/ sb/A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=hdR5ajjx; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ga6-20020a1709070c0600b0076f061ffab4si11951546ejc.51.2022.10.11.11.24.38; Tue, 11 Oct 2022 11:25:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=hdR5ajjx; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230135AbiJKSDL (ORCPT + 99 others); Tue, 11 Oct 2022 14:03:11 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:48176 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230063AbiJKSCf (ORCPT ); Tue, 11 Oct 2022 14:02:35 -0400 Received: from mail-qk1-x72b.google.com (mail-qk1-x72b.google.com [IPv6:2607:f8b0:4864:20::72b]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4A1FC6DAE3 for ; Tue, 11 Oct 2022 11:02:20 -0700 (PDT) Received: by mail-qk1-x72b.google.com with SMTP id z30so8992838qkz.13 for ; Tue, 11 Oct 2022 11:02:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:from:to:cc:subject:date :message-id:reply-to; bh=plappG9k77YdtIag7crhL391J4nyLb085zqdNeDp75Y=; b=hdR5ajjxAIVul5tbbOlnvwSo9I3DYSzxoY1xcR4eb6jwny5twe0ISmcZU/UkvYSarB zPYDTi8JRgtYy+9hdDKCGqtfNv2u7kEvfMptXFynVnjFxBv+Gz4AZ4l3ZmHFHrujL9Sq CVXxzoJQg8eV2S7xKrypckY7eWh/m7S+d6s3s= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=content-transfer-encoding:mime-version:references:in-reply-to :message-id:date:subject:cc:to:from:x-gm-message-state:from:to:cc :subject:date:message-id:reply-to; bh=plappG9k77YdtIag7crhL391J4nyLb085zqdNeDp75Y=; b=J4qDP+SrGzsqY/yXmnxKmFItPmvFKn7UNsqXYlpfPMA2eo6id1tu48qzGehOqVlpO9 6PwiwNdcxyPaTmW0ZprnFflhTiEmo/R4LblKlh0L9vPqof/8o4jwjLwSrDWiltVfQ4cS 97ugh7IzNj06Ct93EbcyhPEzF6NMS2XVut/DdH8nDwPPIv5pyed1qaq/piNjIGcHErIx yIIZeF61fg4ci8roIPEtkPb2reHpISvvrULpx/B8se2R17gKCiRdM2LPzgAIQf3FNYim 8aIKFHqGAUNems2X4cQjnOTrzK+xIiCl4b7nHjj0Sie/Z0fuLfOUqcgoVC7biwK4+2rN yjDA== X-Gm-Message-State: ACrzQf1CFidQEfG+mO7P/S4Szyaqycn9MT9OLaxQYHrko+NTNUL5KtIE uX7IyYbWTOCynK4Lb92F5ToBjQ3Yx7Q7wg== X-Received: by 2002:ac8:5e4a:0:b0:35c:d4ee:532f with SMTP id i10-20020ac85e4a000000b0035cd4ee532fmr20343714qtx.18.1665511329346; Tue, 11 Oct 2022 11:02:09 -0700 (PDT) Received: from joelboxx.c.googlers.com.com (228.221.150.34.bc.googleusercontent.com. [34.150.221.228]) by smtp.gmail.com with ESMTPSA id w22-20020a05620a425600b006cbc00db595sm13464375qko.23.2022.10.11.11.02.08 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 11 Oct 2022 11:02:08 -0700 (PDT) From: "Joel Fernandes (Google)" To: rcu@vger.kernel.org Cc: linux-kernel@vger.kernel.org, rushikesh.s.kadam@intel.com, urezki@gmail.com, neeraj.iitr10@gmail.com, frederic@kernel.org, paulmck@kernel.org, rostedt@goodmis.org, youssefesmat@google.com, surenb@google.com, Joel Fernandes Subject: [PATCH v8 01/13] rcu: Fix missing nocb gp wake on rcu_barrier() Date: Tue, 11 Oct 2022 18:01:30 +0000 Message-Id: <20221011180142.2742289-2-joel@joelfernandes.org> X-Mailer: git-send-email 2.38.0.rc1.362.ged0d419d3c-goog In-Reply-To: <20221011180142.2742289-1-joel@joelfernandes.org> References: <20221011180142.2742289-1-joel@joelfernandes.org> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Frederic Weisbecker Upon entraining a callback to a NOCB CPU, no further wake up is issued on the corresponding nocb_gp kthread. As a result, the callback and all the subsequent ones on that CPU may be ignored, at least until an RCU_NOCB_WAKE_FORCE timer is ever armed or another NOCB CPU belonging to the same group enqueues a callback on an empty queue. Here is a possible bad scenario: 1) CPU 0 is NOCB unlike all other CPUs. 2) CPU 0 queues a callback 2) The grace period related to that callback elapses 3) The callback is moved to the done list (but is not invoked yet), there are no more pending callbacks for CPU 0 4) CPU 1 calls rcu_barrier() and sends an IPI to CPU 0 5) CPU 0 entrains the callback but doesn't wake up nocb_gp 6) CPU 1 blocks forever, unless CPU 0 ever queues enough further callbacks to arm an RCU_NOCB_WAKE_FORCE timer. Make sure the necessary wake up is produced whenever necessary. This is also required to make sure lazy callbacks in future patches don't end up making rcu_barrier() wait for multiple seconds. Reported-by: Joel Fernandes (Google) Fixes: 5d6742b37727 ("rcu/nocb: Use rcu_segcblist for no-CBs CPUs") Signed-off-by: Frederic Weisbecker Signed-off-by: Joel Fernandes (Google) --- kernel/rcu/tree.c | 6 ++++++ kernel/rcu/tree.h | 1 + kernel/rcu/tree_nocb.h | 5 +++++ 3 files changed, 12 insertions(+) diff --git a/kernel/rcu/tree.c b/kernel/rcu/tree.c index 5ec97e3f7468..dc1c502216c7 100644 --- a/kernel/rcu/tree.c +++ b/kernel/rcu/tree.c @@ -3894,6 +3894,8 @@ static void rcu_barrier_entrain(struct rcu_data *rdp) { unsigned long gseq = READ_ONCE(rcu_state.barrier_sequence); unsigned long lseq = READ_ONCE(rdp->barrier_seq_snap); + bool wake_nocb = false; + bool was_alldone = false; lockdep_assert_held(&rcu_state.barrier_lock); if (rcu_seq_state(lseq) || !rcu_seq_state(gseq) || rcu_seq_ctr(lseq) != rcu_seq_ctr(gseq)) @@ -3902,6 +3904,7 @@ static void rcu_barrier_entrain(struct rcu_data *rdp) rdp->barrier_head.func = rcu_barrier_callback; debug_rcu_head_queue(&rdp->barrier_head); rcu_nocb_lock(rdp); + was_alldone = rcu_rdp_is_offloaded(rdp) && !rcu_segcblist_pend_cbs(&rdp->cblist); WARN_ON_ONCE(!rcu_nocb_flush_bypass(rdp, NULL, jiffies)); if (rcu_segcblist_entrain(&rdp->cblist, &rdp->barrier_head)) { atomic_inc(&rcu_state.barrier_cpu_count); @@ -3909,7 +3912,10 @@ static void rcu_barrier_entrain(struct rcu_data *rdp) debug_rcu_head_unqueue(&rdp->barrier_head); rcu_barrier_trace(TPS("IRQNQ"), -1, rcu_state.barrier_sequence); } + wake_nocb = was_alldone && rcu_segcblist_pend_cbs(&rdp->cblist); rcu_nocb_unlock(rdp); + if (wake_nocb) + wake_nocb_gp(rdp, false); smp_store_release(&rdp->barrier_seq_snap, gseq); } diff --git a/kernel/rcu/tree.h b/kernel/rcu/tree.h index d4a97e40ea9c..925dd98f8b23 100644 --- a/kernel/rcu/tree.h +++ b/kernel/rcu/tree.h @@ -439,6 +439,7 @@ static void zero_cpu_stall_ticks(struct rcu_data *rdp); static struct swait_queue_head *rcu_nocb_gp_get(struct rcu_node *rnp); static void rcu_nocb_gp_cleanup(struct swait_queue_head *sq); static void rcu_init_one_nocb(struct rcu_node *rnp); +static bool wake_nocb_gp(struct rcu_data *rdp, bool force); static bool rcu_nocb_flush_bypass(struct rcu_data *rdp, struct rcu_head *rhp, unsigned long j); static bool rcu_nocb_try_bypass(struct rcu_data *rdp, struct rcu_head *rhp, diff --git a/kernel/rcu/tree_nocb.h b/kernel/rcu/tree_nocb.h index f77a6d7e1356..094fd454b6c3 100644 --- a/kernel/rcu/tree_nocb.h +++ b/kernel/rcu/tree_nocb.h @@ -1558,6 +1558,11 @@ static void rcu_init_one_nocb(struct rcu_node *rnp) { } +static bool wake_nocb_gp(struct rcu_data *rdp, bool force) +{ + return false; +} + static bool rcu_nocb_flush_bypass(struct rcu_data *rdp, struct rcu_head *rhp, unsigned long j) { -- 2.38.0.rc1.362.ged0d419d3c-goog