Received: by 2002:a05:6a10:6d10:0:0:0:0 with SMTP id gq16csp1330910pxb; Sat, 23 Apr 2022 03:53:28 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxGyl9rD/47NgrARQ7qFzkLE1ofuF8mEdOI2lTQZSeYkJ2/clW+qo7SpqZAb653qdimSZ7j X-Received: by 2002:a05:6a00:134f:b0:4fb:1307:ceed with SMTP id k15-20020a056a00134f00b004fb1307ceedmr9532178pfu.23.1650711207970; Sat, 23 Apr 2022 03:53:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1650711207; cv=none; d=google.com; s=arc-20160816; b=iG/S3ULMQjQ1BGXnApwapzehlLYUHmXX0nbwjTd8BKW455nh4L7KSN5SKuAF3Vz4FW +gK2CGk6uiaGlenk5G9K4sTc4iB08vtHYpazloxZH4KFRMpAYyFZqj0gviGAR2onLLMS in7RJ2Rt3k0Qzx+44a4mVe4QjcAtK/4cNtRg9BTSwz9Zj+TRBunMRLbB2WPsNBEs+cys s5DgHg0piClFMqC5DADgwsKMs/Pe0aqIDKA3Vs8LFsKVkLUDg/Alh8MSfjGYtDYFfKuZ 57cVGdN105807OtA9eZR6esIIO2itIK2LSKHGvTaL+ldC+hlxZ1+MzNMA67JWfT4iLky 6w9g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from:dkim-signature; bh=6cGUJu+eHH+ea/OTC0qPTaxvEClDPRO4bc57tjeoVRM=; b=ARbedTncFp2xq/+MojAjyg9jNC1ZYnv8rdL5DTHvZ4n0lJ9eIuJHLH47rvf7i06rwo 9Bkd1qXCMLu+jPBK65ciW936j5yimpYm/NTiAFgX1K1zxkYMdoQSVRLwdSdyQ/Xq7XKq 229PWqBKq+EqWQCQP5kVmrx+OeNRiJ7DB1X02xvyYNm09MGNRSxpNFy5CfVJkWSG/ga2 XZ8v7NHtTYlOta4GYbsBnmihrou4tDBvznaYC3ecZ/47Ww5Xmadrkf0wOd2SvYekrE1y 1OijEt1yT/yq7e5x/DO6/g7MvqJdqlVyjgWEns44j4ebbCC48QedkSshOdUe83g1rnpM JpQw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=I5PRpDoy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id k2-20020a170902c40200b00158f23ac798si331144plk.128.2022.04.23.03.52.38; Sat, 23 Apr 2022 03:53:27 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20210112 header.b=I5PRpDoy; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234985AbiDWKbu (ORCPT + 99 others); Sat, 23 Apr 2022 06:31:50 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:53572 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S233643AbiDWKbs (ORCPT ); Sat, 23 Apr 2022 06:31:48 -0400 Received: from mail-pg1-x530.google.com (mail-pg1-x530.google.com [IPv6:2607:f8b0:4864:20::530]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 152521BD59A; Sat, 23 Apr 2022 03:28:51 -0700 (PDT) Received: by mail-pg1-x530.google.com with SMTP id 15so989397pgf.4; Sat, 23 Apr 2022 03:28:51 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20210112; h=from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=6cGUJu+eHH+ea/OTC0qPTaxvEClDPRO4bc57tjeoVRM=; b=I5PRpDoyuKWFR+rJGHKZWkE8zeUBF5qYUQhQgc1DYuP6JSryBkNa709nKUwaWlA+u7 N+KRF5znja9lAzqs3ORSzlnA7DvVuTS/gxbic6g1Fw7j6E4Hp5N2WYqjTo1AhKsJVkks oVHMqWdv7Js8t74swgOuuUkGDYfWesB/9ft7h1PeZYFXrGrIczAmPAVrM2tPs5CK1Rdl rJrBZvDbBi263IA8ndp/imReOZCjFxtnMZrDuSfaqDcmgqNj4y8+E3kgg0IgLF6EzdPx HZq3jazJ/qzinU+WvTaTuFgpZ03nKtupIXt0vsCQbICn580uBMetSUIciKNjvSFRSp15 gH0w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:date:message-id:mime-version :content-transfer-encoding; bh=6cGUJu+eHH+ea/OTC0qPTaxvEClDPRO4bc57tjeoVRM=; b=NlaqPGxkFFw51vp8pbwsYs250NX3aYkVGqjdFe+AL2anp1+GriIwmOT3vUvLqPa8Tl oM270RZddyVhi4vwit4mFp2GVHCzQJOiocIKJBhn18r/v8dY3R0if54eINhr4tE+ALcf GEPC/yQxPFBoRJPDwJ03mehmTIUN686V6Tx2wIBYM2qVdpjQ4bJQW/CSE77Pfc/nMYkr ucmKFauPPgHKmAnYBTcFhHZAYIpPUoYvQTfSHobtOyqB04V/xF5Pb9RcGtYV44mQpS1V MaP78sTwx5JF6jx8myi3uK3YRCkB/VNJUwqd+q2l5gdBlJt+ogNY5RCQkX/Ww19bC1SA UJrg== X-Gm-Message-State: AOAM531pOlfp42krsU32N7cf5ZKMJobH/0RjsKsiUITltaU+lySI38vB Q/zTEjeBeEZ77q7arRu50eU= X-Received: by 2002:a63:3841:0:b0:399:26da:29af with SMTP id h1-20020a633841000000b0039926da29afmr7325069pgn.489.1650709730497; Sat, 23 Apr 2022 03:28:50 -0700 (PDT) Received: from localhost ([101.86.206.159]) by smtp.gmail.com with ESMTPSA id d14-20020a62f80e000000b0050d2671b11bsm1101522pfh.53.2022.04.23.03.28.49 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sat, 23 Apr 2022 03:28:50 -0700 (PDT) From: Patrick Wang To: paulmck@kernel.org, rostedt@goodmis.org, frederic@kernel.org, quic_neeraju@quicinc.com, josh@joshtriplett.org, mathieu.desnoyers@efficios.com, jiangshanlai@gmail.com, joel@joelfernandes.org Cc: rcu@vger.kernel.org, linux-kernel@vger.kernel.org, patrick.wang.shcn@gmail.com Subject: [PATCH v2] rcu: ftrace: avoid tracing a few functions executed in stop machine Date: Sat, 23 Apr 2022 18:28:44 +0800 Message-Id: <20220423102844.1371595-1-patrick.wang.shcn@gmail.com> X-Mailer: git-send-email 2.25.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,FREEMAIL_FROM, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Because of the change of stop machine implementation, there are functions being called while waiting currently: ---------------------------------------------------------------- Former stop machine wait loop: do { cpu_relax(); => macro ... } while (curstate != STOPMACHINE_EXIT); ----------------------------------------------------------------- Current stop machine wait loop: do { stop_machine_yield(cpumask); => function (notraced) ... touch_nmi_watchdog(); => function (notraced, inside calls also notraced) ... rcu_momentary_dyntick_idle(); => function (notraced, inside calls traced) } while (curstate != MULTI_STOP_EXIT); ------------------------------------------------------------------ These functions (including the calls inside) should be marked notrace to avoid their codes being updated when they are being called. The calls inside rcu_momentary_dyntick_idle() still remain traced, and will cause crash: rcu: INFO: rcu_preempt detected stalls on CPUs/tasks: rcu: 1-...!: (0 ticks this GP) idle=14f/1/0x4000000000000000 softirq=3397/3397 fqs=0 rcu: 3-...!: (0 ticks this GP) idle=ee9/1/0x4000000000000000 softirq=5168/5168 fqs=0 (detected by 0, t=8137 jiffies, g=5889, q=2 ncpus=4) Task dump for CPU 1: task:migration/1 state:R running task stack: 0 pid: 19 ppid: 2 flags:0x00000000 Stopper: multi_cpu_stop+0x0/0x18c <- stop_machine_cpuslocked+0x128/0x174 Call Trace: Task dump for CPU 3: task:migration/3 state:R running task stack: 0 pid: 29 ppid: 2 flags:0x00000000 Stopper: multi_cpu_stop+0x0/0x18c <- stop_machine_cpuslocked+0x128/0x174 Call Trace: rcu: rcu_preempt kthread timer wakeup didn't happen for 8136 jiffies! g5889 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 rcu: Possible timer handling issue on cpu=2 timer-softirq=594 rcu: rcu_preempt kthread starved for 8137 jiffies! g5889 f0x0 RCU_GP_WAIT_FQS(5) ->state=0x402 ->cpu=2 rcu: Unless rcu_preempt kthread gets sufficient CPU time, OOM is now expected behavior. rcu: RCU grace-period kthread stack dump: task:rcu_preempt state:I stack: 0 pid: 14 ppid: 2 flags:0x00000000 Call Trace: schedule+0x56/0xc2 schedule_timeout+0x82/0x184 rcu_gp_fqs_loop+0x19a/0x318 rcu_gp_kthread+0x11a/0x140 kthread+0xee/0x118 ret_from_exception+0x0/0x14 rcu: Stack dump where RCU GP kthread last ran: Task dump for CPU 2: task:migration/2 state:R running task stack: 0 pid: 24 ppid: 2 flags:0x00000000 Stopper: multi_cpu_stop+0x0/0x18c <- stop_machine_cpuslocked+0x128/0x174 Call Trace: Mark the calls inside rcu_momentary_dyntick_idle(): rcu_preempt_deferred_qs() rcu_preempt_need_deferred_qs() rcu_preempt_deferred_qs_irqrestore() as notrace to prevent this. Signed-off-by: Patrick Wang --- v1->v2: - Modify log message. kernel/rcu/tree_plugin.h | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/kernel/rcu/tree_plugin.h b/kernel/rcu/tree_plugin.h index c8ba0fe17267..971bb6a00ede 100644 --- a/kernel/rcu/tree_plugin.h +++ b/kernel/rcu/tree_plugin.h @@ -460,7 +460,7 @@ static bool rcu_preempt_has_tasks(struct rcu_node *rnp) * be quite short, for example, in the case of the call from * rcu_read_unlock_special(). */ -static void +notrace static void rcu_preempt_deferred_qs_irqrestore(struct task_struct *t, unsigned long flags) { bool empty_exp; @@ -581,7 +581,7 @@ rcu_preempt_deferred_qs_irqrestore(struct task_struct *t, unsigned long flags) * is disabled. This function cannot be expected to understand these * nuances, so the caller must handle them. */ -static bool rcu_preempt_need_deferred_qs(struct task_struct *t) +notrace static bool rcu_preempt_need_deferred_qs(struct task_struct *t) { return (__this_cpu_read(rcu_data.cpu_no_qs.b.exp) || READ_ONCE(t->rcu_read_unlock_special.s)) && @@ -595,7 +595,7 @@ static bool rcu_preempt_need_deferred_qs(struct task_struct *t) * evaluate safety in terms of interrupt, softirq, and preemption * disabling. */ -static void rcu_preempt_deferred_qs(struct task_struct *t) +notrace static void rcu_preempt_deferred_qs(struct task_struct *t) { unsigned long flags; -- 2.25.1