Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp31052ybb; Fri, 27 Mar 2020 15:25:46 -0700 (PDT) X-Google-Smtp-Source: ADFU+vtUSM4ABjHMXnvV0DT7DPK3VLhKBysU8BO0fRPa46I6kbhD7Sgj5HVO2HP4B+MIkcogqpzG X-Received: by 2002:a4a:6f0d:: with SMTP id h13mr1333281ooc.99.1585347945987; Fri, 27 Mar 2020 15:25:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585347945; cv=none; d=google.com; s=arc-20160816; b=SA5P9tBbZ0Qi3lXMsSNLMyWcq8d1fzGqpI84scerWwTDft7wtDvlEW2vgvQP0CjpT7 saoEaYwflSiSEc7Z5WbHCiouICQa+d1XdxsxZ5Dc4g53M5IDlFThO7LhVH3iGht2E525 7WVYx49CLx1fV8gM+4vGWEYNZdFHRTmG5T4jVouD1uHuuXeujC2hzt1t8lu75fh20iAn m07Dx07JuMNqQvxGKzYjG5FNQh6kx04myeaq9REtlo5ByKN87Rntmus+zJX5EusNpDWT ocjX3pYksrKDJsx3k4a06n9ZKaPXt9Ooshfv0FFnQZ8AOPs2SF1QsMY/yTi9bEeUVrRe ljjQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:dkim-signature; bh=eS9PrufsxCY4iVVXjBFohn1mOAZXm/urAh3MenlmF0I=; b=IQkuM6wjVi10UR3cECjdA77RqqQHhMH1SfwE7vdut4tHLmJDtpF9izyqiIbIctgTQo eUiaULY3wsEOyd2VBHcKfPBTZnZAAQTa4FOvdNTv1I0ABps1P4I2prJm4da5Snl/+j0S aYturD9npWppmYRH0zyQKucXWZzRuXOKGvm2qmLQ0r5HueoEvxT7+5tVtlyoVRHvY9zu stYoIb/6BbCtl6TCtoVvNmbfwGY+/TWAYnyj2tZdd7szfLq7Iz0b0CNjdvphgfMAHVb4 mey73ra/Kv7tH2teOj3FiIFyBh18Z4usyUVIi52DpLTtO68ho8KMwqm0w+TNrr1c/vyq P9Ww== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JwdocSBN; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id q206si2971110oig.71.2020.03.27.15.25.32; Fri, 27 Mar 2020 15:25:45 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=JwdocSBN; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727897AbgC0WZK (ORCPT + 99 others); Fri, 27 Mar 2020 18:25:10 -0400 Received: from mail.kernel.org ([198.145.29.99]:42952 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727821AbgC0WZE (ORCPT ); Fri, 27 Mar 2020 18:25:04 -0400 Received: from paulmck-ThinkPad-P72.home (50-39-105-78.bvtn.or.frontiernet.net [50.39.105.78]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 86ABC215A4; Fri, 27 Mar 2020 22:25:03 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1585347903; bh=WXBO/ZPMHex+UqRjYTFM44vWqG22vzMF6bLX0ctGiVU=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=JwdocSBNZLDugUS6nKYjuf8tbC0aA8MrJcmtm0NxZk3yqJbvBMk4bD/3SK4ki70fh x+/qvqeAFIX4NIB3BwiZyVHzWdT+14YCBeJwB3jtAewhNddoX11NchmJOf5zOE61Xp EDE5NkiOaQQ7R95usAXzZS7Jlrmu9/xV2AjyKZNk= From: paulmck@kernel.org To: rcu@vger.kernel.org Cc: linux-kernel@vger.kernel.org, kernel-team@fb.com, mingo@kernel.org, jiangshanlai@gmail.com, dipankar@in.ibm.com, akpm@linux-foundation.org, mathieu.desnoyers@efficios.com, josh@joshtriplett.org, tglx@linutronix.de, peterz@infradead.org, rostedt@goodmis.org, dhowells@redhat.com, edumazet@google.com, fweisbec@gmail.com, oleg@redhat.com, joel@joelfernandes.org, "Paul E. McKenney" Subject: [PATCH v3 tip/core/rcu 16/34] rcu-tasks: Add stall warnings for RCU Tasks Trace Date: Fri, 27 Mar 2020 15:24:38 -0700 Message-Id: <20200327222456.12470-16-paulmck@kernel.org> X-Mailer: git-send-email 2.9.5 In-Reply-To: <20200327222346.GA12082@paulmck-ThinkPad-P72> References: <20200327222346.GA12082@paulmck-ThinkPad-P72> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: "Paul E. McKenney" This commit adds RCU CPU stall warnings for RCU Tasks Trace. These dump out any tasks blocking the current grace period, as well as any CPUs that have not responded to an IPI request. This happens in two phases, when initially extracting state from the tasks and later when waiting for any holdout tasks to check in. Signed-off-by: Paul E. McKenney --- kernel/rcu/tasks.h | 70 ++++++++++++++++++++++++++++++++++++++++++++++++++---- 1 file changed, 66 insertions(+), 4 deletions(-) diff --git a/kernel/rcu/tasks.h b/kernel/rcu/tasks.h index a5ed7e2..fc7f116 100644 --- a/kernel/rcu/tasks.h +++ b/kernel/rcu/tasks.h @@ -794,9 +794,41 @@ static void rcu_tasks_trace_postscan(void) // Any tasks that exit after this point will set ->trc_reader_checked. } +/* Show the state of a task stalling the current RCU tasks trace GP. */ +static void show_stalled_task_trace(struct task_struct *t, bool *firstreport) +{ + int cpu; + + if (*firstreport) { + pr_err("INFO: rcu_tasks_trace detected stalls on tasks:\n"); + *firstreport = false; + } + // FIXME: This should attempt to use try_invoke_on_nonrunning_task(). + cpu = task_cpu(t); + pr_alert("P%d: %c%c%c nesting: %d%c cpu: %d\n", + t->pid, + ".I"[READ_ONCE(t->trc_ipi_to_cpu) > 0], + ".i"[is_idle_task(t)], + ".N"[cpu > 0 && tick_nohz_full_cpu(cpu)], + t->trc_reader_nesting, + " N"[!!t->trc_reader_need_end], + cpu); + sched_show_task(t); +} + +/* List stalled IPIs for RCU tasks trace. */ +static void show_stalled_ipi_trace(void) +{ + int cpu; + + for_each_possible_cpu(cpu) + if (per_cpu(trc_ipi_to_cpu, cpu)) + pr_alert("\tIPI outstanding to CPU %d\n", cpu); +} + /* Do one scan of the holdout list. */ static void check_all_holdout_tasks_trace(struct list_head *hop, - bool ndrpt, bool *frptp) + bool needreport, bool *firstreport) { struct task_struct *g, *t; @@ -809,21 +841,51 @@ static void check_all_holdout_tasks_trace(struct list_head *hop, // If check succeeded, remove this task from the list. if (READ_ONCE(t->trc_reader_checked)) trc_del_holdout(t); + else if (needreport) + show_stalled_task_trace(t, firstreport); + } + if (needreport) { + if (firstreport) + pr_err("INFO: rcu_tasks_trace detected stalls?\n"); + show_stalled_ipi_trace(); } } /* Wait for grace period to complete and provide ordering. */ static void rcu_tasks_trace_postgp(void) { + bool firstreport; + struct task_struct *g, *t; + LIST_HEAD(holdouts); + long ret; + // Remove the safety count. smp_mb__before_atomic(); // Order vs. earlier atomics atomic_dec(&trc_n_readers_need_end); smp_mb__after_atomic(); // Order vs. later atomics // Wait for readers. - wait_event_idle_exclusive(trc_wait, - atomic_read(&trc_n_readers_need_end) == 0); - + for (;;) { + ret = wait_event_idle_exclusive_timeout( + trc_wait, + atomic_read(&trc_n_readers_need_end) == 0, + READ_ONCE(rcu_task_stall_timeout)); + if (ret) + break; // Count reached zero. + for_each_process_thread(g, t) + if (READ_ONCE(t->trc_reader_need_end)) + trc_add_holdout(t, &holdouts); + firstreport = true; + list_for_each_entry_safe(t, g, &holdouts, trc_holdout_list) + if (READ_ONCE(t->trc_reader_need_end)) { + show_stalled_task_trace(t, &firstreport); + trc_del_holdout(t); + } + if (firstreport) + pr_err("INFO: rcu_tasks_trace detected stalls?\n"); + show_stalled_ipi_trace(); + pr_err("\t%d holdouts\n", atomic_read(&trc_n_readers_need_end)); + } smp_mb(); // Caller's code must be ordered after wakeup. } -- 2.9.5