Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp3418912ybi; Fri, 19 Jul 2019 03:00:27 -0700 (PDT) X-Google-Smtp-Source: APXvYqxgy3lV08Aau5Pw/sl4A9rVksRSAD+UT2Vr6BxM1VfdTClUM8eVvtBw3oElpsyFRUfgK+8Q X-Received: by 2002:a63:7d49:: with SMTP id m9mr42820662pgn.161.1563530427607; Fri, 19 Jul 2019 03:00:27 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1563530427; cv=none; d=google.com; s=arc-20160816; b=wXhUfVtytfisZtBSltSD+G2TdPAvNWNkQzEmf23UM1RxJ1Fpx3ZW8WBbEQRX0l7tqK Uaa44HSJ13rYNxiT/uE/z9E35f6/t/b/9mXnvczLBL3yK6lfIIMm+XkiZzqPylPX510d 7ZokvXWOEldxQXi99aHPwQBWQ4J3H1MRVG7AtrJW6m1XZLHBIxfkVYc0iKdFar6hDCdn dtqgUebipP6d4M+gdcv0IpfjWBuj5J68WIPwL0if3zUySPSuegISnsbGVkWkK2RG3TmD dAqWDLHrcOeeyXEJhNVU9usgrTpH/BfXmg0kvUQlumvi7Fh78IlW6VE2c2dQ8hiisTBN SMWQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=2gpYE+zJ6EXcLs21MOYZokgRLOUE5PGmGdhYUwcBgjU=; b=G0nrjH/UKc/F2Ob4KLcZJEvOWgSQpEJLwJhP8pySKR7HX227pg8vOFyFPuInmohOfW H5wW7WYeVJ+vHSsSWRy76/12WYVbCCGD+Slfil6Cx6K4js04CAnwtyf3OrdPLVrx0e+O 9QhCNVaBbh/IYvXhBXlKFGuZk3weZihd4NcPP19QEIxUBGEhPyG/Z2Nyf6+/dXVuw61V PY7v+qEggSmEBdlBAQ7cAfpdli/LyG33QWQm3Zm1ECSMHc+PENmkDkPOtiUIEFfQQjlK 99U/B2aOYA39CimuMtFv0weBl7IyUyK7CrQ6RLXmOybxcrFYmjQOJtUpuv7C+vVN+hJv XVjA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=rTW8MZ4x; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i15si2104107pfd.259.2019.07.19.03.00.11; Fri, 19 Jul 2019 03:00:27 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=rTW8MZ4x; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726339AbfGSJ6L (ORCPT + 99 others); Fri, 19 Jul 2019 05:58:11 -0400 Received: from mail-ed1-f67.google.com ([209.85.208.67]:41924 "EHLO mail-ed1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725794AbfGSJ6L (ORCPT ); Fri, 19 Jul 2019 05:58:11 -0400 Received: by mail-ed1-f67.google.com with SMTP id p15so33941765eds.8; Fri, 19 Jul 2019 02:58:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=2gpYE+zJ6EXcLs21MOYZokgRLOUE5PGmGdhYUwcBgjU=; b=rTW8MZ4xBHDd/sC6TSsfB+StXORNzg7HRiHx1YbH0m4LtNTF6MrW/HUhQbrtZ6DkIK fkKCaXYur2JwBI1zyBUvUFcjMLrUqZWzcNGu//qOpVaRciPFjjG+AmeU84nNeuokIoRV 310WML2iuWQenrkUXWsMF4KfGBfJDyfROXTZld/U3aBo9clcWlPRuFjmmawow6r82L31 jfQCGR9rRt/G/hyx49P9BKHxJNANxhN/dfz/B3IuPlWTHElL3o1Yc8eBBYHRdFuoODDR 8iICTsBnuzLTXtrtaDn+F6oQydOp8Jy5+Z0eCRl7LAOLNrTVRF01VTWvS57YP1491P42 eDbQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=2gpYE+zJ6EXcLs21MOYZokgRLOUE5PGmGdhYUwcBgjU=; b=cH24Ng4p7xxDlV3HyzpBXrvnLTe1yY+yMjcCPWfguU/tB/bmNtCumtDAWehP3grth9 1V7uS8w/NsVCBH5Dq0BzdqJ6t/SVEmeyuKMzlgE3apGZl4TQ7a74NwlJgIpXKAHYlYaJ OVTazzkH5T9dai9uNre563O8L/eoTRKYZsaxu5PvdrCrZUueIstXE5KPT3F5Hn4jIXRo TKBb+7VWQKITtjbdqAZ2VfyE98t8bpYPru5i8iSzOYp1a/fzDxyCUH4jdSwKZTUPKTQI lICMa0e39QvDwsfLSFCMzUZZgtm10i74Rejt9hjIwONmp1ZMJXzdu6Uxkui0sDtqSqDJ h4tw== X-Gm-Message-State: APjAAAUunGrDiXrZucz/S6psh5mK0Cz9stnZjLHzZNFF6ZXlW5DWZ9ox WK1OtDhHCmQ5x83btQrFXGmxSv5Shyq87+OOQ9o= X-Received: by 2002:a17:906:9447:: with SMTP id z7mr40297534ejx.165.1563530289559; Fri, 19 Jul 2019 02:58:09 -0700 (PDT) MIME-Version: 1.0 References: <20190712063240.GD7702@X58A-UD3R> <20190712125116.GB92297@google.com> <20190713151330.GE26519@linux.ibm.com> <20190713154257.GE133650@google.com> <20190713174111.GG26519@linux.ibm.com> <20190719003942.GA28226@X58A-UD3R> <20190719074329.GY14271@linux.ibm.com> In-Reply-To: <20190719074329.GY14271@linux.ibm.com> From: Byungchul Park Date: Fri, 19 Jul 2019 18:57:58 +0900 Message-ID: Subject: Re: [PATCH] rcu: Make jiffies_till_sched_qs writable To: "Paul E. McKenney" Cc: Joel Fernandes , Byungchul Park , rcu , LKML , kernel-team@lge.com Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Jul 19, 2019 at 4:43 PM Paul E. McKenney wrote: > > On Thu, Jul 18, 2019 at 08:52:52PM -0400, Joel Fernandes wrote: > > On Thu, Jul 18, 2019 at 8:40 PM Byungchul Park wrote: > > [snip] > > > > - There is a bug in the CPU stopper machinery itself preventing it > > > > from scheduling the stopper on Y. Even though Y is not holding up the > > > > grace period. > > > > > > Or any thread on Y is busy with preemption/irq disabled preventing the > > > stopper from being scheduled on Y. > > > > > > Or something is stuck in ttwu() to wake up the stopper on Y due to any > > > scheduler locks such as pi_lock or rq->lock or something. > > > > > > I think what you mentioned can happen easily. > > > > > > Basically we would need information about preemption/irq disabled > > > sections on Y and scheduler's current activity on every cpu at that time. > > > > I think all that's needed is an NMI backtrace on all CPUs. An ARM we > > don't have NMI solutions and only IPI or interrupt based backtrace > > works which should at least catch and the preempt disable and softirq > > disable cases. > > True, though people with systems having hundreds of CPUs might not > thank you for forcing an NMI backtrace on each of them. Is it possible > to NMI only the ones that are holding up the CPU stopper? What a good idea! I think it's possible! But we need to think about the case NMI doesn't work when the holding-up was caused by IRQ disabled. Though it's just around the corner of weekend, I will keep thinking on it during weekend! Thanks, Byungchul > Thanx, Paul > > > But yeah I don't see why just the stacks of those CPUs that are > > blocking the CPU X would not suffice for the trivial cases where a > > piece of misbehaving code disable interrupts / preemption and > > prevented the stopper thread from executing. > > > > May be once the test case is ready (no rush!) , then it will be more > > clear what can help. > > > > J. > > -- Thanks, Byungchul