From: Joel Fernandes
Date: Fri, 19 Jul 2019 16:33:56 -0400
Subject: Re: [PATCH] rcu: Make jiffies_till_sched_qs writable
To: "Paul E. McKenney"
Cc: Byungchul Park, Byungchul Park, rcu, LKML, kernel-team@lge.com
In-Reply-To: <20190719195728.GF14271@linux.ibm.com>
References: <20190713151330.GE26519@linux.ibm.com>
 <20190713154257.GE133650@google.com>
 <20190713174111.GG26519@linux.ibm.com>
 <20190719003942.GA28226@X58A-UD3R>
 <20190719074329.GY14271@linux.ibm.com>
 <20190719195728.GF14271@linux.ibm.com>

On Fri, Jul 19, 2019 at 3:57 PM Paul E. McKenney wrote:
>
> On Fri, Jul 19, 2019 at 06:57:58PM +0900, Byungchul Park wrote:
> > On Fri, Jul 19, 2019 at 4:43 PM Paul E. McKenney wrote:
> > >
> > > On Thu, Jul 18, 2019 at 08:52:52PM -0400, Joel Fernandes wrote:
> > > > On Thu, Jul 18, 2019 at 8:40 PM Byungchul Park wrote:
> > > > [snip]
> > > > > > - There is a bug in the CPU stopper machinery itself preventing it
> > > > > > from scheduling the stopper on Y. Even though Y is not holding up the
> > > > > > grace period.
> > > > >
> > > > > Or any thread on Y is busy with preemption/irq disabled preventing the
> > > > > stopper from being scheduled on Y.
> > > > >
> > > > > Or something is stuck in ttwu() to wake up the stopper on Y due to any
> > > > > scheduler locks such as pi_lock or rq->lock or something.
> > > > >
> > > > > I think what you mentioned can happen easily.
> > > > >
> > > > > Basically we would need information about preemption/irq disabled
> > > > > sections on Y and scheduler's current activity on every cpu at that time.
> > > >
> > > > I think all that's needed is an NMI backtrace on all CPUs. On ARM we
> > > > don't have NMI solutions and only IPI or interrupt based backtrace
> > > > works, which should at least catch the preempt disable and softirq
> > > > disable cases.
> > >
> > > True, though people with systems having hundreds of CPUs might not
> > > thank you for forcing an NMI backtrace on each of them. Is it possible
> > > to NMI only the ones that are holding up the CPU stopper?
> >
> > What a good idea! I think it's possible!
> >
> > But we need to think about the case where the NMI doesn't work when the
> > holdup was caused by IRQs being disabled.
> >
> > Though the weekend is just around the corner, I will keep thinking
> > about it over the weekend!
>
> Very good!

I will think more about it too ;-) Agreed with the point about the
100s-of-CPUs use case.

Thanks, have a great weekend,

 - Joel