Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp2963312ybi; Thu, 18 Jul 2019 18:13:26 -0700 (PDT) X-Google-Smtp-Source: APXvYqxI+zIJzaQOe/Y99ep5ysmhHiy6hosPlmyln9vmir7K7V42iV/bsgg6oVjOsFB1K+RAoTR6 X-Received: by 2002:a17:90a:a613:: with SMTP id c19mr55476599pjq.17.1563498806473; Thu, 18 Jul 2019 18:13:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1563498806; cv=none; d=google.com; s=arc-20160816; b=HgnG/aSEJ9JC0nWc8TeOlTrc/lNQ+ATxLMbyKeTkhNJvHT5hnaDxWZBf+8g2Ne6qnp tDZ3B0U+I+XdlwN0tl7/cJtEJQIvkgpBHTGBFmL8F52sMpyUj5MlsqDGJeIArLX5QWJg tU2FQXu4EoU6b0GUbcTskGD5tqHNo5O0cyA0AzJjt87L1xqQdEAbHrL2MjVR996KuGhk 3bU8yF5lfuzNfR4V9FGkB6+rxNQ8ScArp+knkl3Pg1Q6u7F21TLzrvzwTLOXe0yiF63h y7QknUKdE1fJ2OQEa9V/QAG+9vuTfql7/oBcgl+sw8p3vQQ+EiWDabMxsPBN6LbPtccY sL/w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=IW20Ofx3f5HH8b3RRyiE3nSZqXvWNdZSisK1Og430lk=; b=03ahM6hOIj+D9FAnxQk4x1/e5n1851zA/gHfVnc12zZ3WvjuXyc6WvjRk/Nff4DkH7 MlhfHO/QGQ29L4CHGGGqQq5uj9x366Ehgxl7W3A72jloQjW3bWm5MQA1lAhQjU4uSPCG 0L2y7Ms9yEIMn4NCr3MP9wqo4E6lCmuVBn38jCpLv0oDQepwBXs6aRDlQZC1c7GDw5YS +BzI8kNS4t/X/1xhyhZsUtVRBwqn+L+pE8C9NbeSPrGCxt5ma9yu35flsiZHRxeag0Ki uu7phwMH/nSd+Opcgb6SqL00S/VVsu/PLu4XvtzB5rm66GYF/0Op2ntjpmeZfE//S6UB V1Dg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y6si532776pgv.210.2019.07.18.18.13.10; Thu, 18 Jul 2019 18:13:26 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726462AbfGSBLR (ORCPT + 99 others); Thu, 18 Jul 2019 21:11:17 -0400 Received: from lgeamrelo11.lge.com ([156.147.23.51]:43009 "EHLO lgeamrelo11.lge.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726042AbfGSBLQ (ORCPT ); Thu, 18 Jul 2019 21:11:16 -0400 Received: from unknown (HELO lgeamrelo01.lge.com) (156.147.1.125) by 156.147.23.51 with ESMTP; 19 Jul 2019 10:11:15 +0900 X-Original-SENDERIP: 156.147.1.125 X-Original-MAILFROM: byungchul.park@lge.com Received: from unknown (HELO X58A-UD3R) (10.177.222.33) by 156.147.1.125 with ESMTP; 19 Jul 2019 10:11:15 +0900 X-Original-SENDERIP: 10.177.222.33 X-Original-MAILFROM: byungchul.park@lge.com Date: Fri, 19 Jul 2019 10:10:11 +0900 From: Byungchul Park To: Joel Fernandes Cc: "Paul E. McKenney" , Byungchul Park , rcu , LKML , kernel-team@lge.com Subject: Re: [PATCH] rcu: Make jiffies_till_sched_qs writable Message-ID: <20190719011011.GC28226@X58A-UD3R> References: <20190712063240.GD7702@X58A-UD3R> <20190712125116.GB92297@google.com> <20190713151330.GE26519@linux.ibm.com> <20190713154257.GE133650@google.com> <20190713174111.GG26519@linux.ibm.com> <20190719003942.GA28226@X58A-UD3R> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jul 18, 2019 at 08:52:52PM -0400, Joel Fernandes wrote: > On Thu, Jul 18, 2019 at 8:40 PM Byungchul Park wrote: > [snip] > > > - There is a bug in the CPU stopper machinery itself preventing it > > > from scheduling the stopper on Y. Even though Y is not holding up the > > > grace period. > > > > Or any thread on Y is busy with preemption/irq disabled preventing the > > stopper from being scheduled on Y. > > > > Or something is stuck in ttwu() to wake up the stopper on Y due to any > > scheduler locks such as pi_lock or rq->lock or something. > > > > I think what you mentioned can happen easily. > > > > Basically we would need information about preemption/irq disabled > > sections on Y and scheduler's current activity on every cpu at that time. > > I think all that's needed is an NMI backtrace on all CPUs. An ARM we > don't have NMI solutions and only IPI or interrupt based backtrace > works which should at least catch and the preempt disable and softirq > disable cases. > > But yeah I don't see why just the stacks of those CPUs that are > blocking the CPU X would not suffice for the trivial cases where a > piece of misbehaving code disable interrupts / preemption and > prevented the stopper thread from executing. Right. So it makes more interesting tho! :-) > May be once the test case is ready (no rush!) , then it will be more > clear what can help. Yes. I'm really happy to help things about RCU that I love, fixed or improved. And with the the test case or a real issue, I believe I can do more helpful work. Looking forward to it, too (no rush!). Thanks, Byungchul