Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp2496560ybi; Thu, 18 Jul 2019 09:16:46 -0700 (PDT) X-Google-Smtp-Source: APXvYqxiRtVBT/xEECcnnJZKeZdiBWEI2VfzVi5kArpJRgL1yYkGSmdBiYiYQy1+P04efSwlIwCu X-Received: by 2002:a17:902:6b85:: with SMTP id p5mr48198747plk.225.1563466606607; Thu, 18 Jul 2019 09:16:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1563466606; cv=none; d=google.com; s=arc-20160816; b=OpDEwSncWSuMI+geZoMlBSSG+Uij86NLaO0DGJeSbN9ih95m16acePWm+9lb+my4wz QBq+Q3UTnnDyY8o2UKiKEPGwgCVjas0kEulfXhgZYq0q2x/8YxILQxd20puYfsmvuHQm TAK9/tD28QLVc3J3XMITwSAxvz7FECGsZjUk+76rtjGpll2cQlKNHoUXgtJ3/iVLMA1/ JEyMxGvEViWQtRrZRrZxfcJ6ldSxAm8YnOJCkTPPQHXNnr4qU4YoBWLU/t+T860xcxWU dJY9gzYGnzVJ3tAgwsNRNVmxid+6/sfdTBfBxw5i7a/ZWKOSmQ14x2ZDiwTbkalQxouk kPpQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=hcUjT0W7KZXLkur2UXUjc33cVy9DWOd5ZbJ4nex9bqo=; b=FW0qqT7nceCHBWyQ6q+P0thbtw6SuYFC03BJSyygpxNLWIJJhjNnyEo7TI53rHbanF lbAvUNOvG2YukorkWGCY8/MN7OKIVkr2OcaKkErtO2kqkQ4YBfPcgJLcX1wNnjiJDVX3 wXAaOct7b1fBh3bS5xL4dJ7VApDJyBI6ZUOSb//a41x4LiKaEXbj7sajwUW2AJNFLIGV lbpHPLlU6+brj9vVz18sNLesHkMfyepGxnkJJAzUnVYLXIYNzk+LucYQ0S4i7ZLJ0m14 tWMauAg/d7JeVId8TiIZKhfnm/pkXt71TVM/pCzz/yJ3vw5wRuf8yTsxcWpVRuY67Enr PMiw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=yl42423+; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v38si128118plg.277.2019.07.18.09.16.30; Thu, 18 Jul 2019 09:16:46 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=yl42423+; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727985AbfGRQQG (ORCPT + 99 others); Thu, 18 Jul 2019 12:16:06 -0400 Received: from mail-lf1-f67.google.com ([209.85.167.67]:46366 "EHLO mail-lf1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726715AbfGRQQG (ORCPT ); Thu, 18 Jul 2019 12:16:06 -0400 Received: by mail-lf1-f67.google.com with SMTP id z15so15344497lfh.13 for ; Thu, 18 Jul 2019 09:16:04 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=hcUjT0W7KZXLkur2UXUjc33cVy9DWOd5ZbJ4nex9bqo=; b=yl42423+ShMJNj0qJ6yfAuBSR0MXnGVaDILz9UQlWPSYEdTrOp+3pnad7KPBf60BH6 fppJeMupie5K3dihgf9R+Qi+xJDWJugRkkO7KTci5nIcqaqe15xdJpCdgNxr57Dew51u 6QeJ1kp0ZdPv6HtKQpWz4c+KXoxXD8vMf66j8= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=hcUjT0W7KZXLkur2UXUjc33cVy9DWOd5ZbJ4nex9bqo=; b=N6IQkmQKlfNAtnI5uPQ1ZqBSn2rM5kjaxsuUAchyUQf6O4jGsZoEuRj5oR+5r8lKWO SV24xJsujGyTeIcQu/nvgRC5zUBYWqa9I2FRhBk+Dr6og/FcagH5QjaHrIoagV080sSb JAUiSdQFPIuk+hXs8THO3uoUtQjtLMAvutExcH00Gf0yE3tRysNJx8QWswYYKP/xdLIB oBTnW7t2sX+NejnmPFgmtljWSZ1aejSxAjMnsnAzDrrybIp73waLeO8WGhgrL3JjEVpk DlNSeDLjhe+9RYHCqba3LjwDO5FnKol0B4RUgnGb+H404DE02Lh5Y0wVcWRuynqku3aH ys2A== X-Gm-Message-State: APjAAAW2fPDQ500LMpRpq8zwdx1Qb9cdArKC8utymBqdWlI1O2g2pooK DmNDQ9eIZp3pKFVCraOvEhmUJgM8XoMntIvt+Rk= X-Received: by 2002:a19:c503:: with SMTP id w3mr19091061lfe.139.1563466563806; Thu, 18 Jul 2019 09:16:03 -0700 (PDT) MIME-Version: 1.0 References: <20190711130849.GA212044@google.com> <20190711150215.GK26519@linux.ibm.com> <20190711164818.GA260447@google.com> <20190711195839.GA163275@google.com> <20190712063240.GD7702@X58A-UD3R> <20190712125116.GB92297@google.com> <20190713151330.GE26519@linux.ibm.com> <20190713154257.GE133650@google.com> <20190713174111.GG26519@linux.ibm.com> In-Reply-To: From: Joel Fernandes Date: Thu, 18 Jul 2019 12:15:51 -0400 Message-ID: Subject: Re: [PATCH] rcu: Make jiffies_till_sched_qs writable To: "Paul E. McKenney" Cc: Byungchul Park , Byungchul Park , rcu , LKML , kernel-team@lge.com Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jul 18, 2019 at 12:14 PM Joel Fernandes wrote: > > Trimming the list a bit to keep my noise level low, > > On Sat, Jul 13, 2019 at 1:41 PM Paul E. McKenney wrote: > [snip] > > > It still feels like you guys are hyperfocusing on this one particular > > > > knob. I instead need you to look at the interrelating knobs as a group. > > > > > > Thanks for the hints, we'll do that. > > > > > > > On the debugging side, suppose someone gives you an RCU bug report. > > > > What information will you need? How can you best get that information > > > > without excessive numbers of over-and-back interactions with the guy > > > > reporting the bug? As part of this last question, what information is > > > > normally supplied with the bug? Alternatively, what information are > > > > bug reporters normally expected to provide when asked? > > > > > > I suppose I could dig out some of our Android bug reports of the past where > > > there were RCU issues but if there's any fires you are currently fighting do > > > send it our way as debugging homework ;-) > > > > Suppose that you were getting RCU CPU stall > > warnings featuring multi_cpu_stop() called from cpu_stopper_thread(). > > Of course, this really means that some other CPU/task is holding up > > multi_cpu_stop() without also blocking the current grace period. > > > > So I took a shot at this trying to learn how CPU stoppers work in > relation to this problem. > > I am assuming here say CPU X has entered MULTI_STOP_DISABLE_IRQ state > in multi_cpu_stop() but another CPU Y has not yet entered this state. > So CPU X is stalling RCU but it is really because of CPU Y. Now in the > problem statement, you mentioned CPU Y is not holding up the grace > period, which means Y doesn't have any of IRQ, BH or preemption > disabled ; but is still somehow stalling RCU indirectly by troubling > X. > > This can only happen if : > - CPU Y has a thread executing on it that is higher priority than CPU > X's stopper thread which prevents it from getting scheduled. - but the Sorry, here I meant "higher priority than CPU Y's stopper thread".