Received: by 2002:a05:6a10:a0d1:0:0:0:0 with SMTP id j17csp1222406pxa; Thu, 6 Aug 2020 02:38:21 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwCCC1jiVDn9QQQuHs/Ls9Om+KIKPn8RDbAv5nDDH6ClsGUXq89j9XJtUTiiMwdk/UMKlht X-Received: by 2002:a50:f411:: with SMTP id r17mr3215850edm.190.1596706701497; Thu, 06 Aug 2020 02:38:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1596706701; cv=none; d=google.com; s=arc-20160816; b=oD9z5NkSdrh1RrkK1cKEGpwm1vzfe/HJqF8V7TbLTjUxFUizFfkWFA0fSWXzhQEcsf uHDMeXrys/fbAXZAMi1651+Bwdd+OUrMn/tJ/f+IJLac+WiFenrs0IaHm8rgbV0jwhFn tdF4PvjGTFpywlSG79zbRytWyjdgbTIieU0ALXPGeIcFkZrpXeRjHn4XFyHOi2bpyuUD WpOv3NTvkj9AmYHHmmw4lK2hE94SYZMFsjjGQk1sZtXWVkOuMCTMsXDu+mKlI4jLpIY2 PnHFImlug8iJfoeu58/K23NjHJ1hHCQt3Os2GDAw4Q7yu3ELJ1C3+8LX791jUnGX/MQy lvlA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:references :in-reply-to:subject:cc:to:dkim-signature:dkim-signature:from; bh=fQkbKY9HMHdIDVxo57b/7ooYqvcFkMgq0/3YSoWs1eI=; b=GCSjul3Zbt9csxNMl2rLEvDd9RSYyWJkmOVDfbA2+3l3pSk5sUqK8LD6nfmnBXyssg uM6EAqmO+5rvc07Wz/o4gluW+GlgfM1txpTRtEkTHopMh+NHaMV7HU+WHenlJHRc7gL4 YgVuXcamLU6LKwUrtFlNc491RU/VjmRdWAibLFCvP6bCd5drBbB0KOQTWOaZaD3gNEhC VntkFnMtw4GR0HLNDehBTUmcvYefH2xCnfnkk+djc3jCH6Pww1YfRIOjYsI7WkobZ/bR jYo3YyiAWcR1QLSlBIzBqQjkcVoHivLGEAxK65+r1xqwuSJDwDWiLUVfnK0KX8CiRfSY lrLw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=kN9sYWpF; dkim=neutral (no key) header.i=@vger.kernel.org; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l17si2707415eja.654.2020.08.06.02.37.58; Thu, 06 Aug 2020 02:38:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=kN9sYWpF; dkim=neutral (no key) header.i=@vger.kernel.org; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729267AbgHFJhF (ORCPT + 99 others); Thu, 6 Aug 2020 05:37:05 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60858 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729074AbgHFJey (ORCPT ); Thu, 6 Aug 2020 05:34:54 -0400 Received: from galois.linutronix.de (Galois.linutronix.de [IPv6:2a0a:51c0:0:12e:550::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id BD03BC061574 for ; Thu, 6 Aug 2020 02:34:53 -0700 (PDT) From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1596706483; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=fQkbKY9HMHdIDVxo57b/7ooYqvcFkMgq0/3YSoWs1eI=; b=kN9sYWpFBqxH8i+Q+XfIJ8K3gXFkwExErJzky6iYhLWWPcVa2Z+p+OkEvD+GisZSXAeaLZ /oJ9kuejzW+IncznugGgptjhxw1+7hzVMcsFt7Y1Oq4issSKGHiO8rU7Rs5jqhzql0diX2 2oG3d8F9RifH6uyXVYIIZqNNKOuJmzgMFr7kSEomrSqabk5OASsk4BB+EN+HwRxo7IqWBZ zoRt4BEcLoNSg66M2m8QbTb2dRdkf7tE/7hJ4bjWqnvjmFwRE7dmSMQT6GFb35Fxk9YSpH n8j4k/ev4EXmFfg2P1UeeSmzVNT1JD2pVHXwXZN5cCxzYfIGBmWpyeLkIjCxzA== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1596706483; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=fQkbKY9HMHdIDVxo57b/7ooYqvcFkMgq0/3YSoWs1eI=; b=MLXgfR34ZN2c7p471X0+h9DeZYNO1M5vJ4NsJRZZh/+fSvjbiWc5sKSQu+R2OqaUwHFJ4S b72tNLPsH2m+vSBA== To: peterz@infradead.org Cc: Valentin Schneider , Vladimir Oltean , Kurt Kanzenbach , Alison Wang , catalin.marinas@arm.com, will@kernel.org, paulmck@kernel.org, mw@semihalf.com, leoyang.li@nxp.com, vladimir.oltean@nxp.com, linux-arm-kernel@lists.infradead.org, linux-kernel@vger.kernel.org, Anna-Maria Gleixner Subject: Re: [RFC PATCH] arm64: defconfig: Disable fine-grained task level IRQ time accounting In-Reply-To: <20200805134002.GQ2674@hirez.programming.kicks-ass.net> References: <877dumbtoi.fsf@kurt> <20200729094943.lsmhsqlnl7rlnl6f@skbuf> <87mu3ho48v.fsf@kurt> <20200730082228.r24zgdeiofvwxijm@skbuf> <873654m9zi.fsf@kurt> <87lfiwm2bj.fsf@nanos.tec.linutronix.de> <20200803114112.mrcuupz4ir5uqlp6@skbuf> <87d047n4oh.fsf@nanos.tec.linutronix.de> <875z9zmt4i.fsf@nanos.tec.linutronix.de> <20200805134002.GQ2674@hirez.programming.kicks-ass.net> Date: Thu, 06 Aug 2020 11:34:42 +0200 Message-ID: <877duci0ct.fsf@nanos.tec.linutronix.de> MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org peterz@infradead.org writes: > On Mon, Aug 03, 2020 at 09:22:53PM +0200, Thomas Gleixner wrote: > >> totaltime = irqtime + tasktime >> >> Ignoring irqtime and pretending that totaltime is what the scheduler >> can control and deal with is naive at best. > > Well no, that's what we call system overhead and is assumed to be > included in the 'error margin'. > > The way things are set up is that we say that, by default, RT tasks can > consume 95% of cputime and the remaining 5% is sufficient to keep the > system alive. > > Those 5% include all system overhead, IRQs, RCU, !RT workqueues etc.. > > Obviously IRQ_TIME accounting changes the balance a bit, but that's what > it is. We can't really do anything better. > > Apparently this SoC has significant IRQ time for some reason. Also, > relying on RT throttling for 'correct' behaviour is also wrong. What > needs to be done is find who is using all this RT time and why, that > isn't right. It's a test case and we know already who is using the time. But that's not the point. A runaway RT task resulting in a RCU stall or whatever lockup of the system is definitely not the right answer. The throttler, as much as it's a horrible hack, is there to prevent this and to give the admin a chance to pinpoint and kill that thing instead of having to press the reset button and scratching head what might have caused this. Thanks, tglx