Received: by 2002:a05:6a10:206:0:0:0:0 with SMTP id 6csp1726914pxj; Wed, 19 May 2021 12:26:18 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxOU39sa/hBk+7T1SAHDWQjydD6Wd/wDrG7clW6cenZ0b/07jsc/vGi4Y/KuDo2KQ12vQR7 X-Received: by 2002:a92:cda7:: with SMTP id g7mr602821ild.50.1621452378067; Wed, 19 May 2021 12:26:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1621452378; cv=none; d=google.com; s=arc-20160816; b=iuNC8pU7z4EgkTwaNoPCIZCWFnT/MAJ0yJ+wrALQ9lOZ9VbhHCkANs5VdP7R3ULO/f QvdABbTklZmqkC7Yh8NRpoSaQMvmziOaVgXR7xBeJE0FXfva50psJfMrJkrvMnNnDZ2c eY/eRy9eCGpH95s2PDUl+GdIKOPpcsAec4XskQ5QH9sONFxL6YrCRFji9jBloO73kbJi sZABZ23sf1wVJmsGOqEw9uKY20AkTxaEzN9GaD4hmfSmfiitziWPZv9FELilj6dFGewj KXk+EkHO5UaifZWpQHSTSXHvg2qSPe5FagFrv661meHMEJJQAm077EBL1P6YhT7kBLnj ygJQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=HBJRIrPYNEWkUV6whSNKp+lO4+I/xTtmWr90cYSZI5Y=; b=If+8p3of0FxhqMgHF+dxZMZy9m6DelYGCSyoXT4LIIkCloioo8ANx/62c3pc6lrT8v 7WmdRkrWXZwQIUcbKy/CHPwkFZziTQjK37+pcSP0E8xDLNeQLW70oZE797FFy1NUtEdG zWv93W2GGF6x315TUNBsCIdrrS1g0DDEDoqMpDCagoibYNVS2Rz8Wf8yGMOsn2pE/TlQ bnSdKbOV5gzFh2pE2vUr5Zqc4jVmI1RvyeJabQmPgI2Q05GRnJuUekvL3Whc6dqrRd1V HgM8ZdycPHyhiEavx9KboBL5tRG7puDXyBye/rsQ2yv3c45uB1/ucVdlLdBvdzvsm+cw MM/A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=Hv3R4gIh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id j2si190749jak.119.2021.05.19.12.26.05; Wed, 19 May 2021 12:26:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@infradead.org header.s=casper.20170209 header.b=Hv3R4gIh; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1343975AbhESJ2t (ORCPT + 99 others); Wed, 19 May 2021 05:28:49 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56996 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229668AbhESJ2s (ORCPT ); Wed, 19 May 2021 05:28:48 -0400 Received: from casper.infradead.org (casper.infradead.org [IPv6:2001:8b0:10b:1236::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0A20EC06175F for ; Wed, 19 May 2021 02:27:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=infradead.org; s=casper.20170209; h=In-Reply-To:Content-Type:MIME-Version: References:Message-ID:Subject:Cc:To:From:Date:Sender:Reply-To: Content-Transfer-Encoding:Content-ID:Content-Description; bh=HBJRIrPYNEWkUV6whSNKp+lO4+I/xTtmWr90cYSZI5Y=; b=Hv3R4gIhziazV15RzKo8j6Dn2F LFEHlm0uIFkuZ2ROfqEeUOX3LPXLnF7sbnhtgf0vEDDh2BsTZsIHhdOdNvsA41F8czwPHd7AJBAkc VzXC/r20rn2OuVZaFBI8tkCekHtNRSLSclJniyFoGAJREz5EvNbKTdDIoCETfiaZZPQZ90QoNWSZL 7zdHKCSWmbFGjnddvcKrSiKqx/lf4kBhB23iYDj/P63o8UsUcmgLBSkg5Asq3MYdHPNepcch4F21N zGhwMC392GRAl/9MlxYqVNPhjf5IVbi+ul2vBZyqSJ1UFGIqX9L1nGnVELsh7XZZWo4i7+WvyEbsu VoMH41DA==; Received: from j217100.upc-j.chello.nl ([24.132.217.100] helo=noisy.programming.kicks-ass.net) by casper.infradead.org with esmtpsa (Exim 4.94 #2 (Red Hat Linux)) id 1ljIRo-00Enx4-Ou; Wed, 19 May 2021 09:25:08 +0000 Received: from hirez.programming.kicks-ass.net (hirez.programming.kicks-ass.net [192.168.1.225]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits) server-digest SHA256) (Client did not present a certificate) by noisy.programming.kicks-ass.net (Postfix) with ESMTPS id C5DFE3001DB; Wed, 19 May 2021 11:24:58 +0200 (CEST) Received: by hirez.programming.kicks-ass.net (Postfix, from userid 1000) id 8380E207153F1; Wed, 19 May 2021 11:24:58 +0200 (CEST) Date: Wed, 19 May 2021 11:24:58 +0200 From: Peter Zijlstra To: "hasegawa-hitomi@fujitsu.com" Cc: "'mingo@kernel.org'" , "'fweisbec@gmail.com'" , "'tglx@linutronix.de'" , "'juri.lelli@redhat.com'" , "'vincent.guittot@linaro.org'" , "'dietmar.eggemann@arm.com'" , "'rostedt@goodmis.org'" , "'bsegall@google.com'" , "'mgorman@suse.de'" , "'bristot@redhat.com'" , "'linux-kernel@vger.kernel.org'" Subject: Re: Utime and stime are less when getrusage (RUSAGE_THREAD) is executed on a tickless CPU. Message-ID: References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, May 19, 2021 at 06:30:36AM +0000, hasegawa-hitomi@fujitsu.com wrote: > Hi Ingo, Peter, Juri, and Vincent. > > > > Your email is malformed. > > I'm sorry. I was sent in the wrong format. I correct it and resend. > Thank you, Peter, for pointing this out. > > > I found that when I run getrusage(RUSAGE_THREAD) on a tickless CPU, > the utime and stime I get are less than the actual time, unlike when I run > getrusage(RUSAGE_SELF) on a single thread. > This problem seems to be caused by the fact that se.sum_exec_runtime is not > updated just before getting the information from 'current'. > In the current implementation, task_cputime_adjusted() calls task_cputime() to > get the 'current' utime and stime, then calls cputime_adjust() to adjust the > sum of utime and stime to be equal to cputime.sum_exec_runtime. On a tickless > CPU, sum_exec_runtime is not updated periodically, so there seems to be a > discrepancy with the actual time. > Therefore, I think I should include a process to update se.sum_exec_runtime > just before getting the information from 'current' (as in other processes > except RUSAGE_THREAD). I'm thinking of the following improvement. > > @@ void getrusage(struct task_struct *p, int who, struct rusage *r) > if (who == RUSAGE_THREAD) { > + task_sched_runtime(current); > task_cputime_adjusted(current, &utime, &stime); > > Is there any possible problem with this? Would be superfluous for CONFIG_VIRT_CPU_ACCOUNTING_NATIVE=y architectures at the very least. It also doesn't help any of the other callers, like for example procfs. Something like the below ought to work and fix all variants I think. But it does make the call significantly more expensive. Looking at thread_group_cputime() that already does something like this, but that's also susceptible to a variant of this very same issue; since it doesn't call it unconditionally, nor on all tasks, so if current isn't part of the threadgroup and/or another task is on a nohz_full cpu, things will go wobbly again. There's a note about syscall performance there, so clearly someone seems to care about that aspect of things, but it does suck for nohz_full. Frederic, didn't we have remote ticks that should help with this stuff? And mostly I think the trade-off here is that if you run on nohz_full, you're not expected to go do syscalls anyway (because they're sodding expensive) and hence the accuracy of these sort of things is mostly irrelevant. So it might be the use-case is just fundamentally bonkers and we shouldn't really bother fixing this. Anyway? --- diff --git a/kernel/sched/cputime.c b/kernel/sched/cputime.c index 872e481d5098..620871c8e4f8 100644 --- a/kernel/sched/cputime.c +++ b/kernel/sched/cputime.c @@ -612,7 +612,7 @@ void cputime_adjust(struct task_cputime *curr, struct prev_cputime *prev, void task_cputime_adjusted(struct task_struct *p, u64 *ut, u64 *st) { struct task_cputime cputime = { - .sum_exec_runtime = p->se.sum_exec_runtime, + .sum_exec_runtime = task_sched_runtime(p), }; task_cputime(p, &cputime.utime, &cputime.stime);