Received: by 2002:ac0:e350:0:0:0:0:0 with SMTP id g16csp182756imn; Thu, 4 Aug 2022 01:38:13 -0700 (PDT) X-Google-Smtp-Source: AA6agR4fYsHopk7BnskJBnsJ8kHQ03ctKdL3HFXoZPOEiJ81uhYFCfKwOWlDSWndQSlhM6fl2DD8 X-Received: by 2002:a05:6402:f17:b0:43e:4700:f63e with SMTP id i23-20020a0564020f1700b0043e4700f63emr935329eda.190.1659602293309; Thu, 04 Aug 2022 01:38:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1659602293; cv=none; d=google.com; s=arc-20160816; b=kz3JomFnGA/cZL+k8w6PlwfFibo0V2+FxBV70Vj2hAk9nA3ocQJagilEDDiNOhzxqi gyI3jUpusw31lXW1VcStETJKAOauEb6fZmV3Uh7C9Ft3bvyEa4B3RG7PMo3JbFLzHcmr IbL1tTBMJtjWnlOYyxT+M60WlG/CMp9w9Y8yyduYCe5tkh+ohGqgdBh2fI9l3kemeJwD b0mfDktSSxyEE8R0VwNy+ax9FkhOm3A+VIgur/CZQ8NUfVNEYeukJrWGB/JcWxtQZIMb OO75ndTL73j79SUeObiEZ58yEDDVCBlhtoOERisKhfLa5dOOhm4+u0hvg6Yxa/OkmpfQ S9Jg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:references :to:from:subject:user-agent:mime-version:date:message-id; bh=FaOXkEfwvJTl/MUXOxWzobG9RHemvgMRA3KKUL84+sc=; b=EqmmcchMToqG+Oz8tRRLvhtacYDK44PhhgyPvrtaE6emC+ZH4ivUY242LdNaBpTRDV taF6n5T7WlkarlOzVSZLi7/BowEHolT6WO/14bgYuKWU0ghk8j0wPse5kc4WbXEmg40n huukHJUM77g6vq+biZ5YKa7t6OQXpxAEUqOewIP7XPM9zsOWyjBh80MdKxHCn8bgLXwJ bq+kA7K0Jj8LPJc+cZtiQCGk6QnXhkGWV+MyIWr4YbURfMOtq9OO4TZioeBgUnS28hYK S4T2tVSfRqA/dw5hAG8P7gx/mOKv/s69/FoNBu+J5HuU2N1r0PWlSZsm0rRsiUVENAET l2Hw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id g24-20020a056402181800b0043bdfd057e4si541701edy.194.2022.08.04.01.37.47; Thu, 04 Aug 2022 01:38:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=huawei.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S239221AbiHDHoS (ORCPT + 99 others); Thu, 4 Aug 2022 03:44:18 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60758 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232405AbiHDHoQ (ORCPT ); Thu, 4 Aug 2022 03:44:16 -0400 Received: from szxga01-in.huawei.com (szxga01-in.huawei.com [45.249.212.187]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 80A5061D7A for ; Thu, 4 Aug 2022 00:44:14 -0700 (PDT) Received: from canpemm500009.china.huawei.com (unknown [172.30.72.57]) by szxga01-in.huawei.com (SkyGuard) with ESMTP id 4Lz1031vtHzmVLF; Thu, 4 Aug 2022 15:42:11 +0800 (CST) Received: from [10.67.109.51] (10.67.109.51) by canpemm500009.china.huawei.com (7.192.105.203) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2375.24; Thu, 4 Aug 2022 15:44:08 +0800 Message-ID: <1f1f625a-148d-0398-f840-1f9b4e964189@huawei.com> Date: Thu, 4 Aug 2022 15:44:08 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:91.0) Gecko/20100101 Thunderbird/91.3.2 Subject: Re: [Question] Reading /proc/stat has a time backward issue From: "Lihua (lihua, ran)" To: Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Dietmar Eggemann , Steven Rostedt , Ben Segall , Mel Gorman , Daniel Bristot de Oliveira , "open list:SCHEDULER" References: <85d5087b-450c-351f-270d-c61303cf3187@huawei.com> In-Reply-To: <85d5087b-450c-351f-270d-c61303cf3187@huawei.com> Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 8bit X-Originating-IP: [10.67.109.51] X-ClientProxiedBy: dggems706-chm.china.huawei.com (10.3.19.183) To canpemm500009.china.huawei.com (7.192.105.203) X-CFilter-Loop: Reflected X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_MED,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org ping... Any good suggestions? thanks all. 在 2022/7/27 12:02, Lihua (lihua, ran) 写道: > Hi all, > > I found a problem that the statistical time goes backward, the value read first is 319, and the value read again is 318. As follows: > first: > cat /proc/stat |  grep cpu1 > cpu1    319    0    496    41665    0    0    0    0    0    0 > then: > cat /proc/stat |  grep cpu1 > cpu1    318    0    497    41674    0    0    0    0    0    0 > > Time goes back, which is counterintuitive. > > After debug this, I found that the problem is caused by the implementation of kcpustat_cpu_fetch_vtime. As follows: > >                               CPU0                                                                          CPU1 > First: > show_stat(): >     ->kcpustat_cpu_fetch() >         ->kcpustat_cpu_fetch_vtime() >             ->cpustat[CPUTIME_USER] = kcpustat_cpu(cpu) + vtime->utime + delta;              rq->curr is in user mod >              ---> When CPU1 rq->curr running on userspace, need add utime and delta >                                                                                              --->  rq->curr->vtime->utime is less than 1 tick > Then: > show_stat(): >     ->kcpustat_cpu_fetch() >         ->kcpustat_cpu_fetch_vtime() >             ->cpustat[CPUTIME_USER] = kcpustat_cpu(cpu);                                     rq->curr is in kernel mod >             ---> When CPU1 rq->curr running on kernel space, just got kcpustat > > Because the values ​​of utime、 stime and delta are temporarily written to cpustat. Therefore, there are two problems  read from /proc/stat: > 1. There may be a regression phenomenon; > 2. When there are many tasks, the statistics are not accurate enough when utime and stime do not exceed one TICK. > The time goes back is counterintuitive, and I want to discuss whether there is a good solution without compromising performance. > > Thanks a lot.