Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp4523358imw; Tue, 19 Jul 2022 08:10:38 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tLOzSHbRxNpABG2Canykph4Va68TDkrl3qW/Twyh3lwE9Eq7lYXAWEynF6+u+LlITnGHrW X-Received: by 2002:a05:6602:180c:b0:67c:296:2561 with SMTP id t12-20020a056602180c00b0067c02962561mr5953772ioh.173.1658243438221; Tue, 19 Jul 2022 08:10:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658243438; cv=none; d=google.com; s=arc-20160816; b=P33g2m3FzXKIsjCMocNycVK7TItk1wuPWyOjl+WgR2koZK9tH3dcCUsrLfF0Z2n0sB A8ne44d5pJ2iwE7R7MZtpI9PHfbLFLE6R50YNiX6Hdn6VTxFP1ublzESakMLRN7jQXA3 ACE48mvKZ045CTIrwcaTVf+a/86q2Z1qa3bu5jTyH4iXRa1pp9Dpf1K/TP2h14nGC+0T oNZLcQS60GU1sjplzsvu8NW+jkBx9YPUpGrkfRCEtt6XnSI/GSKwQZyr+5ff7OeJhgL/ ieLyZxsYMYKwdGr5XLYNh4RR18EmKg1OJBV2dGSZYQXtzUAWEkuSa/iV+oqwxocSdlJh r2cQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=UauK7OtGZhH/P5uIiYJeIA1oMKP6uQwwfArKxe76A8w=; b=UOyZhyZSZjf+ULUmT8tT3g4M2jS2kOLU2m9N33wH6bzW5cV76uOqs8gapM1fdgIjaX PLVRe3vjY2x59YTvYsGe9selHTgJIXVX0kfENf8ELwozjbY7U3uO3havYGeYaOSUzGjJ cNlrxYU1c1l/GOnQLeFVZ8wDMpjdaOu0vVHsVC8rfZU6U/aUe4rWeaQMKtxc6fG6Xnlv daCsJFWhvX9lGT5OsuQ2NgSX01W4/j1aCQ2V6XFvc3NkHtDrgKNfH53bIUQXlGX4Inl+ 60ZpIxUR17WWYrbnu4PRW5rysXvznPukQm+1u/9XvhM1TS+yJX2g7YNigree7dToqdLR BUSQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id k11-20020a056e02156b00b002dcf594fdc1si5224558ilu.179.2022.07.19.08.10.23; Tue, 19 Jul 2022 08:10:38 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235311AbiGSPCp (ORCPT + 99 others); Tue, 19 Jul 2022 11:02:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:32882 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235188AbiGSPCe (ORCPT ); Tue, 19 Jul 2022 11:02:34 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 3F5CA4D15D for ; Tue, 19 Jul 2022 08:02:33 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 7409D13D5; Tue, 19 Jul 2022 08:02:33 -0700 (PDT) Received: from [192.168.178.6] (unknown [172.31.20.19]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id B14283F70D; Tue, 19 Jul 2022 08:02:31 -0700 (PDT) Message-ID: <17c9af40-bf53-be3c-c678-159a8ab8964a@arm.com> Date: Tue, 19 Jul 2022 17:02:22 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.9.1 Subject: Re: [External] Re: [PATCH v2 07/10] sched/fair: use update_load_avg() to attach/detach entity load_avg Content-Language: en-US To: Chengming Zhou , mingo@redhat.com, peterz@infradead.org, vincent.guittot@linaro.org, rostedt@goodmis.org, bsegall@google.com, vschneid@redhat.com Cc: linux-kernel@vger.kernel.org References: <20220713040430.25778-1-zhouchengming@bytedance.com> <20220713040430.25778-8-zhouchengming@bytedance.com> <88062fb6-e2fe-cf4e-10b5-7694c4d30941@bytedance.com> From: Dietmar Eggemann In-Reply-To: <88062fb6-e2fe-cf4e-10b5-7694c4d30941@bytedance.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-6.9 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 15/07/2022 18:21, Chengming Zhou wrote: > On 2022/7/15 19:18, Dietmar Eggemann wrote: >> On 13/07/2022 06:04, Chengming Zhou wrote: >>> Since update_load_avg() support DO_ATTACH and DO_DETACH now, we can >>> use update_load_avg() to implement attach/detach entity load_avg. >>> >>> Another advantage of using update_load_avg() is that it will check >>> last_update_time before attach or detach, instead of unconditional >>> attach/detach in the current code. >>> >>> This way can avoid some corner problematic cases of load tracking, >>> like twice attach problem, detach unattached NEW task problem. >> >> This explanation is somewhat hard to follow for me. Since both issues >> have been fixed already (you mention this further below) you're saying >> that with you change you don't reintroduce them? > > Sorry for this not very clear explanation. > > Yes, both issues have been fixed already, what I want to say is that bugfix > brings its own problem and limitation mentioned below. > > So I want to use another way to solve these problems better. [...] >>> These problems have been fixed in commit 7dc603c9028e >>> ("sched/fair: Fix PELT integrity for new tasks"), which also >>> bring its own problems. >>> >>> First, it add a new task state TASK_NEW and an unnessary limitation >>> that we would fail when change the cgroup of TASK_NEW tasks. > > This is the limitation that bugfix has brought. > > We can't change cgroup or switch to fair for task with last_update_time=0 > if we don't have conditional detach/attach. > > So we have to: > > 1. !fair task also need to set last_update_time. > 2. cpu_cgroup_can_attach() have to wait for TASK_NEW to fully attached. I see. `cgroup_migrate_execute() -> cpu_cgroup_[can|]_attach()` has to wait for `wake_up_new_task() -> WRITE_ONCE(p->__state, TASK_RUNNING)`. Just to understand this change better: IMHO, this is still the case for fair tasks, right? `wake_up_new_task() -> post_init_entity_util_avg() -> attach_entity_cfs_rq()` has to happen before the fair task can move between taskgroups in `cgroup_migrate_execute() -> cpu_cgroup_attach() -> sched_move_task() -> sched_change_group() -> task_change_group_fair()`. [...]