Received: by 2002:a05:6359:c8b:b0:c7:702f:21d4 with SMTP id go11csp19212rwb; Wed, 21 Sep 2022 16:41:02 -0700 (PDT) X-Google-Smtp-Source: AMsMyM4cyfkFhqFTotCvdrai5mEDmKwJeXtWzue+OiDtmczB9ZWCYeACXEG1DSaoiuPLqQ7iIX9C X-Received: by 2002:a17:907:8a01:b0:780:d2b8:61aa with SMTP id sc1-20020a1709078a0100b00780d2b861aamr566139ejc.454.1663803662255; Wed, 21 Sep 2022 16:41:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1663803662; cv=none; d=google.com; s=arc-20160816; b=ICaija+RY6MGfWwEAVKwpDpIoMU+OT5xsknyVs2jGyZLKiQvp6B6i0Vcs2hfw2LfxY W4qrGV8A7FkLvXrn9FBHFJlIsYeH+BJgeyjSdZH1onsQPkXp68USE3VLauYY5BE/IrpX 6WDs7atq7C9OS/QOPbmcZAn/UKeLJhI0zzFdkjqLDjcmMHwWoPqbIrKu9uVz15hF+j0I gje6V2t/gtLNhjrbBfVB+qdkQnKghqjitSTOeoO+KGtl+shsbZ0/zqVKV9aqkkrx/2lH h7tyPSRxsM55BsvmU7eygiyN7HV/S6TNosx5pA93CBLPqqzisTt9gvVf3ogeXmZYycKv 6mmA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=Jri+suD2qmjIlaEQqBPxT1/rLgDt5v+ooF3MSEY0P78=; b=uQn/a0Md6+wgirMvZEnHumppDcGcdQPJBhNKVkJKlAi65lklEsLrUY3W2h/gotWb4q DJ58LbPO+MgtTXPDCzi+g9VVJAM8UrVShesSrf+e0t1pyScMMdSWMWgNfHngN0BzPk2Z 0G+vsepiTf9y+5n5D/XBEOSBuX2mYRFPPeoKMpqQWtWpkAPtDFQMV0EzFrhWijoApvT9 hbZf0+EXG3v62A84u5G/nu+OeSBv7/uji/9kaPhdFsJsnZXGgAlvFGAcMQSdAwUL+xBC R+3TBCJj7XmbPRvOCNCtGgQYO2o+sFWF3xH27GAGUxAfrklwqfBj9uSnuXDtiqIISAUm JIOw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id hr23-20020a1709073f9700b00781e984151esi3815237ejc.232.2022.09.21.16.40.28; Wed, 21 Sep 2022 16:41:02 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229555AbiIUWl4 (ORCPT + 99 others); Wed, 21 Sep 2022 18:41:56 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54766 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229678AbiIUWly (ORCPT ); Wed, 21 Sep 2022 18:41:54 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 6F2A8A2613 for ; Wed, 21 Sep 2022 15:41:53 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id A64D5143D; Wed, 21 Sep 2022 15:41:59 -0700 (PDT) Received: from [192.168.178.6] (unknown [172.31.20.19]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 0A9D23F73B; Wed, 21 Sep 2022 15:41:49 -0700 (PDT) Message-ID: <04c65f4f-5072-2a07-cbe0-63046a7bc58f@arm.com> Date: Thu, 22 Sep 2022 00:41:40 +0200 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.11.0 Subject: Re: [PATCH v4 5/8] sched/fair: Take into account latency priority at wakeup Content-Language: en-US To: Vincent Guittot Cc: mingo@redhat.com, peterz@infradead.org, juri.lelli@redhat.com, rostedt@goodmis.org, bsegall@google.com, mgorman@suse.de, bristot@redhat.com, vschneid@redhat.com, linux-kernel@vger.kernel.org, parth@linux.ibm.com, qais.yousef@arm.com, chris.hyser@oracle.com, valentin.schneider@arm.com, patrick.bellasi@matbug.net, David.Laight@aculab.com, pjt@google.com, pavel@ucw.cz, tj@kernel.org, qperret@google.com, tim.c.chen@linux.intel.com, joshdon@google.com References: <20220916080305.29574-1-vincent.guittot@linaro.org> <20220916080305.29574-6-vincent.guittot@linaro.org> <073938c4-ab23-2270-8e60-291f2901e230@arm.com> From: Dietmar Eggemann In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-10.6 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_NONE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 20/09/2022 17:49, Vincent Guittot wrote: > On Tue, 20 Sept 2022 at 15:18, Dietmar Eggemann > wrote: >> >> On 19/09/2022 17:39, Vincent Guittot wrote: >>> On Mon, 19 Sept 2022 at 12:05, Dietmar Eggemann >>> wrote: >>>> >>>> On 16/09/2022 10:03, Vincent Guittot wrote: [...] >>>>> + * the idle thread and don't set next buddy as a candidate for being >>>>> + * picked in priority. >>>>> + * In case of simultaneous wakeup from idle, the latency sensitive tasks >>>>> + * lost opportunity to preempt non sensitive tasks which woke up >>>>> + * simultaneously. >>>>> + */ >>>> >>>> The position of this comment block within this function is somehow >>>> misleading since it describes the reason for the function rather then a >>>> particular condition within this function. Wouldn't it be more readable >>>> when it would be a function header comment instead? >>> >>> I put it after the usual early return tests to put the comment close >>> to the useful part: the use of next buddy and __pick_first_entity() >> >> So you want to have the `wakeup_preempt_entity(se, pse) == 1` condition >> from check_preempt_wakeup() also for cfs_task woken up by others. > > I wake the wakeup_preempt_entity(cfs_rq->next, left) < 1 in > pick_next_entity() to pick the task with highest latency constraint > when another class is running while waking up That's correct. This is where you potentially pick this task since it is the next_buddy. All I wanted to say is that check_preempt_from_others() and its `next && wakeup_preempt_entity(next, se) == 1` is the counterpart of the `wakeup_preempt_entity(se, pse) == 1` in check_preempt_wakeup() to be able to set next_buddy even curr is from an other class than CFS. [...] >>>> I still don't get the rationale behind why when either one (se or curr) >>>> of the latency_nice values is negative, we use the diff between them but >>>> if not, we only care about se's value. Why don't you always use the diff >>>> between se and curr? Since we have a range [-20 ... 19] why shouldn't we >>>> use the difference between let's say se = 19 and curr = 5? >>>> You discussed this with Tao Zhou on the v1 but I didn't understand it fully. >>> >>> Let say that current has a latency nice prio of 19 and a task A with a >>> latency nice of 10 wakes up. Both tasks don't care about scheduling >>> latency (current more than task A). If we use the diff, the output of >>> wakeup_latency_gran() would be negative (-10ms) which reflects the >>> fact that the waking task is sensitive to the latency and wants to >>> preempt current even if its vruntime is after. But obviously both >>> current and task A don't care to preempt at wakeup. >> >> OK, I understand but there is a certain level of unsteadiness here. >> >> If p has >0 it gets treated differently in case current has >=0 and case > > "If p >=0"; 0 has same behavior than [1..19] > >> current has <0. Not quite. It depends on curr. With sysctl_sched_latency = 24ms: (1) p = 10 curr = 19 -> wakeup_latency_gran() returns 12ms (2) p = 10 curr = -10 -> wakeup_latency_gran() returns 24ms In (1) only p's own latency counts whereas in (2) we take the diff, In (A) we 'punish' p even though it competes against curr which has an even lower latency requirement than p, >> Do we expect that tasks set their value to [1..19] in this case, when >> the default 0 already indicates a 'don't care'? > > I'm not sure that I understand your concern as [0..19] are treated in > the same way. Only tasks (curr or se) with offset <0 need a relative > comparison to the other. If curr and se has both a latency nice of > -19, se should not blindly preempt curr but only if curr already run > for its amount of time With p = -19 and curr = -19 we would take the diff, so 0ms. With p = 19 and curr = 19, if we would use `latency_offset -= curr->latency_offset` wakeup_latency_gran() would return 973/1024*24ms - 973/1024*24ms = 0ms and nothing will shift. OTHA, in case (1) wakeup_latency_gran() would return 512/1024*24ms - 973/1024*24ms = - 10.80ms. So p would gain an advantage here instead of a penalty. Essentially using the full [-20 .. 19] nice scope for `se vs. curr` comparison.