Received: by 2002:a05:6a10:2785:0:0:0:0 with SMTP id ia5csp1053004pxb; Sat, 9 Jan 2021 06:02:05 -0800 (PST) X-Google-Smtp-Source: ABdhPJzcUrIXbWi19+dd7k/0ZH7loRDleOEBgKCDgR/X6CiU3cENNGKEkBejm5EJ+dL7+qzLI6FV X-Received: by 2002:aa7:cd71:: with SMTP id ca17mr9009677edb.25.1610200925237; Sat, 09 Jan 2021 06:02:05 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1610200925; cv=none; d=google.com; s=arc-20160816; b=DXTJsjVHAxUusA5nnZdxiyI5Ml30nEau+wf8vnYVlTc+l1SKayFOvexbZ4BlBSi3n7 yqIAA0/CpUVxA6bwDWao+JL1d69yl/EGzrlpCxCNqXfCQ1ukFvlkj/KAutVxAgDerBmK 8G4jw6frEvW5Rg68JkRSR4KvBwWSsF+kdP6JwEXhmpjQ8SSGZ/EbJlRPZJ/tobmIlExr EsQdJYNGfKHBlV0eYgTCqZOYCXSSuIzhFNhOSlqlTgpY2pRt8hV1pH8PuXUgIW/fpBxc YkrlT4KjhUiVxvoi92xK9+Su2kqeUZAHW83nfPMY7/fjr/v+zAEFynTCpGEyXqiRPh1W tCGQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=0SO9lOr65EXI2v4D1rbh5MdynOne8WHftLcHK+DiZyk=; b=wVSg80aML+DhiVpuFhT17z64NpQ9/XezAIYirH4NNzPwmXuAFbrjxzifV7Txd8RQkZ UwmggSOJO7MPoO72MwHnEv6zv7Gr4PhaW6UsaLKxmOcygVGCt69ojx1shrB6Ma9BrVsN iuIrwvmPr3eLCnfkUbCOt/o8tZsyGS4n890b9cDZW2nfjVO7TkXPoUsrNJ6l6RnVDneR KGY/Y/zr2CZdjdTd9hxWXO+TcSeKmOgYU87jiB1c0Z+4EbnItzlxddmfp1t8fQ63Mluz mgH+h04uJvJi5d72UNul+cPN8Xc/SHDq5j2FgsqvuAd0xNhmeG4J2BB1XMKght2Gzz4P /6Yg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t20si4982426edw.31.2021.01.09.06.01.41; Sat, 09 Jan 2021 06:02:05 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725901AbhAIOAs (ORCPT + 99 others); Sat, 9 Jan 2021 09:00:48 -0500 Received: from outbound-smtp62.blacknight.com ([46.22.136.251]:35281 "EHLO outbound-smtp62.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725780AbhAIOAr (ORCPT ); Sat, 9 Jan 2021 09:00:47 -0500 Received: from mail.blacknight.com (pemlinmail02.blacknight.ie [81.17.254.11]) by outbound-smtp62.blacknight.com (Postfix) with ESMTPS id 43E7AFABA5 for ; Sat, 9 Jan 2021 13:59:55 +0000 (GMT) Received: (qmail 23259 invoked from network); 9 Jan 2021 13:59:55 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[84.203.22.4]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 9 Jan 2021 13:59:55 -0000 Date: Sat, 9 Jan 2021 13:59:53 +0000 From: Mel Gorman To: Peter Zijlstra Cc: "Li, Aubrey" , vincent.guittot@linaro.org, linux-kernel@vger.kernel.org, mingo@redhat.com, juri.lelli@redhat.com, valentin.schneider@arm.com, qais.yousef@arm.com, dietmar.eggemann@arm.com, rostedt@goodmis.org, bsegall@google.com, tim.c.chen@linux.intel.com, benbjiang@gmail.com Subject: Re: [RFC][PATCH 1/5] sched/fair: Fix select_idle_cpu()s cost accounting Message-ID: <20210109135953.GF3592@techsingularity.net> References: <20201214164822.402812729@infradead.org> <20201214170017.877557652@infradead.org> <20201215075911.GA3040@hirez.programming.kicks-ass.net> <20210108102738.GB3592@techsingularity.net> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.10.1 (2018-07-13) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Jan 08, 2021 at 09:21:48PM +0100, Peter Zijlstra wrote: > On Fri, Jan 08, 2021 at 10:27:38AM +0000, Mel Gorman wrote: > > > 1. avg_scan_cost is now based on the average scan cost of a rq but > > avg_idle is still scaled to the domain size. This is a bit problematic > > because it's comparing scan cost of a single rq with the estimated > > average idle time of a domain. As a result, the scan depth can be much > > larger than it was before the patch and led to some regressions. > > > @@ -6164,25 +6164,25 @@ static int select_idle_cpu(struct task_struct *p, struct sched_domain *sd, int t > > */ > > avg_idle = this_rq()->avg_idle / 512; > > avg_cost = this_sd->avg_scan_cost + 1; > > - > > - span_avg = sd->span_weight * avg_idle; > > - if (span_avg > 4*avg_cost) > > - nr = div_u64(span_avg, avg_cost); > > - else > > + nr = div_u64(avg_idle, avg_cost); > > + if (nr < 4) > > nr = 4; > > Oooh, could it be I simply didn't remember how that code was supposed to > work and should kick my (much) younger self for not writing a comment? > > Consider: > > span_weight * avg_idle avg_cost > nr = ---------------------- = avg_idle / ---------- > avg_cost span_weigt > > Where: avg_cost / span_weight ~= cost-per-rq > This would definitely make sense and I even evaluated it but the nature of avg_idle and the scale it works at (up to 2*sched_migration_cost) just ended up generating lunatic values far outside the size of the domain size. Fitting that to the domain size just ended up looking silly too and avg_cost does not decay. Still, in principle, it's the right direction, it's just not what the code does right now. -- Mel Gorman SUSE Labs