Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp1681573ybz; Thu, 16 Apr 2020 13:35:04 -0700 (PDT) X-Google-Smtp-Source: APiQypLPwOvUiwWp0xu9onfcBZ/gwlLb7Uf55AXONULnxSdnS/Hc3seo2gux3IbnR5HEmC5w9KUo X-Received: by 2002:aa7:dd53:: with SMTP id o19mr41917edw.180.1587069304592; Thu, 16 Apr 2020 13:35:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1587069304; cv=none; d=google.com; s=arc-20160816; b=02mTjVHmD6ski34UY247agqN+Lc5VKc5oN1UqdXtO1uIPobkPgbuMCFxfbt0XWNBEg SGko553okkn7sTBGgh/fJcJmqLN4EyvLUdV0MpYGh7pN7UN+OWHwNPOXGNhZkQl2Bz/u L7+7Vr2kLwiQmLuYP9DnrG4wV/v3u/fKqsvsHbY826NgNuHtriUOzKUwJoKtGaIBDc/D RRGwFuaGMFiNE7VsqgshYNpb1R4Hfmo7WIffBsSP/ytM+VRxDMr/wRUv05myX9tfYQJR Reapqd1tzDwq/4NKHxTGis/EZ1T/HARu19nT0xz5z7JEXAOLjPGeSHC3ZcHd/pFLqXXv FKyQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:in-reply-to :subject:cc:to:from:user-agent:references; bh=+uonaYB15rUDFW1LHrqourUlpwlD3BWzSylO21wTqo8=; b=BDkHUBkktxRIG8DOn1E5vnKf1iHBlvo3uzcSZVAthKN0qoF6ZCd7jkzMxG/mRQhumc YJYexPFWaTTGwobbAxYGnl9oVZOQXwC6bhdtKxBoK2BM3VU0MgiJc4DHg2TO1FBRrnUZ e58PajdhsSvLqvfEuIREf+MPkfrHtotzsg1u2Cv9bbXz+kPBbXdSKNpnUs0wJWfA2cuF F96z5K/cIdfwEcGKM1OPeMV6R6xxQ6YzPXxssG/TI5lLQAHfdX/zOaEFdsm0/Yyh5JeJ Go/SbBr939pMpM00XwcJK8u0UqOHFcMR7Iz3JjE7pmIKdPK1wMWW0WhVL0gnjgSnkEyU LQ1w== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id i19si12351598edr.235.2020.04.16.13.34.41; Thu, 16 Apr 2020 13:35:04 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2395246AbgDPP1n (ORCPT + 99 others); Thu, 16 Apr 2020 11:27:43 -0400 Received: from foss.arm.com ([217.140.110.172]:35904 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2395240AbgDPP1i (ORCPT ); Thu, 16 Apr 2020 11:27:38 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id C115B30E; Thu, 16 Apr 2020 08:27:37 -0700 (PDT) Received: from e113632-lin (e113632-lin.cambridge.arm.com [10.1.194.46]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 035633F237; Thu, 16 Apr 2020 08:27:36 -0700 (PDT) References: <20200415210512.805-1-valentin.schneider@arm.com> <20200415210512.805-10-valentin.schneider@arm.com> User-agent: mu4e 0.9.17; emacs 26.3 From: Valentin Schneider To: Vincent Guittot Cc: Dietmar Eggemann , linux-kernel , Ingo Molnar , Peter Zijlstra Subject: Re: [PATCH v3 9/9] sched/topology: Define and use shortcut pointers for wakeup sd_flag scan In-reply-to: Date: Thu, 16 Apr 2020 16:27:32 +0100 Message-ID: MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 16/04/20 14:36, Vincent Guittot wrote: >> Coming back to the v2 discussion on this patch >> >> https://lore.kernel.org/r/20200311181601.18314-10-valentin.schneider@arm.com >> >> SD_BALANCE_WAKE is not used in mainline anymore, so wakeups are always >> fast today. >> >> I.e. you wouldn't need a per_cpu(sd_balance_wake, cpu) since it's always >> NULL. >> >> I.e. want_affine logic and the 'for_each_domain(cpu, tmp)' isn't needed >> anymore. >> >> This will dramatically simplify the code in select_task_rq_fair(). >> >> But I guess Vincent wants to keep the functionality so we're able to >> enable SD_BALANCE_WAKE on certain sd's? > > I looked too quickly what was done by this patch. I thought that it > was adding a per_cpu pointer for all cases including the fast path > with wake affine but it only skips the for_each_domain loop for the > slow paths which don't need it because they are already slow. > > It would be better to keep the for_each_domain loop for slow paths and > to use a per_cpu pointer for fast_path/wake affine. Regarding the > wake_affine path, we don't really care about looping all domains and > we could directly use the highest domain because wake_affine() that is > used in the loop, only uses the imbalance_pct of the sched domain for > wake_affine_weight() and it should not harm to use only the highest > domain and then select_idle_sibling doesn't use it but the llc or > asym_capacity pointer instead. So Dietmar's pointing out that sd will always be NULL for want_affine, because want_affine can only be true at wakeups and we don't have any topologies with SD_BALANCE_WAKE anymore. We would still want to call wake_affine() though, because that can change new_cpu. What you are adding on top is that the only sd field used in wake_affine() is the imbalance_pct, so we could take a shortcut and just go for the highest domain with SD_WAKE_AFFINE; i.e. something like this: --- if (want_affine) { // We can cache that at topology buildup sd = highest_flag_domain(cpu, SD_WAKE_AFFINE); if (cpumask_test_cpu(prev_cpu, sched_domain_span(sd) && cpu != prev_cpu) new_cpu = wake_affine(); // Directly go to select_idle_sibling() goto sis; } // !want_affine logic here --- As for the !want_affine part, we could either keep the current domain walk (IIUC this is what you are suggesting) or go for the extra cached pointers I'm introducing. Now if we are a bit bolder than that, because there are no more (mainline) topologies w/ SD_BALANCE_WAKE, we could even turn the above into: --- if (wake_flags & WF_TTWU) { if (want_affine) { // We can cache that at topology buildup sd = highest_flag_domain(cpu, SD_WAKE_AFFINE); if (cpumask_test_cpu(prev_cpu, sched_domain_span(sd) && cpu != prev_cpu) new_cpu = wake_affine(); } // Directly go to select_idle_sibling() goto sis; } // !want_affine logic here --- This in turns mean we could get rid of SD_BALANCE_WAKE entirely... I'm a bit more reluctant to that only because the last SD_BALANCE_WAKE setter was removed fairly recently, see a526d466798d ("sched/topology: Remove SD_BALANCE_WAKE on asymmetric capacity systems")