Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp2268280ybt; Fri, 3 Jul 2020 05:11:58 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyjiij/jCH/6zwuriisd8xgJxjWF3f6Zigj3z587acqTDxbdu2pMhxRTIIEHXkoRi9C9N9M X-Received: by 2002:a17:906:355b:: with SMTP id s27mr33586567eja.368.1593778317949; Fri, 03 Jul 2020 05:11:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1593778317; cv=none; d=google.com; s=arc-20160816; b=n8VHBYXG0XyUoMK8g4Ia/SozqIwYcWTVF5FPCJS2HS4Up4QC6Uv6IuEnFPqh5x05Dm Q8O88HYfqBkvfAnafJvQylVeH8nibaEjDzIlZC6nK3d2sKHo9Fdia41JdZtaZ3Xca/oL X6rkUtj/14f0reukIOZ2mMGlowy2eN1dF/FPKyb7/gqnAycbZE/lAuehSe4MX8yEqLFs DyQi6A4OjSNeHd87DELCL9e54dbfNXb6BwZGid4c9uQhCio8tjFOWH41fSJIOFGEpAJl AKsWvfSwPQfyMbBwzZx75J2XkWuSOhNYa2uHh1pFYAXSTtxuriNDnXJ/wBiCzq1wrkIs GK7w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=qDWkT4OfDje0f0et6JWDE0B79Oq1t0eeba0qwfgusSk=; b=Ian9UzLvfC+bmC2STEMUdDcKUEXTBiYWk8BiqocETnahdbVrvbevscnkA4Ku6az2Ik efsjKDKX+OCjNpq/7kQWV+2ts40vBqjdcWrfDGQkPaXf/HfBrp/sWzwfRnvdUdmxsJ1F ytCJfv1A5FuQFZB6Dvff0ZozMywTcJWad9Zs+oe3bptmqsatLCEa4iMGVKuji+EbF6nn W6GWvFqYfefCeW27+IAGhjKQ9Gz7uID9602MZiajC9pNu5BlsLBYhhx95Eo/z13CWnwG 0w+Ovwd1d4DDhwFbZHdA5UtMjU6Xqq0rsUEMhe5w5uFSVyBC+E2JlDa0Wf16FCmNQssd Jd0Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=TBKr8hR9; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id 27si8667160edv.33.2020.07.03.05.11.34; Fri, 03 Jul 2020 05:11:57 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=TBKr8hR9; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726183AbgGCMJQ (ORCPT + 99 others); Fri, 3 Jul 2020 08:09:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:49844 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725984AbgGCMJQ (ORCPT ); Fri, 3 Jul 2020 08:09:16 -0400 Received: from mail-lf1-x141.google.com (mail-lf1-x141.google.com [IPv6:2a00:1450:4864:20::141]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E4284C08C5C1 for ; Fri, 3 Jul 2020 05:09:14 -0700 (PDT) Received: by mail-lf1-x141.google.com with SMTP id k15so18311499lfc.4 for ; Fri, 03 Jul 2020 05:09:14 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=qDWkT4OfDje0f0et6JWDE0B79Oq1t0eeba0qwfgusSk=; b=TBKr8hR9IsqEl/xMLoCv8bJqGWCEefjz14LZF3PT3TGDB7LnhGeuna1XWv+tM9j74F tlZU0x/io9vDE1vP+d829LTnIQdg7ZO4IwDpRlSd8AIlWYOUfAoTZRuslX4T5FeGCjQP AJ1uAk7OtSeLnt2SX+HWeWBIHgHzZUlwiewiBtDiQquVE/FDf06trsKdKuv0AW65SzoT OnuOOBMLhTNQdWNGSKJqaEvmhPPxVCijgT8iN5qZdJINFzQIsFynLFe8YN+ZewhTrlDL pnDFppCVOegachLfsVZgQhb3a6nD/Y497/J50PCLLqNzRaLMp5dNKVqAaWMo5kxSq5TV FC5A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=qDWkT4OfDje0f0et6JWDE0B79Oq1t0eeba0qwfgusSk=; b=R7BKAX7VzL3d/1Hxb3wA43Zj/1BqBGRGBvppEvM13tv+gm5t+CjY66J98/5ZfNouJz Y/CvO3xf7NEOcqHIfvmQF0Hae4YWaCjnjYRX3aeWTQ7RB7FKorhcbGsQsTC7yKysGDcj REpbSj8cNTaN/AI8OJMVoEzNaJq6iAjeQKP2IIX9UVLH/3RFoXqY9x4EEClOALDoLO7J zuYCdPKVjOXXhMMTvc74j5fRs7fb9kQkWkUG87HJ1BA96IJ41BpbZuskBKwE91GXwzX1 dsgIR4BedUhSlc8aGrpJnz4pzp6FRoIDmqFN9sXSWySU/IFmbAxBB/rlQ6XfEJyS0l4O uWoQ== X-Gm-Message-State: AOAM532oMs+2CKIPuffcKxGuHsl3PhrhIqGKJ0hENJ+fll1g1t3liX9L 8Lt3K4VQ/pCL97MdQOqtAnVDsmcPkBWETfcMwJ1XEQ== X-Received: by 2002:a05:6512:3190:: with SMTP id i16mr4757286lfe.184.1593778153369; Fri, 03 Jul 2020 05:09:13 -0700 (PDT) MIME-Version: 1.0 References: <20200630112123.12076-1-qais.yousef@arm.com> In-Reply-To: <20200630112123.12076-1-qais.yousef@arm.com> From: Vincent Guittot Date: Fri, 3 Jul 2020 14:09:02 +0200 Message-ID: Subject: Re: [PATCH v6 0/2] sched: Optionally skip uclamp logic in fast path To: Qais Yousef Cc: Ingo Molnar , Peter Zijlstra , Valentin Schneider , Juri Lelli , Dietmar Eggemann , Steven Rostedt , Ben Segall , Mel Gorman , Patrick Bellasi , Chris Redpath , Lukasz Luba , linux-kernel Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, 30 Jun 2020 at 13:22, Qais Yousef wrote: > > This series attempts to address the report that uclamp logic could be expensive > sometimes and shows a regression in netperf UDP_STREAM under certain > conditions. > > The first patch is a fix for how struct uclamp_rq is initialized which is > required by the 2nd patch which contains the real 'fix'. > > Worth noting that the root cause of the overhead is believed to be system > specific or related to potential certain code/data layout issues, leading to > worse I/D $ performance. > > Different systems exhibited different behaviors and the regression did > disappear in certain kernel version while attempting to reporoduce. > > More info can be found here: > > https://lore.kernel.org/lkml/20200616110824.dgkkbyapn3io6wik@e107158-lin/ > > Having the static key seemed the best thing to do to ensure the effect of > uclamp is minimized for kernels that compile it in but don't have a userspace > that uses it, which will allow distros to distribute uclamp capable kernels by > default without having to compromise on performance for some systems that could > be affected. > > Changes in v6: > * s/uclamp_is_enabled/uclamp_is_used/ + add comment > * Improve the bailout condition for the case where we could end up with > unbalanced call of uclamp_rq_dec_id() > * Clarify some comments. > > Changes in v5: > * Fix a race that could happen when order of enqueue/dequeue of tasks > A and B is not done in order, and sched_uclamp_used is enabled in > between. > * Add more comments explaining the race and the behavior of > uclamp_rq_util_with() which is now protected with a static key to be > a NOP. When no uclamp aggregation at rq level is done, this function > can't do much. > > Changes in v4: > * Fix broken boosting of RT tasks when static key is disabled. > > Changes in v3: > * Avoid double negatives and rename the static key to uclamp_used > * Unconditionally enable the static key through any of the paths where > the user can modify the default uclamp value. > * Use C99 named struct initializer for struct uclamp_rq which is easier > to read than the memset(). > > Changes in v2: > * Add more info in the commit message about the result of perf diff to > demonstrate that the activate/deactivate_task pressure is reduced in > the fast path. > > * Fix sparse warning reported by the test robot. > > * Add an extra commit about using static_branch_likely() instead of > static_branch_unlikely(). > > Thanks > > -- > Qais Yousef > > Cc: Juri Lelli > Cc: Vincent Guittot > Cc: Dietmar Eggemann > Cc: Steven Rostedt > Cc: Ben Segall > Cc: Mel Gorman > CC: Patrick Bellasi > Cc: Chris Redpath > Cc: Lukasz Luba > Cc: linux-kernel@vger.kernel.org > > > Qais Yousef (2): > sched/uclamp: Fix initialization of struct uclamp_rq > sched/uclamp: Protect uclamp fast path code with static key I have run the perf bench sched pipe that have have already run previously with this v6 and the results are similar to my previous tests: The impact is -1.61% similarly to v2 which is better compared the original -3.66% without your patch > > kernel/sched/core.c | 95 ++++++++++++++++++++++++++++++-- > kernel/sched/cpufreq_schedutil.c | 2 +- > kernel/sched/sched.h | 47 +++++++++++++++- > 3 files changed, 135 insertions(+), 9 deletions(-) > > -- > 2.17.1 >