From: Greg
 Kroah-Hartman
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman, stable@vger.kernel.org, Valentin Schneider, Tianchen Ding, "Peter Zijlstra (Intel)", Sasha Levin
Subject: [PATCH 5.15 660/779] sched: Fix the check of nr_running at queue wakelist
Date: Mon, 15 Aug 2022 20:05:04 +0200
Message-Id: <20220815180405.571526633@linuxfoundation.org>
X-Mailer: git-send-email 2.37.2
In-Reply-To: <20220815180337.130757997@linuxfoundation.org>
References: <20220815180337.130757997@linuxfoundation.org>
User-Agent: quilt/0.67
MIME-Version: 1.0
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: 8bit
X-Mailing-List: linux-kernel@vger.kernel.org

From: Tianchen Ding

[ Upstream commit 28156108fecb1f808b21d216e8ea8f0d205a530c ]

The commit 2ebb17717550 ("sched/core: Offload wakee task activation if it
the wakee is descheduling") checked rq->nr_running <= 1 to avoid task
stacking when WF_ON_CPU.

Per the ordering of writes to p->on_rq and p->on_cpu, observing p->on_cpu
(WF_ON_CPU) in ttwu_queue_cond() implies !p->on_rq, IOW p has gone through
the deactivate_task() in __schedule(), thus p has been accounted out of
rq->nr_running. As such, the task being the only runnable task on the rq
implies reading rq->nr_running == 0 at that point.

The benchmark result is in [1].

[1] https://lore.kernel.org/all/e34de686-4e85-bde1-9f3c-9bbc86b38627@linux.alibaba.com/

Suggested-by: Valentin Schneider
Signed-off-by: Tianchen Ding
Signed-off-by: Peter Zijlstra (Intel)
Reviewed-by: Valentin Schneider
Link: https://lore.kernel.org/r/20220608233412.327341-2-dtcccc@linux.alibaba.com
Signed-off-by: Sasha Levin
---
 kernel/sched/core.c | 6 +++++-
 1 file changed, 5 insertions(+), 1 deletion(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 5c7937b504d2..892c06ff9dd0 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -3735,8 +3735,12 @@ static inline bool ttwu_queue_cond(int cpu, int wake_flags)
 	 * CPU then use the wakelist to offload the task activation to
 	 * the soon-to-be-idle CPU as the current CPU is likely busy.
 	 * nr_running is checked to avoid unnecessary task stacking.
+	 *
+	 * Note that we can only get here with (wakee) p->on_rq=0,
+	 * p->on_cpu can be whatever, we've done the dequeue, so
+	 * the wakee has been accounted out of ->nr_running.
 	 */
-	if ((wake_flags & WF_ON_CPU) && cpu_rq(cpu)->nr_running <= 1)
+	if ((wake_flags & WF_ON_CPU) && !cpu_rq(cpu)->nr_running)
 		return true;
 
 	return false;
-- 
2.35.1
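
For readers who want to see the accounting argument from the changelog
spelled out, here is a rough userspace sketch. It is not kernel code: the
model_rq struct and the model_* helpers are hypothetical names made up for
illustration, and the only thing modelled is that the wakee has already
been dequeued (and so subtracted from nr_running) by the time the wakelist
condition is evaluated with WF_ON_CPU set.

```c
#include <stdbool.h>

/* Hypothetical stand-in for the kernel runqueue; only the counter matters here. */
struct model_rq {
	unsigned int nr_running;
};

/* Old check from 2ebb17717550: offload if at most one task is accounted. */
static bool model_old_cond(const struct model_rq *rq, bool wf_on_cpu)
{
	return wf_on_cpu && rq->nr_running <= 1;
}

/* New check from this patch: offload only if no task is accounted. */
static bool model_new_cond(const struct model_rq *rq, bool wf_on_cpu)
{
	return wf_on_cpu && !rq->nr_running;
}

/*
 * Model a wakeup that observes p->on_cpu on a rq that held
 * nr_running_before tasks (wakee included) before the wakee descheduled.
 * The dequeue has already happened, so the wakee is subtracted first.
 */
static bool model_use_wakelist(unsigned int nr_running_before, bool use_new_check)
{
	struct model_rq rq = { .nr_running = nr_running_before };

	/* Assumption of the model: deactivate_task() already ran for the wakee. */
	rq.nr_running--;

	return use_new_check ? model_new_cond(&rq, true)
			     : model_old_cond(&rq, true);
}
```

With the wakee as the only task (nr_running_before == 1) both checks
return true. With one other runnable task on the rq (nr_running_before
== 2) the counter reads 1 after the dequeue, so the old check still
returns true and would stack the wakee behind that task, while the new
check returns false.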