Received: by 2002:a05:7412:8521:b0:e2:908c:2ebd with SMTP id t33csp2357419rdf; Mon, 6 Nov 2023 11:37:27 -0800 (PST) X-Google-Smtp-Source: AGHT+IEqWqOLbyOk2lY2DIZqqg4kQ9ZNDJbnGxfBIDwt0SwsjwoDLJSC1qOs8sYlzAVsjrirv3IQ X-Received: by 2002:a05:6358:7e07:b0:169:98e4:aeeb with SMTP id o7-20020a0563587e0700b0016998e4aeebmr18001527rwm.23.1699299446915; Mon, 06 Nov 2023 11:37:26 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1699299446; cv=none; d=google.com; s=arc-20160816; b=SIatbH/kPkdWFRtZVCM+kunZlH7cDklyRYHRvuyrqZwStMLx14JNLmqNyGAOi/DL8C 60TgLQXg/bB5tyERQJKcMF4E3H5oZChgDD1Jwqd4haQL9l8HhrcTfmwwUoYsIe3d8L6b isrXsDLhC/aSRYBIuv0NgkoIhr+Bay2aJhlF2ZGGgn5ISru2DEGnInx6oCkFYaO2i2kB 7fFzXuv9KJinjUND5zCfbvm/FnK2zcSSG6F3BzMtOCbhoyq3jrIm1GycK3y57Mt7yg9L 9rZI1Gk1p+MiHJkF+DcHNd6LaWaMFTGeEBLHoeKcyf2tv/SiGLdI9QbU9UfVSjMaUQPL Qs3g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:from:subject:message-id:references :mime-version:in-reply-to:date:dkim-signature; bh=wSuDJnvRlC5g6NjMbMGU+ddx8uvBAfxXgo2joCjm/cQ=; fh=7UDRnXAyFwoHSGMn0t+ldgDKuKM1JE0l1TSpusfeMFI=; b=Hq53xbpc/h5e2iWLzJrkx4KiPu9ahiez524wRwJL0aGjDxc+3ziPljMdA0Wbk2YYuU AsAynblPY0n1Akyz6ZQUu9KiImreVWV477flcHZXcQUKaE3ISCaQUJpQXZZ2WAtDX2Hb wL/yvlpW4S2w956Zk361R6ETlH4AMMOgVdK9E8TAEaiDYGPY5gJWM4PYGNLsbuu2vFsw TnopojOEImlr0usbhSzORRVxwJQFsRRmkv9V5Pu88AEL1ssDIpmzl/+MVNa6uWsXzxcF na/kSdCVXgA0UrWyb4PEiol7HQwL5RZJ85JItJAzDufsZuzp8fAJWW4wAKwOPAGlQZhB XQvA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=UsPss4Bc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from snail.vger.email (snail.vger.email. [23.128.96.37]) by mx.google.com with ESMTPS id w64-20020a638243000000b005b8afa325b8si355711pgd.410.2023.11.06.11.37.26 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 06 Nov 2023 11:37:26 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) client-ip=23.128.96.37; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20230601 header.b=UsPss4Bc; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id F3C96801D49B; Mon, 6 Nov 2023 11:37:23 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233093AbjKFThB (ORCPT + 99 others); Mon, 6 Nov 2023 14:37:01 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52444 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232927AbjKFTgj (ORCPT ); Mon, 6 Nov 2023 14:36:39 -0500 Received: from mail-yw1-x1149.google.com (mail-yw1-x1149.google.com [IPv6:2607:f8b0:4864:20::1149]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 547E1D42 for ; Mon, 6 Nov 2023 11:36:06 -0800 (PST) Received: by mail-yw1-x1149.google.com with SMTP id 00721157ae682-5b064442464so66639117b3.3 for ; Mon, 06 Nov 2023 11:36:06 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20230601; t=1699299365; x=1699904165; darn=vger.kernel.org; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:from:to:cc:subject:date:message-id:reply-to; bh=wSuDJnvRlC5g6NjMbMGU+ddx8uvBAfxXgo2joCjm/cQ=; b=UsPss4Bc0ny3I6D0n8iVXhIqmBoX3q/V35zjhKAUkxvwKy01LuU9Qs8dzzqIEPG+tf IOzyuQ9phsFG4Ib+lh3Obgmpz69Q7LUgO4TIKqfHHl76kl3Vd5Tm8aUNVYCmlvISBFg9 qSAVT0wd44yaV3IEYQuP1WD989pJldj+s1tGfveGflPhmRpQYHIWsV/iF6d0Pd6sWl1X Ja54hE8III8lB889Yftiz+PRMaurrPXLkn7qABfheeMknRZN69lSmh94RDgIUx49/s5C ewqla6GB/E+tMj88GLeZR9w0Y9YmyEJNCVoaQXPgjy2Sx5h2kWyMroz10IbxdKar71rw rIMA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1699299365; x=1699904165; h=cc:to:from:subject:message-id:references:mime-version:in-reply-to :date:x-gm-message-state:from:to:cc:subject:date:message-id:reply-to; bh=wSuDJnvRlC5g6NjMbMGU+ddx8uvBAfxXgo2joCjm/cQ=; b=p/mJIwu1g8YY324YFDEcQesE1qDdHwS6XmRbTffWKsrLwWw7pdQ9RgoUiPwRgSD+up vmrcmTepTTXIxuTjBeXjrUAvKF5HlYREsngwk46eTeS38jo/xDL1514BWQO0iS7AiQNp mHzSSRA1f15+Y2axGudXgpAN9axaTQP3Kgme/jObhX+oCmfRVo8vcJ/SiyyODYDn+Rg9 X4yvjGgasVld9p9aYdVDDDgTt0Troye7eP0dmJpuo8KX2Tl5bxg3pNMa5EMuujdC6J1S mmqgI5aYIRJShADeB72vpf2zoxGvyTPNwx5AOf2lPXZInzJtPdRDZzw0JZ53E8kmLxLc +q9g== X-Gm-Message-State: AOJu0Yw83f79s3dSQ3GhGjyW/TWBzda5iUyUU+Rv5TNfngIH7cdBBwVw Cq0bo7t7z+lJvEE7RocTuj4+pPw90RUjjljTCp7s7XlFmzOQfS1xheVxzRNIGhAaUnFw/IPkjRG ixghEvkoZRxfPQDvRh6M14pzUgf8ZtcIBqRWAOEITJ/mhGwK3FTuX4tpcUCvIWsKxwQ4esHM= X-Received: from jstultz-noogler2.c.googlers.com ([fda3:e722:ac3:cc00:24:72f4:c0a8:600]) (user=jstultz job=sendgmr) by 2002:a81:4958:0:b0:59f:3cde:b33a with SMTP id w85-20020a814958000000b0059f3cdeb33amr239418ywa.6.1699299363609; Mon, 06 Nov 2023 11:36:03 -0800 (PST) Date: Mon, 6 Nov 2023 19:34:57 +0000 In-Reply-To: <20231106193524.866104-1-jstultz@google.com> Mime-Version: 1.0 References: <20231106193524.866104-1-jstultz@google.com> X-Mailer: git-send-email 2.42.0.869.gea05f2083d-goog Message-ID: <20231106193524.866104-15-jstultz@google.com> Subject: [PATCH v6 14/20] sched: Add a very simple proxy() function From: John Stultz To: LKML Cc: John Stultz , Joel Fernandes , Qais Yousef , Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Dietmar Eggemann , Valentin Schneider , Steven Rostedt , Ben Segall , Zimuzo Ezeozue , Youssef Esmat , Mel Gorman , Daniel Bristot de Oliveira , Will Deacon , Waiman Long , Boqun Feng , "Paul E . McKenney" , kernel-team@android.com Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-9.6 required=5.0 tests=BAYES_00,DKIMWL_WL_MED, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE,USER_IN_DEF_DKIM_WL autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Mon, 06 Nov 2023 11:37:24 -0800 (PST) This adds a very simple proxy() function so if we select a blocked task to run, we will deactivate it and pick again. The exception being if it has become unblocked after proxy() was called. Greatly simplified from patch by: Peter Zijlstra (Intel) Juri Lelli Valentin Schneider Connor O'Brien Cc: Joel Fernandes Cc: Qais Yousef Cc: Ingo Molnar Cc: Peter Zijlstra Cc: Juri Lelli Cc: Vincent Guittot Cc: Dietmar Eggemann Cc: Valentin Schneider Cc: Steven Rostedt Cc: Ben Segall Cc: Zimuzo Ezeozue Cc: Youssef Esmat Cc: Mel Gorman Cc: Daniel Bristot de Oliveira Cc: Will Deacon Cc: Waiman Long Cc: Boqun Feng Cc: "Paul E . McKenney" Cc: kernel-team@android.com [jstultz: Split out from larger proxy patch and simplified for review and testing.] Signed-off-by: John Stultz --- v5: * Split out from larger proxy patch --- kernel/sched/core.c | 89 +++++++++++++++++++++++++++++++++++++++++++-- kernel/sched/rt.c | 19 +++++++++- 2 files changed, 102 insertions(+), 6 deletions(-) diff --git a/kernel/sched/core.c b/kernel/sched/core.c index 1b38b34d3f64..5770656b898d 100644 --- a/kernel/sched/core.c +++ b/kernel/sched/core.c @@ -6575,11 +6575,12 @@ pick_next_task(struct rq *rq, struct task_struct *prev, struct rq_flags *rf) # define SM_MASK_PREEMPT SM_PREEMPT #endif -static bool try_to_deactivate_task(struct rq *rq, struct task_struct *p, unsigned long state) +static bool try_to_deactivate_task(struct rq *rq, struct task_struct *p, + unsigned long state, bool deactivate_cond) { if (signal_pending_state(state, p)) { WRITE_ONCE(p->__state, TASK_RUNNING); - } else { + } else if (deactivate_cond) { p->sched_contributes_to_load = (state & TASK_UNINTERRUPTIBLE) && !(state & TASK_NOLOAD) && @@ -6610,6 +6611,74 @@ static bool try_to_deactivate_task(struct rq *rq, struct task_struct *p, unsigne return false; } +#ifdef CONFIG_PROXY_EXEC +/* + * Initial simple proxy that just returns the task if its waking + * or deactivates the blocked task so we can pick something that + * isn't blocked. + */ +static struct task_struct * +proxy(struct rq *rq, struct task_struct *next, struct rq_flags *rf) +{ + struct task_struct *p = next; + struct mutex *mutex; + unsigned long state; + + mutex = p->blocked_on; + /* Something changed in the chain, pick_again */ + if (!mutex) + return NULL; + /* + * By taking mutex->wait_lock we hold off concurrent mutex_unlock() + * and ensure @owner sticks around. + */ + raw_spin_lock(&mutex->wait_lock); + raw_spin_lock(&p->blocked_lock); + + /* Check again that p is blocked with blocked_lock held */ + if (!task_is_blocked(p) || mutex != p->blocked_on) { + /* + * Something changed in the blocked_on chain and + * we don't know if only at this level. So, let's + * just bail out completely and let __schedule + * figure things out (pick_again loop). + */ + raw_spin_unlock(&p->blocked_lock); + raw_spin_unlock(&mutex->wait_lock); + return NULL; + } + + state = READ_ONCE(p->__state); + /* Don't deactivate if the state has been changed to TASK_RUNNING */ + if (!state) { + raw_spin_unlock(&p->blocked_lock); + raw_spin_unlock(&mutex->wait_lock); + return p; + } + + try_to_deactivate_task(rq, next, state, true); + + /* + * If next is the selected task, then remove lingering + * references to it from rq and sched_class structs after + * dequeueing. + */ + put_prev_task(rq, next); + rq_set_selected(rq, rq->idle); + resched_curr(rq); + raw_spin_unlock(&p->blocked_lock); + raw_spin_unlock(&mutex->wait_lock); + return NULL; +} +#else /* PROXY_EXEC */ +static struct task_struct * +proxy(struct rq *rq, struct task_struct *next, struct rq_flags *rf) +{ + BUG(); // This should never be called in the !PROXY case + return next; +} +#endif /* PROXY_EXEC */ + /* * __schedule() is the main scheduler function. * @@ -6700,12 +6769,24 @@ static void __sched notrace __schedule(unsigned int sched_mode) */ prev_state = READ_ONCE(prev->__state); if (!(sched_mode & SM_MASK_PREEMPT) && prev_state) { - try_to_deactivate_task(rq, prev, prev_state); + try_to_deactivate_task(rq, prev, prev_state, + !task_is_blocked(prev)); switch_count = &prev->nvcsw; } - next = pick_next_task(rq, prev, &rf); +pick_again: + next = pick_next_task(rq, rq_selected(rq), &rf); rq_set_selected(rq, next); + if (unlikely(task_is_blocked(next))) { + next = proxy(rq, next, &rf); + if (!next) { + rq_unpin_lock(rq, &rf); + __balance_callbacks(rq); + rq_repin_lock(rq, &rf); + goto pick_again; + } + } + clear_tsk_need_resched(prev); clear_preempt_need_resched(); #ifdef CONFIG_SCHED_DEBUG diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c index bc243e70bc0e..0125a3ae5a7a 100644 --- a/kernel/sched/rt.c +++ b/kernel/sched/rt.c @@ -1537,8 +1537,19 @@ enqueue_task_rt(struct rq *rq, struct task_struct *p, int flags) enqueue_rt_entity(rt_se, flags); - if (!task_current(rq, p) && p->nr_cpus_allowed > 1) - enqueue_pushable_task(rq, p); + /* + * Current can't be pushed away. Selected is tied to current, + * so don't push it either. + */ + if (task_current(rq, p) || task_current_selected(rq, p)) + return; + /* + * Pinned tasks can't be pushed. + */ + if (p->nr_cpus_allowed == 1) + return; + + enqueue_pushable_task(rq, p); } static void dequeue_task_rt(struct rq *rq, struct task_struct *p, int flags) @@ -1825,6 +1836,10 @@ static void put_prev_task_rt(struct rq *rq, struct task_struct *p) update_rt_rq_load_avg(rq_clock_pelt(rq), rq, 1); + /* Avoid marking selected as pushable */ + if (task_current_selected(rq, p)) + return; + /* * The previous task needs to be made eligible for pushing * if it is still active -- 2.42.0.869.gea05f2083d-goog