Date: Sat, 19 Aug 2023 06:08:48 +0000
In-Reply-To: <20230819060915.3001568-1-jstultz@google.com>
Mime-Version: 1.0
References: <20230819060915.3001568-1-jstultz@google.com>
X-Mailer: git-send-email 2.42.0.rc1.204.g551eb34607-goog
Message-ID: <20230819060915.3001568-15-jstultz@google.com>
Subject: [PATCH v5 14/19] sched: Add a very simple proxy() function
From: John Stultz
To: LKML
Cc: John Stultz, Joel Fernandes, Qais Yousef, Ingo Molnar,
 Peter Zijlstra, Juri Lelli, Vincent Guittot, Dietmar Eggemann,
 Valentin Schneider, Steven Rostedt, Ben Segall, Zimuzo Ezeozue,
 Youssef Esmat, Mel Gorman, Daniel Bristot de Oliveira, Will Deacon,
 Waiman Long, Boqun Feng, "Paul E . McKenney", kernel-team@android.com
Content-Type: text/plain; charset="UTF-8"
Precedence: bulk
X-Mailing-List: linux-kernel@vger.kernel.org

This adds a very simple proxy() function so that if we select a
blocked task to run, we deactivate it and pick again. The
exception is a task that has become unblocked after proxy() was
called; that task is returned and run.

Greatly simplified from patch by:
  Peter Zijlstra (Intel)
  Juri Lelli
  Valentin Schneider
  Connor O'Brien

Cc: Joel Fernandes
Cc: Qais Yousef
Cc: Ingo Molnar
Cc: Peter Zijlstra
Cc: Juri Lelli
Cc: Vincent Guittot
Cc: Dietmar Eggemann
Cc: Valentin Schneider
Cc: Steven Rostedt
Cc: Ben Segall
Cc: Zimuzo Ezeozue
Cc: Youssef Esmat
Cc: Mel Gorman
Cc: Daniel Bristot de Oliveira
Cc: Will Deacon
Cc: Waiman Long
Cc: Boqun Feng
Cc: "Paul E . McKenney"
Cc: kernel-team@android.com
[jstultz: Split out from larger proxy patch and simplified
 for review and testing.]
Signed-off-by: John Stultz
---
v5:
* Split out from larger proxy patch
---
 kernel/sched/core.c | 89 +++++++++++++++++++++++++++++++++++++++++++--
 kernel/sched/rt.c   | 19 +++++++++-
 2 files changed, 102 insertions(+), 6 deletions(-)

diff --git a/kernel/sched/core.c b/kernel/sched/core.c
index 76a42f21dda7..72d0803c7d47 100644
--- a/kernel/sched/core.c
+++ b/kernel/sched/core.c
@@ -6562,11 +6562,12 @@ pick_next_task(struct rq *rq, struct task_struct *prev, struct rq_flags *rf)
 # define SM_MASK_PREEMPT	SM_PREEMPT
 #endif
 
-bool try_to_deactivate_task(struct rq *rq, struct task_struct *p, unsigned long state)
+bool try_to_deactivate_task(struct rq *rq, struct task_struct *p,
+			    unsigned long state, bool deactivate_cond)
 {
 	if (signal_pending_state(state, p)) {
 		WRITE_ONCE(p->__state, TASK_RUNNING);
-	} else {
+	} else if (deactivate_cond) {
 		p->sched_contributes_to_load =
 			(state & TASK_UNINTERRUPTIBLE) &&
 			!(state & TASK_NOLOAD) &&
@@ -6597,6 +6598,74 @@ bool try_to_deactivate_task(struct rq *rq, struct task_struct *p, unsigned long
 	return false;
 }
 
+#ifdef CONFIG_PROXY_EXEC
+/*
+ * Initial simple proxy that just returns the task if it's waking
+ * or deactivates the blocked task so we can pick something that
+ * isn't blocked.
+ */
+static struct task_struct *
+proxy(struct rq *rq, struct task_struct *next, struct rq_flags *rf)
+{
+	struct task_struct *p = next;
+	struct mutex *mutex;
+	unsigned long state;
+
+	mutex = p->blocked_on;
+	/* Something changed in the chain, pick_again */
+	if (!mutex)
+		return NULL;
+	/*
+	 * By taking mutex->wait_lock we hold off concurrent mutex_unlock()
+	 * and ensure @owner sticks around.
+	 */
+	raw_spin_lock(&mutex->wait_lock);
+	raw_spin_lock(&p->blocked_lock);
+
+	/* Check again that p is blocked with blocked_lock held */
+	if (!task_is_blocked(p) || mutex != p->blocked_on) {
+		/*
+		 * Something changed in the blocked_on chain and
+		 * we don't know if only at this level. So, let's
+		 * just bail out completely and let __schedule
+		 * figure things out (pick_again loop).
+		 */
+		raw_spin_unlock(&p->blocked_lock);
+		raw_spin_unlock(&mutex->wait_lock);
+		return NULL;
+	}
+
+	state = READ_ONCE(p->__state);
+	/* Don't deactivate if the state has been changed to TASK_RUNNING */
+	if (!state) {
+		raw_spin_unlock(&p->blocked_lock);
+		raw_spin_unlock(&mutex->wait_lock);
+		return p;
+	}
+
+	try_to_deactivate_task(rq, next, state, true);
+
+	/*
+	 * If next is the selected task, then remove lingering
+	 * references to it from rq and sched_class structs after
+	 * dequeueing.
+	 */
+	put_prev_task(rq, next);
+	rq_set_selected(rq, rq->idle);
+	resched_curr(rq);
+	raw_spin_unlock(&p->blocked_lock);
+	raw_spin_unlock(&mutex->wait_lock);
+	return NULL;
+}
+#else /* PROXY_EXEC */
+static struct task_struct *
+proxy(struct rq *rq, struct task_struct *next, struct rq_flags *rf)
+{
+	BUG(); // This should never be called in the !PROXY case
+	return next;
+}
+#endif /* PROXY_EXEC */
+
 /*
  * __schedule() is the main scheduler function.
  *
@@ -6687,12 +6756,24 @@ static void __sched notrace __schedule(unsigned int sched_mode)
 	 */
 	prev_state = READ_ONCE(prev->__state);
 	if (!(sched_mode & SM_MASK_PREEMPT) && prev_state) {
-		try_to_deactivate_task(rq, prev, prev_state);
+		try_to_deactivate_task(rq, prev, prev_state,
+				       !task_is_blocked(prev));
 		switch_count = &prev->nvcsw;
 	}
 
-	next = pick_next_task(rq, prev, &rf);
+pick_again:
+	next = pick_next_task(rq, rq_selected(rq), &rf);
 	rq_set_selected(rq, next);
+	if (unlikely(task_is_blocked(next))) {
+		next = proxy(rq, next, &rf);
+		if (!next) {
+			rq_unpin_lock(rq, &rf);
+			__balance_callbacks(rq);
+			rq_repin_lock(rq, &rf);
+			goto pick_again;
+		}
+	}
+
 	clear_tsk_need_resched(prev);
 	clear_preempt_need_resched();
 #ifdef CONFIG_SCHED_DEBUG
diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c
index 6d9036547c1d..cfbf3925e595 100644
--- a/kernel/sched/rt.c
+++ b/kernel/sched/rt.c
@@ -1537,8 +1537,19 @@ enqueue_task_rt(struct rq *rq, struct task_struct *p, int flags)
 
 	enqueue_rt_entity(rt_se, flags);
 
-	if (!task_current(rq, p) && p->nr_cpus_allowed > 1)
-		enqueue_pushable_task(rq, p);
+	/*
+	 * Current can't be pushed away. Selected is tied to current,
+	 * so don't push it either.
+	 */
+	if (task_current(rq, p) || task_current_selected(rq, p))
+		return;
+	/*
+	 * Pinned tasks can't be pushed.
+	 */
+	if (p->nr_cpus_allowed == 1)
+		return;
+
+	enqueue_pushable_task(rq, p);
 }
 
 static void dequeue_task_rt(struct rq *rq, struct task_struct *p, int flags)
@@ -1825,6 +1836,10 @@ static void put_prev_task_rt(struct rq *rq, struct task_struct *p)
 
 	update_rt_rq_load_avg(rq_clock_pelt(rq), rq, 1);
 
+	/* Avoid marking selected as pushable */
+	if (task_current_selected(rq, p))
+		return;
+
 	/*
 	 * The previous task needs to be made eligible for pushing
 	 * if it is still active
-- 
2.42.0.rc1.204.g551eb34607-goog
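
For anyone who wants to reason about the pick_again flow outside the kernel tree, here is a rough userspace model of it. This is only a sketch under loud assumptions, not the code from the patch: toy_task, toy_pick(), toy_proxy() and toy_schedule() are hypothetical names, all locking (wait_lock, blocked_lock) and the blocked_on recheck are left out, and array order stands in for scheduling priority.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace stand-in for a task; not a kernel type. */
struct toy_task {
	const char *name;
	bool queued;	/* still on the toy runqueue */
	bool blocked;	/* models task_is_blocked() */
	bool running;	/* models __state == TASK_RUNNING (already waking) */
};

/* Pick the first queued task; array order stands in for priority. */
static struct toy_task *toy_pick(struct toy_task *tasks, size_t n)
{
	for (size_t i = 0; i < n; i++)
		if (tasks[i].queued)
			return &tasks[i];
	return NULL;
}

/*
 * Models the simple proxy(): a blocked task that is already waking is
 * returned as-is; any other blocked task is dequeued and NULL is
 * returned so the caller picks again.
 */
static struct toy_task *toy_proxy(struct toy_task *next)
{
	assert(next->blocked);
	if (next->running)
		return next;
	next->queued = false;	/* stands in for try_to_deactivate_task() */
	return NULL;
}

/* Models the pick_again loop added to __schedule(). */
static struct toy_task *toy_schedule(struct toy_task *tasks, size_t n)
{
	for (;;) {
		struct toy_task *next = toy_pick(tasks, n);

		if (!next)
			return NULL;	/* the kernel would run the idle task */
		if (!next->blocked)
			return next;
		next = toy_proxy(next);
		if (next)
			return next;
		/* proxy dequeued the blocked task: pick again */
	}
}
```

Each pass through the loop either returns a task or dequeues one, so the model terminates after at most n iterations. The real loop additionally drops and re-takes the rq lock pin around __balance_callbacks() before retrying, which this sketch does not attempt to model.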