Received: by 2002:a05:6a10:af89:0:0:0:0 with SMTP id iu9csp3534639pxb; Mon, 24 Jan 2022 11:37:29 -0800 (PST) X-Google-Smtp-Source: ABdhPJxCugbRFBoyT0pdEZO5ZvcLoZR7Mu8UOEDgZ8TqRCJ7kAfHJxbkcEjsmMAb7mw++Gg7IAo7 X-Received: by 2002:a17:90a:474d:: with SMTP id y13mr3448425pjg.4.1643053049115; Mon, 24 Jan 2022 11:37:29 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1643053049; cv=none; d=google.com; s=arc-20160816; b=YW8UAHzq3adTS9EX5z4uGFAYymsjZngp00So3UdmXlxND8vRYB7u+O8sGo3ubgKKTX GMEnxm6cYKVZ4psac2nftlWrnibFX/8Cwmaug2roEOKUJ3aYVpwmcKXOGE0+3LNaws/L w/GiR9F3JMLtSxeIOsfV/9tnEIhzWeeGHO6nsTNmzOsVD0iirG1nlOhMFH2KGRC86M/t 2kXH1D7mVh6+HCb37OMm1D5QU1xXA55sMs3XNiWfqK3Lz+Y8Dcc/cKeLYCgh71LCX0ZX 2gpO6OnwHJg8armNNLkqu/hYmVWiVrm976dHBS7xWTuUft/0IIgyUZBK97OYE0IaG4P/ Gbhw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:references :in-reply-to:subject:cc:to:from; bh=0J3l17uVDRhud8cVtRaJJUwYL88ZuQrrE24lDcbLhg0=; b=kDP0IMwDTgTP/OH+Dc9EG3vpy+TnOSMPoUl0EBIF3dJYccT+lKc1tEjrBWtY9hjC5w /x9294fGKhGzGZh47aHmr/JXGIIs73781QWldW/5cD1I8UMCdNWnAr5u9kg+iNusohrR mn7ro8kWXcwQuOin7NasFoc+7nWzrH8bPhG75xk8AsAqHUlr0GaRvKndIM4gpIIqVEuS l8nruh5jsmBhNV6YO+hkIBFgFlcYE/UK3fKu+Xuw0984EM/IxMZNf3iXkYP+4r96Gqo+ BZC965eynDyMKRxhDl1jsw2IADDjAaF0y3kmau8feroVlcCNnoYu94dC+Ueh9LcVVbSU tl3Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id j2si223216pjz.24.2022.01.24.11.37.16; Mon, 24 Jan 2022 11:37:29 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243937AbiAXQvT (ORCPT + 99 others); Mon, 24 Jan 2022 11:51:19 -0500 Received: from foss.arm.com ([217.140.110.172]:40402 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S243936AbiAXQvR (ORCPT ); Mon, 24 Jan 2022 11:51:17 -0500 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 537EDD6E; Mon, 24 Jan 2022 08:51:17 -0800 (PST) Received: from e113632-lin (usa-sjc-imap-foss1.foss.arm.com [10.121.207.14]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id DBF263F73B; Mon, 24 Jan 2022 08:51:15 -0800 (PST) From: Valentin Schneider To: Dietmar Eggemann , linux-kernel@vger.kernel.org Cc: John Keeping , Ingo Molnar , Peter Zijlstra , Juri Lelli , Vincent Guittot , Steven Rostedt , Ben Segall , Mel Gorman , Daniel Bristot de Oliveira Subject: Re: [PATCH] sched/rt: Plug rt_mutex_setprio() vs push_rt_task() race In-Reply-To: <349ada44-69bd-8653-a9f8-4f3d0f303392@arm.com> References: <20220120194037.650433-1-valentin.schneider@arm.com> <875yq945yi.mognet@arm.com> <349ada44-69bd-8653-a9f8-4f3d0f303392@arm.com> Date: Mon, 24 Jan 2022 16:51:10 +0000 Message-ID: <8735ld3wn5.mognet@arm.com> MIME-Version: 1.0 Content-Type: text/plain Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 24/01/22 16:47, Dietmar Eggemann wrote: > On 24/01/2022 14:29, Valentin Schneider wrote: >> On 24/01/22 10:37, Dietmar Eggemann wrote: >>> On 20/01/2022 20:40, Valentin Schneider wrote: >>> >>> [...] >>> >>>> diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c >>>> index 7b4f4fbbb404..48fc8c04b038 100644 >>>> --- a/kernel/sched/rt.c >>>> +++ b/kernel/sched/rt.c >>>> @@ -2026,6 +2026,16 @@ static int push_rt_task(struct rq *rq, bool pull) >>>> return 0; >>>> >>>> retry: >>>> + /* >>>> + * It's possible that the next_task slipped in of >>>> + * higher priority than current. If that's the case >>>> + * just reschedule current. >>>> + */ >>>> + if (unlikely(next_task->prio < rq->curr->prio)) { >>>> + resched_curr(rq); >>>> + return 0; >>>> + } >>> >>> If we do this before `is_migration_disabled(next_task), shouldn't then >>> the related condition in push_dl_task() also be moved up? >>> >>> if (dl_task(rq->curr) && >>> dl_time_before(next_task->dl.deadline, rq->curr->dl.deadline) && >>> rq->curr->nr_cpus_allowed > 1) >>> >>> To enforce resched_curr(rq) in the `is_migration_disabled(next_task)` >>> case there as well? >>> >> >> I'm not sure if we can hit the same issue with DL since DL doesn't have the >> push irqwork. If there are DL tasks on the rq when current gets demoted, >> switched_from_dl() won't queue pull_dl_task(). > > True. But with your RT change we reschedule current (CFS task or lower > rt task than next_task) now even in case next task is > migration-disabled. I.e. we prefer rescheduling over pushing current away. > > But for DL we wouldn't reschedule current in such a case, we would just > return 0. > > That said, the prio based check in RT includes other sched classes where > the DL check only compares DL tasks. > I think you got a point to at least align the RT and DL code, and yes we shouldn't care whether the next pushable DL task is migration_disabled or not if it's higher prio than current, so I think I'll move that in v2. >> That said, if say we have DL tasks on the rq and demote the current DL task >> to RT, do we currently have anything that will call resched_curr() (I'm >> looking at the rt_mutex path)? >> switched_to_fair() has a resched_curr() (which helps for the RT -> CFS >> case), I don't see anything that would give us that in switched_from_dl() / >> switched_to_rt(), or am I missing something? >> >>>> + >>>> if (is_migration_disabled(next_task)) { >>>> struct task_struct *push_task = NULL; >>>> int cpu; >>>> @@ -2033,6 +2043,17 @@ static int push_rt_task(struct rq *rq, bool pull) >>>> if (!pull || rq->push_busy) >>>> return 0; >>>> >>>> + /* >>>> + * Per the above priority check, curr is at least RT. If it's >>>> + * of a higher class than RT, invoking find_lowest_rq() on it >>>> + * doesn't make sense. >>>> + * >>>> + * Note that the stoppers are masqueraded as SCHED_FIFO >>>> + * (cf. sched_set_stop_task()), so we can't rely on rt_task(). >>>> + */ >>>> + if (rq->curr->sched_class != &rt_sched_class) >>> >>> s/ != / > / ... since the `unlikely(next_task->prio < rq->curr->prio)` >>> already filters tasks from lower sched classes (CFS)? >>> >> >> != points out we won't invoke find_lowest_rq() on anything that isn't RT, >> which makes it a bit clearer IMO, and it's not like either of those >> comparisons is more expensive than the other :) > > Also true, but it would be more aligned to the comment above '... If it > (i.e. curr) 's of a higher class than ...' > Right, I can clean that up! > [...]