Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp2208577ybz; Thu, 23 Apr 2020 13:37:33 -0700 (PDT) X-Google-Smtp-Source: APiQypI4C+fEZoy8/acHzoi/5GIgjHxgHQY1sjRMfwWH76xjD5mDMh3D0xyOIMNPtJt0Q+N5vGG9 X-Received: by 2002:a17:906:5608:: with SMTP id f8mr4427573ejq.190.1587674252792; Thu, 23 Apr 2020 13:37:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1587674252; cv=none; d=google.com; s=arc-20160816; b=xqv4DOAXmQ0a5oKMnK6AlU/t9rpn8AQJDP3y5yhyuAMPNW1+M2gLdRPiE3zRGcOmtO kLhqUIoNVXShskOrc06ldpOWHvjh/ovys+Y0t6iVmRo/quwV2gGwnVqA4QBuRCnIoGnM Hpchmwd/UDoKK0loqgWZVOiDy0uHURBegxYQzn5yUddnKa/f4NZbZE4ygnbbOuAl8rk/ tCan5jklcZ7PsDC89+5JmE5vLG3wp8DmgnL/BQr8IKUc92M3N92h+/VCFLdk+E/OJab/ ALt6d/mspbM97/s0HbUeq/OX53ND1MKZHYEkm9gtisn6IfpdYpDT2+HQuDs9uAV0F68Y doww== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:message-id :in-reply-to:date:references:subject:cc:to:from:dkim-signature; bh=WqMcHBrQL5Hrx/5UPlvgx3z4tpcct1oI0fprZGVyhow=; b=QwZWbCEtKIuT8OiwQ5FQ23XQYxad39/tyO710F8aeFu2INGcz2taTW3sZ6BrgYzs7Y OtW0nnxcFTEDBN30BfvQbTfMkpuExWdwqTWfmxj34/A/sYI3omLsaMkMmyy1oz4kSzVZ H17Cj1H5YjGqzFL76c1fQyXQEdN8tbVXnv+lZ2lTsyUCX8Z3F0xMhzxm/0g5LWnpW28s 4hrSnO3dcwFx4ZfMdMh6S5W79aPzpe9mu5ASfozt6+DCCl8DjaVEzALWIBkGF6eU5bmA v3jBqldWBlVUTJ04BZnznSnYAjaECdfOG4MMR+1nAA0/kmSDhRm4X/4eI4LrtDrQMopV u5Tw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=sm3n1yn1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a19si1895834ejs.131.2020.04.23.13.37.09; Thu, 23 Apr 2020 13:37:32 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=sm3n1yn1; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726071AbgDWUdj (ORCPT + 99 others); Thu, 23 Apr 2020 16:33:39 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44518 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725877AbgDWUdj (ORCPT ); Thu, 23 Apr 2020 16:33:39 -0400 Received: from mail-pg1-x541.google.com (mail-pg1-x541.google.com [IPv6:2607:f8b0:4864:20::541]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id CC684C09B042 for ; Thu, 23 Apr 2020 13:33:37 -0700 (PDT) Received: by mail-pg1-x541.google.com with SMTP id n16so3446446pgb.7 for ; Thu, 23 Apr 2020 13:33:37 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=from:to:cc:subject:references:date:in-reply-to:message-id :user-agent:mime-version; bh=WqMcHBrQL5Hrx/5UPlvgx3z4tpcct1oI0fprZGVyhow=; b=sm3n1yn1FEv8KlppYnJA3szSMesRSg+OEYPnihsFc2ITcWOXMlhTGDwdfSRy2OUMTu WD6Pf5xk6bydVlTjzR+2xJZ//sYi0mBGJoID6eJmV8YCgjbXKOaLI0PsFjRPQ7kxziG1 iGqTHbnmqrHdempnnkY3rDxrvTXc55MvLIlU0ZAx5+BLQHxz68U0XcuyAJWchpO4X4lb zx4Dy1CHk4lpIbeY84XVqDfINyVycVreY+5vvnEkf71VzPP+OA0hfoP+aDUnZDuB4XEy Qp/Of4Tj2E3WMiEOqJvvx8tzK+PpGA7QyBG+o/X0PguA6mcYP3v+M3UZng/zxgNMBFNx nJYQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:references:date:in-reply-to :message-id:user-agent:mime-version; bh=WqMcHBrQL5Hrx/5UPlvgx3z4tpcct1oI0fprZGVyhow=; b=CRzHIiQm1ZNDrqrLfVNx72k/sRI7v5zEQsg9Ix1kGmxRmISzdPI/c56gLT+7oIIu/G IzTH5ypNs689Kl5tKS+2atEvVlNXFQtUTXFxBjaNDnwGWrZyrExu2RlWtQ853YS6Sy5+ STIYGUAXaqsuidEDTc6gUrI/YV8rmV9xptkugBebvYTIhRPOTGKT3Q1ohjG8EuoTb1n0 U5z1SUNWjnBs8ruHM+3918kd+0ehZX0UcC3v1uMty5kky+ubnU6KNs1q7eFyWdzbtNDs iRYdnULZbfYsZ9pTa/g85TLIalv/FelNyy3Mrs7WHKyDPQcbhDfhUiE0eHRN4L3aqwVT UOMQ== X-Gm-Message-State: AGi0PuaPkGqaEPzvJXXuNgm6IownU6qEVVJsi3kiD2VDlToBPFFXpoBw 8H1CLGK9OqlrzVDaLjwKLLXIAQ== X-Received: by 2002:a63:1a0b:: with SMTP id a11mr5278109pga.29.1587674016957; Thu, 23 Apr 2020 13:33:36 -0700 (PDT) Received: from bsegall-glaptop.localhost (c-73-71-82-80.hsd1.ca.comcast.net. [73.71.82.80]) by smtp.gmail.com with ESMTPSA id o11sm2832171pgd.58.2020.04.23.13.33.34 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 23 Apr 2020 13:33:35 -0700 (PDT) From: bsegall@google.com To: Huaixin Chang Cc: bsegall@google.com, chiluk+linux@indeed.com, linux-kernel@vger.kernel.org, mingo@redhat.com, pauld@redhead.com, peterz@infradead.org, vincent.guittot@linaro.org Subject: Re: [PATCH] sched: Defend cfs and rt bandwidth quota against overflow References: <20200423133752.12787-1-changhuaixin@linux.alibaba.com> Date: Thu, 23 Apr 2020 13:33:33 -0700 In-Reply-To: <20200423133752.12787-1-changhuaixin@linux.alibaba.com> (Huaixin Chang's message of "Thu, 23 Apr 2020 21:37:52 +0800") Message-ID: User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.3 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Huaixin Chang writes: > When users write some huge number into cpu.cfs_quota_us or > cpu.rt_runtime_us, overflow might happen during to_ratio() shifts of > schedulable checks. > > to_ratio() could be altered to avoid unnecessary internal overflow, but > min_cfs_quota_period is less than 1 << BW_SHIFT, so a cutoff would still > be needed. Set a cap MAX_BW for cfs_quota_us and rt_runtime_us to > prevent overflow. > > Signed-off-by: Huaixin Chang > --- > kernel/sched/core.c | 8 ++++++++ > kernel/sched/rt.c | 9 +++++++++ > kernel/sched/sched.h | 2 ++ > 3 files changed, 19 insertions(+) > > diff --git a/kernel/sched/core.c b/kernel/sched/core.c > index 3a61a3b8eaa9..0be1782e15c9 100644 > --- a/kernel/sched/core.c > +++ b/kernel/sched/core.c > @@ -7390,6 +7390,8 @@ static DEFINE_MUTEX(cfs_constraints_mutex); > > const u64 max_cfs_quota_period = 1 * NSEC_PER_SEC; /* 1s */ > static const u64 min_cfs_quota_period = 1 * NSEC_PER_MSEC; /* 1ms */ > +/* More than 203 days if BW_SHIFT equals 20. */ > +static const u64 max_cfs_runtime = MAX_BW * NSEC_PER_USEC; > > static int __cfs_schedulable(struct task_group *tg, u64 period, u64 runtime); > > @@ -7417,6 +7419,12 @@ static int tg_set_cfs_bandwidth(struct task_group *tg, u64 period, u64 quota) > if (period > max_cfs_quota_period) > return -EINVAL; > > + /* > + * Bound quota to defend quota against overflow during bandwidth shift. > + */ > + if (quota != RUNTIME_INF && quota > max_cfs_runtime) > + return -EINVAL; > + > /* > * Prevent race between setting of cfs_rq->runtime_enabled and > * unthrottle_offline_cfs_rqs(). > diff --git a/kernel/sched/rt.c b/kernel/sched/rt.c > index df11d88c9895..7ba49625cdbd 100644 > --- a/kernel/sched/rt.c > +++ b/kernel/sched/rt.c > @@ -2569,6 +2569,9 @@ static int __rt_schedulable(struct task_group *tg, u64 period, u64 runtime) > return ret; > } > > +/* More than 4 hours if BW_SHIFT equals 20. */ > +static const u64 max_rt_runtime = MAX_BW; > + > static int tg_set_rt_bandwidth(struct task_group *tg, > u64 rt_period, u64 rt_runtime) > { > @@ -2585,6 +2588,12 @@ static int tg_set_rt_bandwidth(struct task_group *tg, > if (rt_period == 0) > return -EINVAL; > > + /* > + * Bound quota to defend quota against overflow during bandwidth shift. > + */ > + if (rt_runtime != RUNTIME_INF && rt_runtime > max_rt_runtime) > + return -EINVAL; > + We probably _do_ also want this in sched_rt_global_validate now that I think of it. Other than missing that, it looks good. > mutex_lock(&rt_constraints_mutex); > err = __rt_schedulable(tg, rt_period, rt_runtime); > if (err) > diff --git a/kernel/sched/sched.h b/kernel/sched/sched.h > index db3a57675ccf..1f58677a8f23 100644 > --- a/kernel/sched/sched.h > +++ b/kernel/sched/sched.h > @@ -1918,6 +1918,8 @@ extern void init_dl_inactive_task_timer(struct sched_dl_entity *dl_se); > #define BW_SHIFT 20 > #define BW_UNIT (1 << BW_SHIFT) > #define RATIO_SHIFT 8 > +#define MAX_BW_BITS (64 - BW_SHIFT) > +#define MAX_BW ((1ULL << MAX_BW_BITS) - 1) > unsigned long to_ratio(u64 period, u64 runtime); > > extern void init_entity_runnable_average(struct sched_entity *se);