Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp100225ybb; Tue, 24 Mar 2020 17:30:41 -0700 (PDT) X-Google-Smtp-Source: ADFU+vtOb1nSHk1m/sGCqLWZdkLp1viT7MnAUCNSqkKYmTzi5Uxv4OUFtMkZu3WTFndx+jiLv2Of X-Received: by 2002:a05:6830:314e:: with SMTP id c14mr603761ots.7.1585096240959; Tue, 24 Mar 2020 17:30:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585096240; cv=none; d=google.com; s=arc-20160816; b=Jk9QYlAxhVWoaCwE0posd/ICYmJCIlopW1OlFp//UjvpRxhA/qKTNREf5zCXLD+cih u5loBEJB57fQIQ2QHmuQMGrB4DgxgpuwhvCqlnlcsbFr3vfIqsC0qi5itkod0ClWef5e ppArVWZJM1uzWgn1+d1Fj0/DGoljGpiBvpdPvZ828JmFwlBaDp+cDJwFHloLf19uY+Sm GZS7zF18uXPYc9YR47QCZyrDgy7FtkAPNiSSnVlr2i9/hqAmsEB6h2nR+nW5qhGvvOul rlxExE/t0ETIdyoS4JDwBXQgmKkCPZxc+inn0Uar0U2Dz6m+hunANF43wQfG83GKqdHt Q3Nw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=y2gyLyM2lBE9W78IPwkQxOkRUZFRkNUOxfC4rUOWN6c=; b=ivzl5ZqpuyyoKlMWZXptiJO0g2QfU3ipCuHtg/i/PKTFW/+ZDupN8ZbpfUoExm3aTS 9krkRBQn0p3XnLmCzLkACxvT5Ve6BbGwCJMEq0fDQ4+FgKd2IqMDfvvMb9Yv9U6J4BUY QHyd16VNj06P4VnVVEDwwz9ngjGvgtHzKC4np492h12k9EqnpYzUPKQgQK1GX+BotGON PivJ6AVAk30zNK+1/GD7mqsFcHOG+RJVihaYWyOgxSxgu0VpRZo1AEvL2wz/84PGgAoj hHpPcAhud3j1AqNJ+wy+kWFH1STNkp/52ln58sA+qZZKcHuGu8QVTOoC8tlTEC6KPXUH ITIA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="fX4c/tJn"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id p9si4273996oti.202.2020.03.24.17.30.26; Tue, 24 Mar 2020 17:30:40 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="fX4c/tJn"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727211AbgCYAaE (ORCPT + 99 others); Tue, 24 Mar 2020 20:30:04 -0400 Received: from mail.kernel.org ([198.145.29.99]:52374 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727099AbgCYAaD (ORCPT ); Tue, 24 Mar 2020 20:30:03 -0400 Received: from localhost (lfbn-ncy-1-985-231.w90-101.abo.wanadoo.fr [90.101.63.231]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 9849520719; Wed, 25 Mar 2020 00:30:02 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1585096202; bh=ndvwAe62gvGBtSTp82glPEKs/rc+7+NqWCoaSMkd7VE=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=fX4c/tJn0VgUvU4X49IZnuKH97fO7xrA+OljFUq7huxG7QIQVzW+9PftOOwbdFCqV yLRKD+PDCBlIeni66/wonWLGJQyuW8dtGRSpkIGj+s63aZxn40DT6ICLsffO/uF/Vr UOOHjApbMdpxHzqdHz10GZVuvUHxyUTjed0o63Yg= Date: Wed, 25 Mar 2020 01:30:00 +0100 From: Frederic Weisbecker To: Marcelo Tosatti Cc: Thomas Gleixner , Chris Friesen , linux-kernel@vger.kernel.org, Christoph Lameter , Jim Somerville , Andrew Morton , Frederic Weisbecker , Peter Zijlstra Subject: Re: [PATCH v2] isolcpus: affine kernel threads to specified cpumask Message-ID: <20200325002956.GC20223@lenoir> References: <20200323135414.GA28634@fuller.cnet> <87k13boxcn.fsf@nanos.tec.linutronix.de> <87imiuq0cg.fsf@nanos.tec.linutronix.de> <20200324152016.GA25422@fuller.cnet> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200324152016.GA25422@fuller.cnet> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Mar 24, 2020 at 12:20:16PM -0300, Marcelo Tosatti wrote: > > This is a kernel enhancement to configure the cpu affinity of kernel > threads via kernel boot option isolcpus=no_kthreads,, > > When this option is specified, the cpumask is immediately applied upon > thread launch. This does not affect kernel threads that specify cpu > and node. > > This allows CPU isolation (that is not allowing certain threads > to execute on certain CPUs) without using the isolcpus=domain parameter, > making it possible to enable load balancing on such CPUs > during runtime (see > > Note-1: this is based off on Wind River's patch at > https://github.com/starlingx-staging/stx-integ/blob/master/kernel/kernel-std/centos/patches/affine-compute-kernel-threads.patch > > Difference being that this patch is limited to modifying > kernel thread cpumask: Behaviour of other threads can > be controlled via cgroups or sched_setaffinity. > > Note-2: MontaVista's patch was based off Christoph Lameter's patch at > https://lwn.net/Articles/565932/ with the only difference being > the kernel parameter changed from kthread to kthread_cpus. > > Signed-off-by: Marcelo Tosatti I'm wondering, why do you need such a boot shift at all when you can actually affine kthreads on runtime? > > --- > > v2: use isolcpus= subcommand (Thomas Gleixner) > > Documentation/admin-guide/kernel-parameters.txt | 8 ++++++++ > include/linux/cpumask.h | 5 +++++ > include/linux/sched/isolation.h | 1 + > init/main.c | 1 + > kernel/cpu.c | 13 +++++++++++++ > kernel/kthread.c | 4 ++-- > kernel/sched/isolation.c | 6 ++++++ > 7 files changed, 36 insertions(+), 2 deletions(-) > > diff --git a/Documentation/admin-guide/kernel-parameters.txt b/Documentation/admin-guide/kernel-parameters.txt > index c07815d230bc..7318e3057383 100644 > --- a/Documentation/admin-guide/kernel-parameters.txt > +++ b/Documentation/admin-guide/kernel-parameters.txt > @@ -1959,6 +1959,14 @@ > the CPU affinity syscalls or cpuset. > begins at 0 and the maximum value is > "number of CPUs in system - 1". > + When using cpusets, use the isolcpus option no_kthreads > + to avoid creation of kernel threads on isolated CPUs. > + > + no_kthreads > + Adjust the CPU affinity mask of unbound kernel threads to > + not contain CPUs on the isolated list. This complements > + the isolation provided by the cpusets mechanism described > + above. Actually that should be "kthread" instead of no_kthreads. A flag of isolcpus describes what we want a set of CPUs to be isolated from. Well, at least that's how we started with "domain" and "managed_irq". > > managed_irq > > diff --git a/include/linux/sched/isolation.h b/include/linux/sched/isolation.h > index 0fbcbacd1b29..d002332d00eb 100644 > --- a/include/linux/sched/isolation.h > +++ b/include/linux/sched/isolation.h > @@ -14,6 +14,7 @@ enum hk_flags { > HK_FLAG_DOMAIN = (1 << 5), > HK_FLAG_WQ = (1 << 6), > HK_FLAG_MANAGED_IRQ = (1 << 7), > + HK_FLAG_NO_KTHREADS = (1 << 8), Similarly that should be HK_FLAG_KTHREAD. > }; > > #ifdef CONFIG_CPU_ISOLATION > diff --git a/kernel/kthread.c b/kernel/kthread.c > index b262f47046ca..be9c8d53a986 100644 > --- a/kernel/kthread.c > +++ b/kernel/kthread.c > @@ -347,7 +347,7 @@ struct task_struct *__kthread_create_on_node(int (*threadfn)(void *data), > * The kernel thread should not inherit these properties. > */ > sched_setscheduler_nocheck(task, SCHED_NORMAL, ¶m); > - set_cpus_allowed_ptr(task, cpu_all_mask); > + set_cpus_allowed_ptr(task, cpu_kthread_mask); I'm wondering, why are we using cpu_all_mask and not cpu_possible_mask here? If we used the latter, you wouldn't need to create cpu_kthread_mask and you could directly rely on housekeeping_cpumask(HK_FLAG_KTHREAD). > diff --git a/kernel/sched/isolation.c b/kernel/sched/isolation.c > index 008d6ac2342b..e9d48729efd4 100644 > --- a/kernel/sched/isolation.c > +++ b/kernel/sched/isolation.c > @@ -169,6 +169,12 @@ static int __init housekeeping_isolcpus_setup(char *str) > continue; > } > > + if (!strncmp(str, "no_kthreads,", 12)) { > + str += 12; > + flags |= HK_FLAG_NO_KTHREADS; You will certainly want HK_FLAG_WQ as well since workqueue has its own way to deal with unbound affinity. > + continue; > + } > + > pr_warn("isolcpus: Error, unknown flag\n"); > return 0; > } > Thanks.