From: Waiman Long
To: Tejun Heo, Zefan Li, Johannes Weiner, Jonathan Corbet, Lai Jiangshan, Shuah Khan
Cc: cgroups@vger.kernel.org, linux-doc@vger.kernel.org, linux-kernel@vger.kernel.org, linux-kselftest@vger.kernel.org, Peter Hunt, Frederic Weisbecker, Waiman Long
Subject: [PATCH v2 0/4] cgroup/cpuset: Improve CPU isolation in isolated partitions
Date: Wed, 25 Oct 2023 14:25:51 -0400
Message-Id: <20231025182555.4155614-1-longman@redhat.com>

v2:
 - Add 2 read-only workqueue sysfs files to expose the user requested
   cpumask as well as the isolated CPUs to be excluded from
   wq_unbound_cpumask.
 - Ensure that callers of the new workqueue_unbound_exclude_cpumask()
   hold cpus_read_lock.
 - Update the cpuset code to make sure that cpus_read_lock is held
   whenever workqueue_unbound_exclude_cpumask() may be called.

An isolated cpuset partition can currently be created to contain an
exclusive set of CPUs not used in other cgroups and with load balancing
disabled to reduce interference from the scheduler.

The main purpose of this isolated partition type is to dynamically
emulate what can be done via the "isolcpus" boot command line option,
specifically the default domain flag. One effect of the "isolcpus"
option is to remove the isolated CPUs from the cpumasks of unbound
workqueues, since running work functions on an isolated CPU can be a
major source of interference. The unbound workqueue cpumasks can be
changed at run time by writing an appropriate cpumask, one without the
isolated CPUs, to /sys/devices/virtual/workqueue/cpumask. So one can
set up an isolated cpuset partition and then write to the cpumask sysfs
file to achieve a similar level of CPU isolation. However, this manual
process is error-prone.

This patch series implements automatic exclusion of isolated CPUs from
unbound workqueue cpumasks when an isolated cpuset partition is created,
and adds those CPUs back when the isolated partition is destroyed.

There are also other places in the kernel that look at the
HK_FLAG_DOMAIN cpumask or other HK_FLAG_* cpumasks and exclude the
isolated CPUs from certain actions to further reduce interference.
CPUs in an isolated cpuset partition cannot yet avoid that
interference. That may change in the future as the need arises.
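For illustration only, the manual step described above (writing a
cpumask without the isolated CPUs to the workqueue sysfs file) might be
sketched as below. The helper name wq_mask_without is hypothetical and
not part of this series. The sketch assumes CPUs are numbered from 0,
that there are at most 32 of them so the mask fits in a single hex
word, and that the sysfs file accepts a bare hex mask.

```shell
# Hypothetical helper, not part of this patch series.
# Usage: wq_mask_without NCPUS [ISOLATED_CPU...]
# Prints a hex cpumask covering CPUs 0..NCPUS-1 with the listed
# isolated CPUs cleared. Assumes NCPUS <= 32.
# The body runs in a subshell so its variables do not leak out.
wq_mask_without() (
    ncpus=$1
    shift
    # Start with every CPU set.
    mask=$(( (1 << ncpus) - 1 ))
    # Clear the bit of each isolated CPU.
    for cpu in "$@"; do
        mask=$(( mask & ~(1 << cpu) ))
    done
    printf '%x\n' "$mask"
)

# Example (as root), on an 8-CPU system with CPUs 2-3 isolated:
#   wq_mask_without 8 2 3 > /sys/devices/virtual/workqueue/cpumask
```

Having to redo this by hand every time an isolated partition is created
or destroyed is the error-prone part that the series sets out to
automate.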

Waiman Long (4):
  workqueue: Add workqueue_unbound_exclude_cpumask() to exclude CPUs
    from wq_unbound_cpumask
  selftests/cgroup: Minor code cleanup and reorganization of
    test_cpuset_prs.sh
  cgroup/cpuset: Keep track of CPUs in isolated partitions
  cgroup/cpuset: Take isolated CPUs out of workqueue unbound cpumask

 Documentation/admin-guide/cgroup-v2.rst       |  10 +-
 include/linux/workqueue.h                     |   2 +-
 kernel/cgroup/cpuset.c                        | 286 +++++++++++++-----
 kernel/workqueue.c                            |  91 +++++-
 .../selftests/cgroup/test_cpuset_prs.sh       | 216 ++++++++-----
 5 files changed, 438 insertions(+), 167 deletions(-)

-- 
2.39.3