Received: by 2002:a05:7412:251c:b0:e2:908c:2ebd with SMTP id w28csp1998007rda; Tue, 24 Oct 2023 09:14:11 -0700 (PDT) X-Google-Smtp-Source: AGHT+IH14kYna27TVHvPJ6HIu6wsWakOfwv5TDtik6qBVEs7c0tqr89vDxl6wlm66ovPf/FOCHf5 X-Received: by 2002:a05:6359:b92:b0:168:e9e5:b407 with SMTP id gf18-20020a0563590b9200b00168e9e5b407mr3693943rwb.10.1698164050725; Tue, 24 Oct 2023 09:14:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1698164050; cv=none; d=google.com; s=arc-20160816; b=o9IyKZlSKOoOIMAv9R6vet7oB26GX50YBFv5RzPL9OwTZzmZtUTfJga3rHRV44jS7o XB28Dr3czGckaEPPYt8KcVMXByVMixQU062EjjLvz3ZKzAVUPDJjrfSvEW9ulXXaPKNi Ak4TN1SihRTkzLb0haqRAzHNcVFkO46Vq9STomSXJSelBhYi7Stfw9X8v0xCiUeFS+Eg Phe9Inu6a+AcW3BrOyY1yrmHwjXzrJ/WAT8pLoHLmL4xm8X9IC6lsP0VSvZF2cXjrzL4 Oct3ihPYtUrhM3DqWs3Dcrnhsvpf/eYu6zVKn7hSeK9kURp5abqKjFpxuyjReUakdBS2 VNoQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=65G5ZNQWiDjqZVlLnnNnzZnK3gIH0P1VMKPcZjBriZw=; fh=dAGaHPokgghRjbRM0yT5hu6FSLPtG4jzfhyLLWfY+Mk=; b=iPx3CZ+IBPKpjNGjuyiM54iJ6xY6Dh5wAN60/ethbFo2dpqHh1bj4qtBinC3c5eR+6 sbpSXQK7aFaQc8hdHfov3jCiDDvNA1NIuT+wOm25EkUQLC9We0Bi+YjJEDiNk8hCggUA N+lHEzIllkAAPn9PtSt11rOXs/ZsNyQ5hSjKoD0NzePs5ZhnC9oeCgePsnm/Wdv7WEju uqoQ9d+Zpnd1eVSwdINbSBABEs7VdpbYu7+eHYQTRDieECORr0BL2crQGiajDO39aVNN yKST0eUzq1r8GZlzPrmyM/p5RpcsN4erLiJwlXR3jreVW97vBHm/Ar2c59SDjsUmdJqI c0RA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=TOKvoMev; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Return-Path: Received: from agentk.vger.email (agentk.vger.email. [2620:137:e000::3:2]) by mx.google.com with ESMTPS id p17-20020a631e51000000b005859da6172bsi8509174pgm.727.2023.10.24.09.14.10 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 24 Oct 2023 09:14:10 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) client-ip=2620:137:e000::3:2; Authentication-Results: mx.google.com; dkim=pass header.i=@suse.com header.s=susede1 header.b=TOKvoMev; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:2 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=suse.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by agentk.vger.email (Postfix) with ESMTP id 5066F803DB49; Tue, 24 Oct 2023 09:14:08 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at agentk.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234926AbjJXQOD (ORCPT + 99 others); Tue, 24 Oct 2023 12:14:03 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:40934 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1344146AbjJXQN4 (ORCPT ); Tue, 24 Oct 2023 12:13:56 -0400 Received: from smtp-out1.suse.de (smtp-out1.suse.de [195.135.220.28]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 672A310C2; Tue, 24 Oct 2023 09:13:52 -0700 (PDT) Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by smtp-out1.suse.de (Postfix) with ESMTPS id 136262188C; Tue, 24 Oct 2023 16:13:51 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=suse.com; s=susede1; t=1698164031; h=from:from:reply-to:date:date:message-id:message-id:to:to:cc:cc: mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=65G5ZNQWiDjqZVlLnnNnzZnK3gIH0P1VMKPcZjBriZw=; b=TOKvoMevPK9ep1x5Y2CeGPRRLW0DAGWpE6ozaK8R+CC/sxs3PwbkaAoHUQqKadsBI7W5CT 8medDQxT17HuuX36Ew6x5AEaX+tYoYm9IGay5lKwvgK5sx5/PyJlXgrZ+S25XTGRmaLrxl tUF+0blB2NJIJsdSqHNta8nTaylXRN4= Received: from imap2.suse-dmz.suse.de (imap2.suse-dmz.suse.de [192.168.254.74]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature ECDSA (P-521) server-digest SHA512) (No client certificate requested) by imap2.suse-dmz.suse.de (Postfix) with ESMTPS id C98321391C; Tue, 24 Oct 2023 16:13:50 +0000 (UTC) Received: from dovecot-director2.suse.de ([192.168.254.65]) by imap2.suse-dmz.suse.de with ESMTPSA id qXU6MD7tN2UIZAAAMHmgww (envelope-from ); Tue, 24 Oct 2023 16:13:50 +0000 Date: Tue, 24 Oct 2023 18:13:49 +0200 From: Michal =?utf-8?Q?Koutn=C3=BD?= To: Waiman Long Cc: Tejun Heo , Zefan Li , Johannes Weiner , Christian Brauner , Jonathan Corbet , Shuah Khan , cgroups@vger.kernel.org, linux-kernel@vger.kernel.org, Juri Lelli , Dietmar Eggemann , Giuseppe Scrivano Subject: Re: [PATCH v8 0/7] cgroup/cpuset: Support remote partitions Message-ID: References: <20230905133243.91107-1-longman@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Authentication-Results: smtp-out1.suse.de; none X-Spam-Level: X-Spam-Score: -6.60 X-Spamd-Result: default: False [-6.60 / 50.00]; ARC_NA(0.00)[]; RCVD_VIA_SMTP_AUTH(0.00)[]; FROM_HAS_DN(0.00)[]; TO_DN_SOME(0.00)[]; TO_MATCH_ENVRCPT_ALL(0.00)[]; NEURAL_HAM_LONG(-3.00)[-1.000]; MIME_GOOD(-0.10)[text/plain]; DKIM_SIGNED(0.00)[suse.com:s=susede1]; NEURAL_HAM_SHORT(-1.00)[-1.000]; RCPT_COUNT_TWELVE(0.00)[12]; FROM_EQ_ENVFROM(0.00)[]; MIME_TRACE(0.00)[0:+]; MID_RHS_NOT_FQDN(0.50)[]; RCVD_COUNT_TWO(0.00)[2]; RCVD_TLS_ALL(0.00)[]; BAYES_HAM(-3.00)[100.00%] X-Spam-Status: No, score=-0.8 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on agentk.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (agentk.vger.email [0.0.0.0]); Tue, 24 Oct 2023 09:14:08 -0700 (PDT) On Fri, Oct 13, 2023 at 12:03:18PM -0400, Waiman Long wrote: > > [chain] > > root > > | \ > > mid1a mid1b > > cpuset.cpus=0-1 cpuset.cpus=2-15 > > cpuset.cpus.partition=root > > | > > mid2 > > cpuset.cpus=0-1 > > cpuset.cpus.partition=root > > | > > cont > > cpuset.cpus=0-1 > > cpuset.cpus.partition=root > In this case, the effective CPUs of both mid1a and mid2 will be empty. IOW, > you can't have any task in these 2 cpusets. I see, that is relevant to a threaded subtree only where the admin / app can know how to distribute CPUs and place threads to internal nodes. > For the remote case, you can have intermediate tasks in both mid1a and mid2 > as long as cpuset.cpus contains more CPUs than cpuset.cpus.exclusive. It's obvious that cpuset.cpus.exclusive should be exclusive among siblings. Should it also be so along the vertical path? root | mid1a cpuset.cpus=0-2 cpuset.cpus.exclusive=0 | mid2 cpuset.cpus=0-2 cpuset.cpus.exclusive=1 | cont cpuset.cpus=0-2 cpuset.cpus.exclusive=2 cpuset.cpus.partition=root IIUC, this should be a valid config regardless of cpuset.cpus.partition setting on mid1a and mid2. Whereas root | mid1a cpuset.cpus=0-2 cpuset.cpus.exclusive=0 | mid2 cpuset.cpus=0-2 cpuset.cpus.exclusive=1-2 cpuset.cpus.partition=root | cont cpuset.cpus=1-2 cpuset.cpus.exclusive=1-2 cpuset.cpus.partition=root Here, I'm hesitating, will mid2 have any exclusively owned cpus? (I have flashes of understading cpus.exclusive as being a more expressive mechanism than partitions. OTOH, it seems non-intuitive when both are combined, thus I'm asking to internalize it better. Should partitions be deprecated for simplicty? They're still good to provide the notification mechanism of invalidation. cpuset.cpus.exclusive.effective don't have that.) > They will be ready eventually. This requirement of remote partition actually > came from our OpenShift team as the use of just local partition did not meet > their need. They don't need access to exclusive CPUs in the parent cgroup > layer for their management daemons. They do need to activate isolated > partition in selected child cgroups to support our Telco customers to run > workloads like DPDK. > > So they will add the support to upstream Kubernetes. Is it worth implementing anything touching (ancestral) cpuset.cpus.partition then? Thanks, Michal