Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp4795034rdb; Fri, 15 Sep 2023 12:27:58 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFyoEZQKfoV8LTez+v6IcnQNKmu6lD17aO1heGwnmGAqXH0lMyFrsI2ZFBfIB9VHwQAZfl5 X-Received: by 2002:a17:902:e744:b0:1c1:ee23:bb75 with SMTP id p4-20020a170902e74400b001c1ee23bb75mr2684217plf.1.1694806077984; Fri, 15 Sep 2023 12:27:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694806077; cv=none; d=google.com; s=arc-20160816; b=TnbnO2RGSsX7z0DAO3Q6NtmRMboJW90Aqpik2vPTL7sCV7pjJn32opmEeUX+UYRyX3 XFBtBkZkAfLRmKk5H6bxWdzyITFbUIGGITmUq+fx2YDDO5If+8tMOyVsg42/pSb68BYY XarqR2Q1bg8UHJ+OVm5x8AS+1KtQIm+xd7pSuNl57M6CXXFJeZthW2n0SFckTsUYaXmq wa9UB06QhS+YooTiJnxCgRS/i9lOMuFwzdfk5UGdiPIOJoaTOGRj/9R5vzDGFYj76u2Y hCRCqVS7uRNDxU/mMWi1Pn9pc+5f+/Lv3YY+HZMP5FRzdC6CxlcCC/g4MsP+ckU4rRFt H2Dg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id :dkim-signature; bh=g+9LelNNuQQBY0BeTA1DgI1taOTB4NaXhHyNv+e8K0s=; fh=v95LKH/FC/ox3RbaAOhxEcPZKseyeUOjmVin6/WheA4=; b=laGFin1o8y9W5FoVnPe/qSYjnBRfOq2vA27fmpEc6pILUrbyDd1ZOsKAKj1NgUTUDj 0codKudX6ZwB5PGewpB3/YaRPd3jNoMez55OuVSu7DlF0WsR7Z5Pv8JkTtqkKEuurbK3 6R/OrOIAbythdRp8C/4FdNEo5x3AWKI9G43LPr4iqtOIm+z7q6+ahO2lVKg6LtpBKrNe Een31IXGKliB1ITqcHbpaF9nOJjHvBrvQlY95PwFElUnhvA35OXWTkRvXj2pVlxCkufj FRiQraW1zPeqCKcX7BMLovM2Yj+rYt8FuaYwoFhzjB75nwgfPs/vFbCKFApWbK8q7Uh5 pY6w== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=UdQheZSZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Return-Path: Received: from howler.vger.email (howler.vger.email. [23.128.96.34]) by mx.google.com with ESMTPS id x10-20020a170902ec8a00b001b894687ec9si3964612plg.462.2023.09.15.12.27.57 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 15 Sep 2023 12:27:57 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) client-ip=23.128.96.34; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=UdQheZSZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id 383868373420; Fri, 15 Sep 2023 04:58:22 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234243AbjIOL6X (ORCPT + 99 others); Fri, 15 Sep 2023 07:58:23 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36236 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232836AbjIOL6V (ORCPT ); Fri, 15 Sep 2023 07:58:21 -0400 Received: from mail-pf1-x42a.google.com (mail-pf1-x42a.google.com [IPv6:2607:f8b0:4864:20::42a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9A066CD8 for ; Fri, 15 Sep 2023 04:57:54 -0700 (PDT) Received: by mail-pf1-x42a.google.com with SMTP id d2e1a72fcca58-68fc9a4ebe9so1841422b3a.2 for ; Fri, 15 Sep 2023 04:57:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance.com; s=google; t=1694779074; x=1695383874; darn=vger.kernel.org; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:date:message-id:from:to:cc:subject:date :message-id:reply-to; bh=g+9LelNNuQQBY0BeTA1DgI1taOTB4NaXhHyNv+e8K0s=; b=UdQheZSZV9QfcEhsP8HuGQM/lnr/qgd9hFQM7J29BQg5IckDP02IrsCpp98t6heXXr Btq11mEajyqJt5JUjZSWxCeOlsldWvboxvyzdKL8vXw1cDQUTd/IrkkPLxQCgGNr+6E6 2yaK63Iq+h4nwYaTVgwoIzBFeByy6Udm+X0AVqFSIKRFxj9GqYAMvZcqBdSvxelzn9P0 aNHhS74NGuzjVZE/QxD1OktTuuemNSzY7GnA6hewTBpfe7hy26faaguZhizfI1nEVLT4 yOB4Y/L9ZmGREUt7A1Wec4w0qT7JRBvjvH7B2Ft0BxFT9A/OQzEBDpVp59T2flFDYzCC R7VA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1694779074; x=1695383874; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:date:message-id:x-gm-message-state:from:to :cc:subject:date:message-id:reply-to; bh=g+9LelNNuQQBY0BeTA1DgI1taOTB4NaXhHyNv+e8K0s=; b=TCzFSCxTZJnsZXsNw9G5k+SVKO0CpXkOzFCH4J2oxx8lg9K3dEc4KJMQ6sL3Wezmrl kR55XXk7WMmM5tVxjQ1fmEmqcOEXNI5xQoDVCC7qUZMKVbsQLBUm9yaegeP4ukYtRD6q KHKk2rr+Z8BrEjypVtiON256jxFp4e9Y3VE3FI/eV73rR0x5+vbMuQ75Kii2cdr8GHb8 wToi8NeIkC9yj5yh9XTVCKT/SBZHzyifCSY8bCjGjCqOE6Sm3OXz05iasw/aXzS0JPpD RB6biZ/8JJyxTvvMlJ5VmtA9mjihRUoqlN+nokZJpgFk13yqb5AucK6zASdytm2dhyQS 17iQ== X-Gm-Message-State: AOJu0YwWBY5sZn2WonW4o79Kw9KJkqVUgu3ToB4P9A1UGQ61gohySite JqqEh5feuepYltvl7baloObRPDR95sdSwf13IBA= X-Received: by 2002:a05:6a00:c96:b0:690:28d0:b7b3 with SMTP id a22-20020a056a000c9600b0069028d0b7b3mr1607774pfv.13.1694779074048; Fri, 15 Sep 2023 04:57:54 -0700 (PDT) Received: from [10.255.88.66] ([139.177.225.230]) by smtp.gmail.com with ESMTPSA id e7-20020aa78247000000b0063f0068cf6csm2834552pfn.198.2023.09.15.04.57.48 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 15 Sep 2023 04:57:53 -0700 (PDT) Message-ID: <8f27e07e-e23c-af80-90eb-b1123e1f68cd@bytedance.com> Date: Fri, 15 Sep 2023 19:57:46 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.15.1 Subject: Re: [PATCH bpf-next v2 4/6] bpf: Introduce css_descendant open-coded iterator kfuncs To: Andrii Nakryiko Cc: bpf@vger.kernel.org, ast@kernel.org, daniel@iogearbox.net, andrii@kernel.org, martin.lau@kernel.org, tj@kernel.org, linux-kernel@vger.kernel.org References: <20230912070149.969939-1-zhouchuyi@bytedance.com> <20230912070149.969939-5-zhouchuyi@bytedance.com> From: Chuyi Zhou In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-3.6 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Fri, 15 Sep 2023 04:58:22 -0700 (PDT) Hello. 在 2023/9/15 07:26, Andrii Nakryiko 写道: > On Tue, Sep 12, 2023 at 12:02 AM Chuyi Zhou wrote: >> >> This Patch adds kfuncs bpf_iter_css_{pre,post}_{new,next,destroy} which >> allow creation and manipulation of struct bpf_iter_css in open-coded >> iterator style. These kfuncs actually wrapps css_next_descendant_{pre, >> post}. BPF programs can use these kfuncs through bpf_for_each macro for >> iteration of all descendant css under a root css. >> >> Signed-off-by: Chuyi Zhou >> --- >> include/uapi/linux/bpf.h | 8 +++++ >> kernel/bpf/helpers.c | 6 ++++ >> kernel/bpf/task_iter.c | 53 ++++++++++++++++++++++++++++++++++ >> tools/include/uapi/linux/bpf.h | 8 +++++ >> tools/lib/bpf/bpf_helpers.h | 12 ++++++++ >> 5 files changed, 87 insertions(+) >> >> diff --git a/include/uapi/linux/bpf.h b/include/uapi/linux/bpf.h >> index befa55b52e29..57760afc13d0 100644 >> --- a/include/uapi/linux/bpf.h >> +++ b/include/uapi/linux/bpf.h >> @@ -7326,4 +7326,12 @@ struct bpf_iter_process { >> __u64 __opaque[1]; >> } __attribute__((aligned(8))); >> >> +struct bpf_iter_css_pre { >> + __u64 __opaque[2]; >> +} __attribute__((aligned(8))); >> + >> +struct bpf_iter_css_post { >> + __u64 __opaque[2]; >> +} __attribute__((aligned(8))); >> + >> #endif /* _UAPI__LINUX_BPF_H__ */ >> diff --git a/kernel/bpf/helpers.c b/kernel/bpf/helpers.c >> index 9b7d2c6f99d1..ca1f6404af9e 100644 >> --- a/kernel/bpf/helpers.c >> +++ b/kernel/bpf/helpers.c >> @@ -2510,6 +2510,12 @@ BTF_ID_FLAGS(func, bpf_iter_css_task_destroy, KF_ITER_DESTROY) >> BTF_ID_FLAGS(func, bpf_iter_process_new, KF_ITER_NEW) >> BTF_ID_FLAGS(func, bpf_iter_process_next, KF_ITER_NEXT | KF_RET_NULL) >> BTF_ID_FLAGS(func, bpf_iter_process_destroy, KF_ITER_DESTROY) >> +BTF_ID_FLAGS(func, bpf_iter_css_pre_new, KF_ITER_NEW) >> +BTF_ID_FLAGS(func, bpf_iter_css_pre_next, KF_ITER_NEXT | KF_RET_NULL) >> +BTF_ID_FLAGS(func, bpf_iter_css_pre_destroy, KF_ITER_DESTROY) >> +BTF_ID_FLAGS(func, bpf_iter_css_post_new, KF_ITER_NEW) >> +BTF_ID_FLAGS(func, bpf_iter_css_post_next, KF_ITER_NEXT | KF_RET_NULL) >> +BTF_ID_FLAGS(func, bpf_iter_css_post_destroy, KF_ITER_DESTROY) >> BTF_ID_FLAGS(func, bpf_dynptr_adjust) >> BTF_ID_FLAGS(func, bpf_dynptr_is_null) >> BTF_ID_FLAGS(func, bpf_dynptr_is_rdonly) >> diff --git a/kernel/bpf/task_iter.c b/kernel/bpf/task_iter.c >> index 9d1927dc3a06..8963fc779b87 100644 >> --- a/kernel/bpf/task_iter.c >> +++ b/kernel/bpf/task_iter.c >> @@ -880,6 +880,59 @@ __bpf_kfunc void bpf_iter_process_destroy(struct bpf_iter_process *it) >> { >> } >> >> +struct bpf_iter_css_kern { >> + struct cgroup_subsys_state *root; >> + struct cgroup_subsys_state *pos; >> +} __attribute__((aligned(8))); >> + >> +__bpf_kfunc int bpf_iter_css_pre_new(struct bpf_iter_css_pre *it, >> + struct cgroup_subsys_state *root) > > similar to my comment on previous patches, please see > kernel/bpf/cgroup_iter.c for iter/cgroup iterator program. Let's stay > consistent. We have one iterator that accepts parameters defining > iteration order and starting cgroup. Unless there are some technical > reasons we can't follow similar approach with this open-coded iter, > let's use the same approach. We can even reuse > BPF_CGROUP_ITER_DESCENDANTS_PRE, BPF_CGROUP_ITER_DESCENDANTS_POST, > BPF_CGROUP_ITER_ANCESTORS_UP enums. > I know your concern. It would be nice if we keep consistent with kernel/bpf/cgroup_iter.c But this patch actually want to support iterating css (cgroup_subsys_state) not cgroup (css is more low lever). With css_iter we can do something like "for_each_mem_cgroup_tree/cpuset_for_each_descendant_pre" in BPF Progs which is hard for cgroup_iter. In the future we can use this iterator to plug some customizable policy in other resource control system. BTW, what I did in RFC actually very similar with the approach of cgroup_iter. (https://lore.kernel.org/all/20230827072057.1591929-4-zhouchuyi@bytedance.com/). Thanks.