Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp4686357rdb; Fri, 15 Sep 2023 09:15:12 -0700 (PDT) X-Google-Smtp-Source: AGHT+IFLy8f7C4bbCS7/qxIf2aCfPYpUt7m72q06Z37geFcq3PSlpp4333PLLKwoKBAPutYt0xVa X-Received: by 2002:a17:902:c794:b0:1c4:1195:6de4 with SMTP id w20-20020a170902c79400b001c411956de4mr4987476pla.9.1694794512270; Fri, 15 Sep 2023 09:15:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694794512; cv=none; d=google.com; s=arc-20160816; b=kkZhtME6Et1TVWwvWyxmClPQNzRkyoUNOH+4nq2Y53j/DZt8yoFdRs+MuKH3rd5A1F DOtCNyOknG2FvDi92XivAnPLNmTj4h1qrViUJu5dgJJMglfmDUxG8ZTO0vMDwfi2rIdE eFXYxCct/ODpR0edceiz8fUSJsmgbVxBRcXI/3x63BDIqNxOEcr+F6H4klbZbVV2FOWK 1JekbdXL03MiJQq5cZ1BPmp0yzRIrofmTFiUvsOvjGo3uO3LmwJQ92RZmUcWHnJKcZkq lzc4ojEBc6qDCfG39ZNgyZMcYNUgV9g7ldjl3jyedBpnUE5kAw8w2Wd7emidHVEOz1Ni UgiQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id :dkim-signature; bh=VYaTEiec+VTKSIfvTEBOea2V06zSSSiqn+piG2v1UMY=; fh=v95LKH/FC/ox3RbaAOhxEcPZKseyeUOjmVin6/WheA4=; b=n79zT9Z1QF/x4zMfveHm0OS9aHIJrNDDY1AiHebf8PtzZzXDLBjytuxwlG1LoMgktz wXQ0LKK6kKUdkZG/v55xVDhOY1N/sVYrOBiNZ39sEbsZ5OJo6JY6S+xO2HxrCwvyvZFj FveT251mtQZVyf7hRF6RyGDDa7QonkojXs/KnYh+U8dnrtMGuE3oC0X6UnCvMoxk2Rhv UCgRWZNwpmqwS0d0PHxpMaYfmGUWkrELOeCh4LsQ67IJDrCUmyr+iMW7KTMQ9mqrMDXX iYWy0L8Zpf6rCGY5YDlRYWPl1/6hzAVy8CaNY5cVvitSU3+Vupfy5H0CV8tb8NxxCl+x EtUA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=PMuxnMhb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Return-Path: Received: from lipwig.vger.email (lipwig.vger.email. [23.128.96.33]) by mx.google.com with ESMTPS id l8-20020a170902f68800b001bc079974dbsi3869427plg.355.2023.09.15.09.15.11 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 15 Sep 2023 09:15:12 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) client-ip=23.128.96.33; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance.com header.s=google header.b=PMuxnMhb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.33 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=bytedance.com Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by lipwig.vger.email (Postfix) with ESMTP id 280AB8320A1A; Fri, 15 Sep 2023 08:05:05 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at lipwig.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236077AbjIOPEm (ORCPT + 99 others); Fri, 15 Sep 2023 11:04:42 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33364 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235998AbjIOPEL (ORCPT ); Fri, 15 Sep 2023 11:04:11 -0400 Received: from mail-pl1-x633.google.com (mail-pl1-x633.google.com [IPv6:2607:f8b0:4864:20::633]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 90F2F271B for ; Fri, 15 Sep 2023 08:03:57 -0700 (PDT) Received: by mail-pl1-x633.google.com with SMTP id d9443c01a7336-1c4194f7635so12103335ad.0 for ; Fri, 15 Sep 2023 08:03:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance.com; s=google; t=1694790237; x=1695395037; darn=vger.kernel.org; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:date:message-id:from:to:cc:subject:date :message-id:reply-to; bh=VYaTEiec+VTKSIfvTEBOea2V06zSSSiqn+piG2v1UMY=; b=PMuxnMhb32gX7gC1k8o+3kYuHVnbWmoAEFoD8bkfrqKt6TdH4lCTL8LArPXVa7XZaO watcEU1rfP4JpgIkWRulNd17mS6rt8rIhd3jLsbc6o4WRZNpyEHDRWdISQPTT8wtPOZC TuTFGsFcW0kN3MoNzo6YdVg/fFossXfN3zX5srnPl93r/d5OvTqsZ/DCAYv75vBUbiOp 3m/r38YxcQWeXXl0bEOO5zumhI6OsS0pfSdsbuuDk+H/UvOJYvLyp9CKzur1g/j6KVs2 WXzY42AJLjpt6I7zYnfEz14ZeKbbxNtaBKFyeYW6KcKGci3KKjN1xy38wpRy/Ez2ZbnO bZMQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1694790237; x=1695395037; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:date:message-id:x-gm-message-state:from:to :cc:subject:date:message-id:reply-to; bh=VYaTEiec+VTKSIfvTEBOea2V06zSSSiqn+piG2v1UMY=; b=lFdKxnKUhJySDH22gmGvFiKScb/pbIwmaHzR/mQGNN7P02ZhYx7CSJLpdWTOU0KQv/ kgrOEuEfAeZJJGqQcSVqbLb5VGrLDOgBwXHfY1PU1X6TIteJT2iD2NwzOxqLH/uhKef9 MJSfZeZh552GgIif1q59YH7ijo1AK4UDabXTbb/zOomS/ySSumxZXnNEQSAw78HjLVC3 8XUcSs8CkvYbu2/Ssmq/oF/p7eB06xh4T/t0m6tuFOIjysZ0RQmEMA20gZUgF6N0t9ak Tne2IxSDF4yXNUXxiEmZuOHoX+w6Yradnxs6/EeYQMxTBrzV2rG+jYW8xNxc4X8eEuIn VeQg== X-Gm-Message-State: AOJu0Yw3kZndJUTRU+gdu4DvceLMrCbBG6xMtY6WfFBldu/DDVs9K8QQ opgcp24F/bjgcmML9icAIJDfJQ== X-Received: by 2002:a17:902:e80b:b0:1bc:3908:14d with SMTP id u11-20020a170902e80b00b001bc3908014dmr7842218plg.2.1694790236600; Fri, 15 Sep 2023 08:03:56 -0700 (PDT) Received: from [10.5.75.238] ([139.177.225.244]) by smtp.gmail.com with ESMTPSA id q6-20020a17090311c600b001bbb25dd3a7sm3607798plh.187.2023.09.15.08.03.52 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Fri, 15 Sep 2023 08:03:55 -0700 (PDT) Message-ID: <30eadbff-8340-a721-362b-ff82de03cb9f@bytedance.com> Date: Fri, 15 Sep 2023 23:03:50 +0800 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:102.0) Gecko/20100101 Thunderbird/102.15.1 Subject: Re: [PATCH bpf-next v2 3/6] bpf: Introduce process open coded iterator kfuncs To: Andrii Nakryiko Cc: bpf@vger.kernel.org, ast@kernel.org, daniel@iogearbox.net, andrii@kernel.org, martin.lau@kernel.org, tj@kernel.org, linux-kernel@vger.kernel.org References: <20230912070149.969939-1-zhouchuyi@bytedance.com> <20230912070149.969939-4-zhouchuyi@bytedance.com> From: Chuyi Zhou In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.3 required=5.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI, NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lipwig.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (lipwig.vger.email [0.0.0.0]); Fri, 15 Sep 2023 08:05:05 -0700 (PDT) 在 2023/9/15 07:26, Andrii Nakryiko 写道: > On Tue, Sep 12, 2023 at 12:02 AM Chuyi Zhou wrote: >> >> This patch adds kfuncs bpf_iter_process_{new,next,destroy} which allow >> creation and manipulation of struct bpf_iter_process in open-coded iterator >> style. BPF programs can use these kfuncs or through bpf_for_each macro to >> iterate all processes in the system. >> >> Signed-off-by: Chuyi Zhou >> --- >> include/uapi/linux/bpf.h | 4 ++++ >> kernel/bpf/helpers.c | 3 +++ >> kernel/bpf/task_iter.c | 29 +++++++++++++++++++++++++++++ >> tools/include/uapi/linux/bpf.h | 4 ++++ >> tools/lib/bpf/bpf_helpers.h | 5 +++++ >> 5 files changed, 45 insertions(+) >> >> diff --git a/include/uapi/linux/bpf.h b/include/uapi/linux/bpf.h >> index de02c0971428..befa55b52e29 100644 >> --- a/include/uapi/linux/bpf.h >> +++ b/include/uapi/linux/bpf.h >> @@ -7322,4 +7322,8 @@ struct bpf_iter_css_task { >> __u64 __opaque[1]; >> } __attribute__((aligned(8))); >> >> +struct bpf_iter_process { >> + __u64 __opaque[1]; >> +} __attribute__((aligned(8))); >> + >> #endif /* _UAPI__LINUX_BPF_H__ */ >> diff --git a/kernel/bpf/helpers.c b/kernel/bpf/helpers.c >> index d6a16becfbb9..9b7d2c6f99d1 100644 >> --- a/kernel/bpf/helpers.c >> +++ b/kernel/bpf/helpers.c >> @@ -2507,6 +2507,9 @@ BTF_ID_FLAGS(func, bpf_iter_num_destroy, KF_ITER_DESTROY) >> BTF_ID_FLAGS(func, bpf_iter_css_task_new, KF_ITER_NEW) >> BTF_ID_FLAGS(func, bpf_iter_css_task_next, KF_ITER_NEXT | KF_RET_NULL) >> BTF_ID_FLAGS(func, bpf_iter_css_task_destroy, KF_ITER_DESTROY) >> +BTF_ID_FLAGS(func, bpf_iter_process_new, KF_ITER_NEW) >> +BTF_ID_FLAGS(func, bpf_iter_process_next, KF_ITER_NEXT | KF_RET_NULL) >> +BTF_ID_FLAGS(func, bpf_iter_process_destroy, KF_ITER_DESTROY) >> BTF_ID_FLAGS(func, bpf_dynptr_adjust) >> BTF_ID_FLAGS(func, bpf_dynptr_is_null) >> BTF_ID_FLAGS(func, bpf_dynptr_is_rdonly) >> diff --git a/kernel/bpf/task_iter.c b/kernel/bpf/task_iter.c >> index d8539cc05ffd..9d1927dc3a06 100644 >> --- a/kernel/bpf/task_iter.c >> +++ b/kernel/bpf/task_iter.c >> @@ -851,6 +851,35 @@ __bpf_kfunc void bpf_iter_css_task_destroy(struct bpf_iter_css_task *it) >> kfree(kit->css_it); >> } >> >> +struct bpf_iter_process_kern { >> + struct task_struct *tsk; >> +} __attribute__((aligned(8))); >> + > > Few high level thoughts. I think it would be good to follow > SEC("iter/task") naming and approach. Open-coded iterators in many > ways are in-kernel counterpart to iterator programs, so keeping them > close enough within reason is useful for knowledge transfer. > > SEC("iter/task") allows to: > a) iterate all threads in the system > b) iterate all threads for a given TGID > c) it also allows to "iterate" a single thread or process, but that's > a bit less relevant for in-kernel iterator, but we can still support > them, why not? > > I'm not sure if it supports iterating all processes (as in group > leaders of each task group) in the system, but if it's possible I > think we should support it at least for open-coded iterator, seems > like a very useful functionality. > > So to that end, let's design a small set of input arguments for > bpf_iter_process_new() that would allow to specify this as flags + > either (optional) struct task_struct * pointer to represent > task/process or PID/TGID. > Another concern from Alexei was the readability of the API of open-coded in BPF Program[1]. bpf_for_each(task, curr) is straightforward. Users can easily understand that this API does the same thing as 'for_each_process' in kernel. However, if we keep the approach of SEC("iter/task") enum ITER_ITEM { ITER_TASK, ITER_THREAD, } __bpf_kfunc int bpf_iter_task_new(struct bpf_iter_process *it, struct task_struct *group_task, enum ITER_ITEM type) the API have to chang: bpf_for_each(task, curr, NULL, ITERATE_TASK) // iterate all process in the system bpf_for_each(task, curr, group_leader, ITERATE_THREAD) // iterate all thread of group_leader bpf_for_each(task, curr, NULL, ITERATE_THREAD) //iterate all threads of all the process in the system Useres may guess what are this API actually doing.... So, I'm thinking if we can add a layer of abstraction to hide the details from the users: #define bpf_for_each_process(task) \ bpf_for_each(task, curr, NULL, ITERATE_TASK) It would be nice if you could give me some better suggestions. Thanks! [1] https://lore.kernel.org/lkml/CAADnVQLbDWUxFen-RS67C86sOE5DykEPD8xyihJ2RnG1WEnTQg@mail.gmail.com/