Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp3410762imw; Mon, 18 Jul 2022 07:39:14 -0700 (PDT) X-Google-Smtp-Source: AGRyM1tgVAE3dLc05l0Ywd3nbmY6BVrnl5GsuU+M4Alia9vdBrYoC+uspdV//XvmvWIN40BCrvln X-Received: by 2002:a05:622a:114:b0:31d:2388:4df4 with SMTP id u20-20020a05622a011400b0031d23884df4mr21773248qtw.488.1658155153834; Mon, 18 Jul 2022 07:39:13 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1658155153; cv=none; d=google.com; s=arc-20160816; b=Dru9MRr2nQ7nyolZiMbdFBgaNylaHHLoep74DFPf+kyaiTxkBgZneuDwYbZyDu6aAk QlVCe93tEm647nw9vuzGoX1bxUw65uvdjRnlJi+ubex6qWpT0MGeIwXj0crzckh+TwVB HWnsC+Agq8jzPPqUi2mO2KWYUukBHedfv5YEaRzFUV0KzqN3PU+TgfQazH1pwqoMmvRU sl5aDHgHiCAEkLXLLlkV49BlHg5MEzABuunairzxX84iGDlY1l1kIVISNhYmpDafuNwM 2c7hGqh6rdknhK9veU+whEdsqgZTP4BdkRpQ0opSUqVC9Tk5+FRh6ykr11MlZKp/izPz Pqig== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=rFQMscdcwUkaTGb2PKaBiEdNVFGKZ4TJiBENKOnBRwk=; b=IqJby3MpDSt6qq7BB72sZOqrHzvQdvqkx2U0nfGefw6quAU7sGi3M5vett85dtLhxK D2WrdELUG8UCA21a8Ae8Z4e0hzuVETr7p9byhB4l83ZZPhCbdzL6XNVfHgEPrvUNzdeI M54s60t4yZ28UxhK1sl9MquuYInJuN6gHPZa8+lwJPdxVnJihBnRmcM613gZco7O6hwB ZMzyEnsJtWk4xXrarMqAcIk9MdNpc4EvBDHthNZ+FQvX/N+MZ9GvbVUE/zw7iPNzJH5N XyiMBCJhqN267uVOhOIhVpmqei35ojYy5TBV3PBe+ZWFfAvZKRER3S+AnyAeWYX8bshY Bo+g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=aNmYeABo; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id ca15-20020a05622a1f0f00b0031798653d60si6339566qtb.321.2022.07.18.07.38.59; Mon, 18 Jul 2022 07:39:13 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=aNmYeABo; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S234208AbiGROgi (ORCPT + 99 others); Mon, 18 Jul 2022 10:36:38 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:33200 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234528AbiGROg0 (ORCPT ); Mon, 18 Jul 2022 10:36:26 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 5FA4711473 for ; Mon, 18 Jul 2022 07:36:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1658154984; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=rFQMscdcwUkaTGb2PKaBiEdNVFGKZ4TJiBENKOnBRwk=; b=aNmYeABowJTmeIipOhAPFBG2Jcdv0DAsYLgMHVdeM+CofGS8btNPT3H3wzzK9jkVX/bZJZ pDTGT9+K9iYnieQnDAHRfMUAMiesEsAeYcK9JGH2MtZS3J2IAUme/eRA/8M9ooRAXD+TzR Ax0SKuKpfo/5ot/PcZBlh6/d/kIWrI4= Received: from mail-pg1-f199.google.com (mail-pg1-f199.google.com [209.85.215.199]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-320-C_so-FQQPoC4Mnrty1syQA-1; Mon, 18 Jul 2022 10:36:16 -0400 X-MC-Unique: C_so-FQQPoC4Mnrty1syQA-1 Received: by mail-pg1-f199.google.com with SMTP id p35-20020a631e63000000b0041992866de0so5737103pgm.19 for ; Mon, 18 Jul 2022 07:36:16 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=rFQMscdcwUkaTGb2PKaBiEdNVFGKZ4TJiBENKOnBRwk=; b=qz9EFdGKUrknxtUMkOSGOdE0UoXxMMJmtVMWk7zLqBSz8YuvAkiAbZ/Xu/izIBXW18 FKDDjl860WFPbjAXrmU9sNJE4sF/KKcfKEzNk33i9enUvY0zVbBD8Zqbus0u918qj1q+ i6HrixRzv9Hmc4KcxyjYzIPaHnOsTV+xmpCVo/PbxOx10Q8mY99BWkV/M8wImR4OwDEI OpDtNSwN/kRsuq6cqj89ZufXmyfMs/OgtzEr//BWXW5PbyfWllCbzqWdXgHDFCleZf0f zy7LZ+a1FzMsAMlLgCn/E25kwqf4Q84IcIA9DJGk96jchczXItQ1pasLx3PwINEwD4tu oC/A== X-Gm-Message-State: AJIora85quPguiU3ayxjXDQkON9+B3AknizuzioJEWM/+7edMdcmWiJ7 MaVEAHCG6thILfaTYrizfyTwLfNkyk/UIS8SgSRXSECUjpzFqA6zhWLxHnozJPiFhJnygvnmHso +12oe0SuYtvh1HbfFXAmkH15xWRCyq6uXfOOms4/k X-Received: by 2002:a17:90b:4a08:b0:1ef:f36b:18e1 with SMTP id kk8-20020a17090b4a0800b001eff36b18e1mr39392472pjb.246.1658154975484; Mon, 18 Jul 2022 07:36:15 -0700 (PDT) X-Received: by 2002:a17:90b:4a08:b0:1ef:f36b:18e1 with SMTP id kk8-20020a17090b4a0800b001eff36b18e1mr39392441pjb.246.1658154975155; Mon, 18 Jul 2022 07:36:15 -0700 (PDT) MIME-Version: 1.0 References: <20220712145850.599666-1-benjamin.tissoires@redhat.com> <20220712145850.599666-6-benjamin.tissoires@redhat.com> <7fc49373-55df-c7fd-4a73-c2cf8a62748d@fb.com> In-Reply-To: <7fc49373-55df-c7fd-4a73-c2cf8a62748d@fb.com> From: Benjamin Tissoires Date: Mon, 18 Jul 2022 16:36:03 +0200 Message-ID: Subject: Re: [PATCH bpf-next v6 05/23] bpf/verifier: allow kfunc to return an allocated mem To: Yonghong Song Cc: Greg KH , Jiri Kosina , Alexei Starovoitov , Daniel Borkmann , Andrii Nakryiko , Martin KaFai Lau , Song Liu , John Fastabend , KP Singh , Shuah Khan , Dave Marchevsky , Joe Stringer , Jonathan Corbet , Tero Kristo , lkml , "open list:HID CORE LAYER" , Networking , bpf , "open list:KERNEL SELFTEST FRAMEWORK" , Linux Doc Mailing List Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-2.8 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Sat, Jul 16, 2022 at 6:29 AM Yonghong Song wrote: > > > > On 7/12/22 7:58 AM, Benjamin Tissoires wrote: > > When a kfunc is not returning a pointer to a struct but to a plain type, > > we can consider it is a valid allocated memory assuming that: > > - one of the arguments is either called rdonly_buf_size or > > rdwr_buf_size > > - and this argument is a const from the caller point of view > > > > We can then use this parameter as the size of the allocated memory. > > > > The memory is either read-only or read-write based on the name > > of the size parameter. > > If I understand correctly, this permits a kfunc like > int *kfunc(..., int rdonly_buf_size); > ... > int *p = kfunc(..., 20); > so the 'p' points to a memory buffer with size 20. Yes, exactly. > > This looks like a strange interface although probably there > is a valid reason for this as I didn't participated in > earlier discussions. Well, the point is I need to be able to access a memory region that was allocated dynamically. For drivers, the incoming data can not usually be bound to a static value, and so we can not have the data statically defined in the matching struct. So this allows defining a kfunc to return any memory properly allocated and owned by the device. > > > > > Signed-off-by: Benjamin Tissoires > > > > --- > > > > changes in v6: > > - code review from Kartikeya: > > - remove comment change that had no reasons to be > > - remove handling of PTR_TO_MEM with kfunc releases > > - introduce struct bpf_kfunc_arg_meta > > - do rdonly/rdwr_buf_size check in btf_check_kfunc_arg_match > > - reverted most of the changes in verifier.c > > - make sure kfunc acquire is using a struct pointer, not just a plain > > pointer > > - also forward ref_obj_id to PTR_TO_MEM in kfunc to not use after free > > the allocated memory > > > > changes in v5: > > - updated PTR_TO_MEM comment in btf.c to match upstream > > - make it read-only or read-write based on the name of size > > > > new in v4 > > --- > > include/linux/bpf.h | 10 ++++++- > > include/linux/btf.h | 12 ++++++++ > > kernel/bpf/btf.c | 67 ++++++++++++++++++++++++++++++++++++++++--- > > kernel/bpf/verifier.c | 49 +++++++++++++++++++++++-------- > > 4 files changed, 121 insertions(+), 17 deletions(-) > > > > diff --git a/include/linux/bpf.h b/include/linux/bpf.h > > index 2b21f2a3452f..5b8eadb6e7bc 100644 > > --- a/include/linux/bpf.h > > +++ b/include/linux/bpf.h > > @@ -1916,12 +1916,20 @@ int btf_distill_func_proto(struct bpf_verifier_log *log, > > const char *func_name, > > struct btf_func_model *m); > > > > +struct bpf_kfunc_arg_meta { > > + u64 r0_size; > > + bool r0_rdonly; > > + int ref_obj_id; > > + bool multiple_ref_obj_id; > > +}; > > + > > struct bpf_reg_state; > > int btf_check_subprog_arg_match(struct bpf_verifier_env *env, int subprog, > > struct bpf_reg_state *regs); > > int btf_check_kfunc_arg_match(struct bpf_verifier_env *env, > > const struct btf *btf, u32 func_id, > > - struct bpf_reg_state *regs); > > + struct bpf_reg_state *regs, > > + struct bpf_kfunc_arg_meta *meta); > > int btf_prepare_func_args(struct bpf_verifier_env *env, int subprog, > > struct bpf_reg_state *reg); > > int btf_check_type_match(struct bpf_verifier_log *log, const struct bpf_prog *prog, > > diff --git a/include/linux/btf.h b/include/linux/btf.h > > index 1bfed7fa0428..31da4273c2ec 100644 > > --- a/include/linux/btf.h > > +++ b/include/linux/btf.h > > @@ -420,4 +420,16 @@ static inline int register_btf_id_dtor_kfuncs(const struct btf_id_dtor_kfunc *dt > > } > > #endif > > > > +static inline bool btf_type_is_struct_ptr(struct btf *btf, const struct btf_type *t) > > +{ > > + /* t comes in already as a pointer */ > > + t = btf_type_by_id(btf, t->type); > > + > > + /* allow const */ > > + if (BTF_INFO_KIND(t->info) == BTF_KIND_CONST) > > + t = btf_type_by_id(btf, t->type); > > + > > + return btf_type_is_struct(t); > > +} > > + > > #endif > > diff --git a/kernel/bpf/btf.c b/kernel/bpf/btf.c > > index 4423045b8ff3..552d7bc05a0c 100644 > > --- a/kernel/bpf/btf.c > > +++ b/kernel/bpf/btf.c > > @@ -6168,10 +6168,36 @@ static bool is_kfunc_arg_mem_size(const struct btf *btf, > > return true; > > } > > > > +static bool btf_is_kfunc_arg_mem_size(const struct btf *btf, > > + const struct btf_param *arg, > > + const struct bpf_reg_state *reg, > > + const char *name) > > +{ > > + int len, target_len = strlen(name); > > + const struct btf_type *t; > > + const char *param_name; > > + > > + t = btf_type_skip_modifiers(btf, arg->type, NULL); > > + if (!btf_type_is_scalar(t) || reg->type != SCALAR_VALUE) > > + return false; > > + > > + param_name = btf_name_by_offset(btf, arg->name_off); > > + if (str_is_empty(param_name)) > > + return false; > > + len = strlen(param_name); > > + if (len != target_len) > > + return false; > > + if (strncmp(param_name, name, target_len)) > > strcmp(param_name, name) is enough. len == target_len and both len and > target_len is computed from strlen(...). Ack, fixed locally > > > + return false; > > + > > + return true; > > +} > > + > > static int btf_check_func_arg_match(struct bpf_verifier_env *env, > > const struct btf *btf, u32 func_id, > > struct bpf_reg_state *regs, > > - bool ptr_to_mem_ok) > > + bool ptr_to_mem_ok, > > + struct bpf_kfunc_arg_meta *kfunc_meta) > > { > > enum bpf_prog_type prog_type = resolve_prog_type(env->prog); > > struct bpf_verifier_log *log = &env->log; > > @@ -6225,6 +6251,30 @@ static int btf_check_func_arg_match(struct bpf_verifier_env *env, > > > > t = btf_type_skip_modifiers(btf, args[i].type, NULL); > > if (btf_type_is_scalar(t)) { > > + if (is_kfunc && kfunc_meta) { > > + bool is_buf_size = false; > > + > > + /* check for any const scalar parameter of name "rdonly_buf_size" > > + * or "rdwr_buf_size" > > + */ > > + if (btf_is_kfunc_arg_mem_size(btf, &args[i], reg, > > + "rdonly_buf_size")) { > > + kfunc_meta->r0_rdonly = true; > > + is_buf_size = true; > > + } else if (btf_is_kfunc_arg_mem_size(btf, &args[i], reg, > > + "rdwr_buf_size")) > > + is_buf_size = true; > > + > > + if (is_buf_size) { > > + if (kfunc_meta->r0_size) { > > + bpf_log(log, "2 or more rdonly/rdwr_buf_size parameters for kfunc"); > > + return -EINVAL; > > + } > > + > > + kfunc_meta->r0_size = reg->var_off.value; > > Did we check 'reg' is a constant somewhere? I used to check for it in the previous version, but I think it got dropped in this revision. Re-adding this thanks to Kumar's help :) Cheers, Benjamin > > > + } > > + } > > + > > if (reg->type == SCALAR_VALUE) > > continue; > > bpf_log(log, "R%d is not a scalar\n", regno); > > @@ -6246,6 +6296,14 @@ static int btf_check_func_arg_match(struct bpf_verifier_env *env, > > if (ret < 0) > > return ret; > > > > + /* kptr_get is only valid for kfunc */ > > + if (kfunc_meta && reg->ref_obj_id) { > > + /* check for any one ref_obj_id to keep track of memory */ > > + if (kfunc_meta->ref_obj_id) > > + kfunc_meta->multiple_ref_obj_id = true; > > + kfunc_meta->ref_obj_id = reg->ref_obj_id; > > + } > > + > > /* kptr_get is only true for kfunc */ > > if (i == 0 && kptr_get) { > > struct bpf_map_value_off_desc *off_desc; > > @@ -6441,7 +6499,7 @@ int btf_check_subprog_arg_match(struct bpf_verifier_env *env, int subprog, > > return -EINVAL; > > > > is_global = prog->aux->func_info_aux[subprog].linkage == BTF_FUNC_GLOBAL; > > - err = btf_check_func_arg_match(env, btf, btf_id, regs, is_global); > > + err = btf_check_func_arg_match(env, btf, btf_id, regs, is_global, NULL); > > > > /* Compiler optimizations can remove arguments from static functions > > * or mismatched type can be passed into a global function. > [...] >