Received: by 2002:a05:7412:31a9:b0:e2:908c:2ebd with SMTP id et41csp3536841rdb; Wed, 13 Sep 2023 15:30:18 -0700 (PDT) X-Google-Smtp-Source: AGHT+IEUaU26i+LzFQWXxUS0s5If3muWwrDA4qg63/xhu5vtWdcsxtKJuPPEGmYG4evzYKr3oxDD X-Received: by 2002:a17:90a:2dc2:b0:268:ca63:e412 with SMTP id q2-20020a17090a2dc200b00268ca63e412mr3271091pjm.4.1694644217885; Wed, 13 Sep 2023 15:30:17 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1694644217; cv=none; d=google.com; s=arc-20160816; b=CQQXpNMZLleJWz2tvbYA0nLys2AeTwNehL2DxQQKknvCduByqbhRHluQmBfth0igkC 2uvSthLbEU6H9IThL3MGSTZ7hkYdoy7Q8luy/lirPLG8HzcUwqmQE/fAAV0n3d4GpUQB SG1+SRiTAZcTrD7YaAuVZTDxJ2qfTR1cL4XhNbZgbOyoc+4o4l9ohiFeFHDVF1oCSoGF cW6t1U/FNRoOmT084yxuuLu3U422hle3/X8WpvJS8R3ZQjsvnwHZK+9CA9UIuACkLw1J A12EILzNG0hhqzM1r0kN1JirfxhsWSMvVPRNHIhDGCKctaLzz37a3zOCqTMPhLMztiJ/ Dhgg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:subject:user-agent:mime-version:date:message-id; bh=DT8Z38G+FMR31+33Z5JLVDnTdm12ySuHuIgI3FjRCm4=; fh=96LgAWwgpb7dRelRnlPpSE3q5eaSTQeareOPQC7IOEE=; b=yNeFhxXlRBmbiTfSl1AFwqzgCwTY8cJt4SybKyCRdGQMTjhnqrQRbsD6986ICQusCL D1VNdbBnG6SD2Ty0voV69q+GOw7QpW4BDOvbP5IeC0mVDHqirnwttiVjUGodUJdpSnz7 4dGIcG3EYr2gIdhdKeBKYLg/Fk1TAjhDLZlSlWbAEr6iKb055nG/e2c7aMDZELwmaLor xJM9UksiVIHg3xhCfBA3pfCkkivNJTqyoLEk2LWM22j8JfJKF09/AEuIbdmCT+orvAtE J95m5PvMvkCMZDk7OMDnf/AQro93Nx3TTGf2p1SBs6t6+84PCWZC7xlO9TuJgAevdPi1 ibhA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=sangfor.com.cn Return-Path: Received: from snail.vger.email (snail.vger.email. [23.128.96.37]) by mx.google.com with ESMTPS id ne1-20020a17090b374100b0026816382fdbsi261244pjb.40.2023.09.13.15.30.17 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 13 Sep 2023 15:30:17 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) client-ip=23.128.96.37; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.37 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=sangfor.com.cn Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by snail.vger.email (Postfix) with ESMTP id 3C72E81DC7BF; Wed, 13 Sep 2023 03:33:38 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at snail.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S237598AbjIMKd2 (ORCPT + 99 others); Wed, 13 Sep 2023 06:33:28 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:45778 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238013AbjIMKdZ (ORCPT ); Wed, 13 Sep 2023 06:33:25 -0400 Received: from mail-m49247.qiye.163.com (mail-m49247.qiye.163.com [45.254.49.247]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4C1DBD3; Wed, 13 Sep 2023 03:33:20 -0700 (PDT) Received: from [10.128.10.193] (unknown [123.120.52.233]) by mail-m12739.qiye.163.com (Hmail) with ESMTPA id BAEF74A0314; Wed, 13 Sep 2023 18:32:46 +0800 (CST) Message-ID: <5f8d82c3-838e-4d75-bb25-7d98a6d0a37c@sangfor.com.cn> Date: Wed, 13 Sep 2023 18:32:43 +0800 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [RFC PATCH v2] bpf: Using binary search to improve the performance of btf_find_by_name_kind To: Alexei Starovoitov , Eduard Zingerman Cc: Martin KaFai Lau , Alexei Starovoitov , Song Liu , Yonghong Song , Steven Rostedt , Masami Hiramatsu , dinghui@sangfor.com.cn, huangcun@sangfor.com.cn, bpf , LKML References: <20230909091646.420163-1-pengdonglin@sangfor.com.cn> <20ef8441084c9d5fd54f84987afa77eed7fe148e.camel@gmail.com> <035ab912d7d6bd11c54c038464795da01dbed2de.camel@gmail.com> From: pengdonglin In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-HM-Spam-Status: e1kfGhgUHx5ZQUpXWQgPGg8OCBgUHx5ZQUlOS1dZFg8aDwILHllBWSg2Ly tZV1koWUFITzdXWS1ZQUlXWQ8JGhUIEh9ZQVlCHh4YVklLTkxLHh9CTU1LGlUTARMWGhIXJBQOD1 lXWRgSC1lBWUpJSFVKSUtVTklVSUhIWVdZFhoPEhUdFFlBWU9LSFVKSEpOTUlVSktLVUtZBg++ X-HM-Tid: 0a8a8e19c1c8b212kuuubaef74a0314 X-HM-MType: 1 X-HM-Sender-Digest: e1kMHhlZQR0aFwgeV1kSHx4VD1lBWUc6NxA6Lgw*KD1WPw86NQ4VSk03 IS4KFAxVSlVKTUJPTUtKSk1MQ0xNVTMWGhIXVQseFRwfFBUcFxIVOwgaFRwdFAlVGBQWVRgVRVlX WRILWUFZSklIVUpJS1VOSVVJSEhZV1kIAVlBTE9IQzcG Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (snail.vger.email [0.0.0.0]); Wed, 13 Sep 2023 03:33:38 -0700 (PDT) On 2023/9/13 2:46, Alexei Starovoitov wrote: > On Tue, Sep 12, 2023 at 10:03 AM Eduard Zingerman wrote: >> >> On Tue, 2023-09-12 at 09:40 -0700, Alexei Starovoitov wrote: >>> On Tue, Sep 12, 2023 at 7:19 AM Eduard Zingerman wrote: >>>> >>>> On Tue, 2023-09-12 at 16:51 +0300, Eduard Zingerman wrote: >>>>> On Sat, 2023-09-09 at 02:16 -0700, Donglin Peng wrote: >>>>>> Currently, we are only using the linear search method to find the type id >>>>>> by the name, which has a time complexity of O(n). This change involves >>>>>> sorting the names of btf types in ascending order and using binary search, >>>>>> which has a time complexity of O(log(n)). This idea was inspired by the >>>>>> following patch: >>>>>> >>>>>> 60443c88f3a8 ("kallsyms: Improve the performance of kallsyms_lookup_name()"). >>>>>> >>>>>> At present, this improvement is only for searching in vmlinux's and >>>>>> module's BTFs, and the kind should only be BTF_KIND_FUNC or BTF_KIND_STRUCT. >>>>>> >>>>>> Another change is the search direction, where we search the BTF first and >>>>>> then its base, the type id of the first matched btf_type will be returned. >>>>>> >>>>>> Here is a time-consuming result that finding all the type ids of 67,819 kernel >>>>>> functions in vmlinux's BTF by their names: >>>>>> >>>>>> Before: 17000 ms >>>>>> After: 10 ms >>>>>> >>>>>> The average lookup performance has improved about 1700x at the above scenario. >>>>>> >>>>>> However, this change will consume more memory, for example, 67,819 kernel >>>>>> functions will allocate about 530KB memory. >>>>> >>>>> Hi Donglin, >>>>> >>>>> I think this is a good improvement. However, I wonder, why did you >>>>> choose to have a separate name map for each BTF kind? >>>>> >>>>> I did some analysis for my local testing kernel config and got such numbers: >>>>> - total number of BTF objects: 97350 >>>>> - number of FUNC and STRUCT objects: 51597 >>>>> - number of FUNC, STRUCT, UNION, ENUM, ENUM64, TYPEDEF, DATASEC objects: 56817 >>>>> (these are all kinds for which lookup by name might make sense) >>>>> - number of named objects: 54246 >>>>> - number of name collisions: >>>>> - unique names: 53985 counts >>>>> - 2 objects with the same name: 129 counts >>>>> - 3 objects with the same name: 3 counts >>>>> >>>>> So, it appears that having a single map for all named objects makes >>>>> sense and would also simplify the implementation, what do you think? >>>> >>>> Some more numbers for my config: >>>> - 13241 types (struct, union, typedef, enum), log2 13241 = 13.7 >>>> - 43575 funcs, log2 43575 = 15.4 >>>> Thus, having separate map for types vs functions might save ~1.7 >>>> search iterations. Is this a significant slowdown in practice? >>> >>> What do you propose to do in case of duplicates ? >>> func and struct can have the same name, but they will have two different >>> btf_ids. How do we store them ? >>> Also we might add global vars to BTF. Such request came up several times. >>> So we need to make sure our search approach scales to >>> func, struct, vars. I don't recall whether we search any other kinds. >>> Separate arrays for different kinds seems ok. >>> It's a bit of code complexity, but it's not an increase in memory. >> >> Binary search gives, say, lowest index of a thing with name A, then >> increment index while name remains A looking for correct kind. >> Given the name conflicts info from above, 99% of times there would be >> no need to iterate and in very few cases there would a couple of iterations. >> >> Same logic would be necessary with current approach if different BTF >> kinds would be allowed in BTF_ID_NAME_* cohorts. I figured that these >> cohorts are mainly a way to split the tree for faster lookups, but >> maybe that is not the main intent. >> >>> With 13k structs and 43k funcs it's 56k * (4 + 4) that's 0.5 Mbyte >>> extra memory. That's quite a bit. Anything we can do to compress it? >> >> That's an interesting question, from the top of my head: >> pre-sort in pahole (re-assign IDs so that increasing ID also would >> mean "increasing" name), shouldn't be that difficult. > > That sounds great. kallsyms are pre-sorted at build time. > We should do the same with BTF. > I think GCC can emit BTF directly now and LLVM emits it for bpf progs too, > but since vmlinux and kernel module BTFs will keep being processed > through pahole we don't have to make gcc/llvm sort things right away. > pahole will be enough. The kernel might do 'is it sorted' check > during BTF validation and then use binary search or fall back to linear > when not-sorted == old pahole. > Yeah, I agree and will attempt to modify the pahole and perform a test. Do we need to introduce a new macro to control the behavior when the BTF is not sorted? If it is not sorted, we can use the method mentioned in this patch or use linear search.