Received: by 2002:a05:6a10:1a4d:0:0:0:0 with SMTP id nk13csp4333189pxb; Sat, 5 Feb 2022 10:09:23 -0800 (PST) X-Google-Smtp-Source: ABdhPJwQclFjx+HLHY2kb3y29otI7VbXKTexhMyAUxU1VT4D3PWGzU4IK0MqVr5nz3a55wkLdQ6d X-Received: by 2002:a05:6a00:228e:: with SMTP id f14mr8868495pfe.33.1644084562915; Sat, 05 Feb 2022 10:09:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644084562; cv=none; d=google.com; s=arc-20160816; b=07x1iqF0YjVkhwnzVYjxaMdZuauHU0fJlx0dlHK2vLKkhPFE1T+99rAvbBi9Rhdtaw Nh2UtIcc/5rkol2/iGz0yDpKq0kvpSqfkSiA7+hyp6LgSel6nvtBpuqPvwgpNDPByvwe 8xZfV+lPyP8wBvs6blbTzLPCohme1N09OpqFydJZbcLMHAFVg1Vi51EcAF9wXGtKAU3e uaiLzSlxFbIObr+P4mlAQV8oYyoMc+MKhrbC3p7xiI87MuBbATkJb1DWgD3mJpnSjjrY g6OUGI4GdeSSD57SfhkX/kp5JXxMcRoD+et5eFdMPqBkxpzhqBEM+vu6+fvG1vEXtL8+ 6iFw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=P+D81v5KEr4fisu8j0Gre66sE16AgYibENuanaS8Lwo=; b=QHll3DHckykMXEGoaLxOaIg+ougWnF+y09rQiUhsEdtDnvcHcG+TgqbfHli3IiwLrI dINFokPQo5mOFqbgV/CIcmSv74lbhkm/jBFxNo1KjSBrI6/FMG1BrBScsnlfKPi58gHb 6pi9MbxSDdJqmArtb4xsfXQoohlIuAPRj2DmHf/zXfmPSVvHCyFU3EnZEdSD2j0LCQCM G5HdlcRlmLYIRCi7CT6vfnq4G5Z+CTf78s8d7FrHjFbPCSU3Ydu/qJ40+OxNTsekD0/J DNlSR2GcsrIeVx7vSoVrwx6rt86Uvy+VWtnSqmI/Fqr7jiFIUjVBxpi+lFC6QenmpKty gMqg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id 11si5079796pgx.754.2022.02.05.10.09.09; Sat, 05 Feb 2022 10:09:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229714AbiBDS77 convert rfc822-to-8bit (ORCPT + 99 others); Fri, 4 Feb 2022 13:59:59 -0500 Received: from mx0b-00082601.pphosted.com ([67.231.153.30]:47778 "EHLO mx0b-00082601.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229457AbiBDS76 (ORCPT ); Fri, 4 Feb 2022 13:59:58 -0500 Received: from pps.filterd (m0148460.ppops.net [127.0.0.1]) by mx0a-00082601.pphosted.com (8.16.1.2/8.16.1.2) with ESMTP id 214IithJ025902 for ; Fri, 4 Feb 2022 10:59:58 -0800 Received: from mail.thefacebook.com ([163.114.132.120]) by mx0a-00082601.pphosted.com (PPS) with ESMTPS id 3e0v4gcenq-3 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NOT) for ; Fri, 04 Feb 2022 10:59:57 -0800 Received: from twshared11487.23.frc3.facebook.com (2620:10d:c085:208::11) by mail.thefacebook.com (2620:10d:c085:21d::4) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.2308.21; Fri, 4 Feb 2022 10:59:56 -0800 Received: by devbig006.ftw2.facebook.com (Postfix, from userid 4523) id 9CDC5296C5722; Fri, 4 Feb 2022 10:57:49 -0800 (PST) From: Song Liu To: , , CC: , , , , , , , Song Liu Subject: [PATCH v9 bpf-next 0/9] bpf_prog_pack allocator Date: Fri, 4 Feb 2022 10:57:33 -0800 Message-ID: <20220204185742.271030-1-song@kernel.org> X-Mailer: git-send-email 2.30.2 MIME-Version: 1.0 Content-Transfer-Encoding: 8BIT X-FB-Internal: Safe Content-Type: text/plain X-Proofpoint-GUID: S0IO6EHucRqtp3mVsFFwTqqgTTdzs0X7 X-Proofpoint-ORIG-GUID: S0IO6EHucRqtp3mVsFFwTqqgTTdzs0X7 X-Proofpoint-Virus-Version: vendor=baseguard engine=ICAP:2.0.205,Aquarius:18.0.816,Hydra:6.0.425,FMLib:17.11.62.513 definitions=2022-02-04_07,2022-02-03_01,2021-12-02_01 X-Proofpoint-Spam-Details: rule=fb_outbound_notspam policy=fb_outbound score=0 priorityscore=1501 lowpriorityscore=0 malwarescore=0 mlxlogscore=719 clxscore=1015 suspectscore=0 spamscore=0 phishscore=0 adultscore=0 impostorscore=0 mlxscore=0 bulkscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2201110000 definitions=main-2202040104 X-FB-Internal: deliver Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Changes v8 => v9: 1. Fix an error with multi function program, in 4/9. Changes v7 => v8: 1. Rebase and fix conflicts. 2. Lock text_mutex for text_poke_copy. (Daniel) Changes v6 => v7: 1. Redesign the interface between generic and arch logic, based on feedback from Alexei and Ilya. 2. Split 6/7 of v6 to 7/9 and 8/9 in v7, for cleaner logic. 3. Add bpf_arch_text_copy in 6/9. Changes v5 => v6: 1. Make jit_hole_buffer 128 byte long. Only fill the first and last 128 bytes of header with INT3. (Alexei) 2. Use kvmalloc for temporary buffer. (Alexei) 3. Rename tmp_header/tmp_image => rw_header/rw_image. Remove tmp_image from x64_jit_data. (Alexei) 4. Change fall back round_up_to in bpf_jit_binary_alloc_pack() from BPF_PROG_MAX_PACK_PROG_SIZE to PAGE_SIZE. Changes v4 => v5: 1. Do not use atomic64 for bpf_jit_current. (Alexei) Changes v3 => v4: 1. Rename text_poke_jit() => text_poke_copy(). (Peter) 2. Change comment style. (Peter) Changes v2 => v3: 1. Fix tailcall. Changes v1 => v2: 1. Use text_poke instead of writing through linear mapping. (Peter) 2. Avoid making changes to non-x86_64 code. Most BPF programs are small, but they consume a page each. For systems with busy traffic and many BPF programs, this could also add significant pressure to instruction TLB. High iTLB pressure usually causes slow down for the whole system, which includes visible performance degradation for production workloads. This set tries to solve this problem with customized allocator that pack multiple programs into a huge page. Patches 1-6 prepare the work. Patch 7 contains key logic of bpf_prog_pack allocator. Patch 8 contains bpf_jit_binary_pack_alloc logic on top of bpf_prog_pack allocator. Patch 9 uses this allocator in x86_64 jit. Song Liu (9): x86/Kconfig: select HAVE_ARCH_HUGE_VMALLOC with HAVE_ARCH_HUGE_VMAP bpf: use bytes instead of pages for bpf_jit_[charge|uncharge]_modmem bpf: use size instead of pages in bpf_binary_header bpf: use prog->jited_len in bpf_prog_ksym_set_addr() x86/alternative: introduce text_poke_copy bpf: introduce bpf_arch_text_copy bpf: introduce bpf_prog_pack allocator bpf: introduce bpf_jit_binary_pack_[alloc|finalize|free] bpf, x86_64: use bpf_jit_binary_pack_alloc arch/x86/Kconfig | 1 + arch/x86/include/asm/text-patching.h | 1 + arch/x86/kernel/alternative.c | 34 ++++ arch/x86/net/bpf_jit_comp.c | 65 ++++--- include/linux/bpf.h | 7 +- include/linux/filter.h | 27 +-- kernel/bpf/core.c | 269 +++++++++++++++++++++++++-- kernel/bpf/trampoline.c | 6 +- kernel/bpf/verifier.c | 1 + 9 files changed, 349 insertions(+), 62 deletions(-) -- 2.30.2