Date: Mon, 27 Nov 2023 18:01:03 -0600
From: Daniel Xu
To: Yonghong Song
Cc: Eduard Zingerman, Alexei Starovoitov, Shuah Khan, Daniel Borkmann,
    Andrii Nakryiko, Alexei Starovoitov, Steffen Klassert,
    antony.antony@secunet.com, Mykola Lysenko, Martin KaFai Lau, Song Liu,
    John Fastabend, KP Singh, Stanislav Fomichev, Hao Luo, Jiri Olsa, bpf,
    "open list:KERNEL SELFTEST FRAMEWORK", LKML, devel@linux-ipsec.org,
    Network Development
Subject: Re: [PATCH ipsec-next v1 6/7] bpf: selftests: test_tunnel: Disable CO-RE relocations
Message-ID: <53jaqi72ef4gynyafxidl5veb54kfs7dttxezkarwg75t7szd4@cvfg5pc7pyum>
References: <391d524c496acc97a8801d8bea80976f58485810.1700676682.git.dxu@dxuuu.xyz>
    <0f210cef-c6e9-41c1-9ba8-225f046435e5@linux.dev>
    <3ec6c068-7f95-419a-a0ae-a901f95e4838@linux.dev>
    <18e43cdf65e7ba0d8f6912364fbc5b08a6928b35.camel@gmail.com>
    <0535eb913f1a0c2d3c291478fde07e0aa2b333f1.camel@gmail.com>
    <42f9bf0d-695a-412d-bea5-cb7036fa7418@linux.dev>
X-Mailing-List: linux-kernel@vger.kernel.org

On Mon, Nov 27, 2023 at 02:45:11PM -0600, Daniel Xu wrote:
> On Sun, Nov 26, 2023 at 09:53:04PM -0800, Yonghong Song wrote:
> >
> > On 11/27/23 12:44 AM, Yonghong Song wrote:
> > >
> > > On 11/26/23 8:52 PM, Eduard Zingerman wrote:
> > > > On Sun, 2023-11-26 at 18:04 -0600, Daniel Xu wrote:
> > > > [...]
> > > > > > Tbh I'm not sure.
> > > > > > This test passes with preserve_static_offset
> > > > > > because it suppresses preserve_access_index. In general clang
> > > > > > translates bitfield access to a set of IR statements like:
> > > > > >
> > > > > >   C:
> > > > > >     struct foo {
> > > > > >       unsigned _;
> > > > > >       unsigned a:1;
> > > > > >       ...
> > > > > >     };
> > > > > >     ... foo->a ...
> > > > > >
> > > > > >   IR:
> > > > > >     %a = getelementptr inbounds %struct.foo, ptr %0, i32 0, i32 1
> > > > > >     %bf.load = load i8, ptr %a, align 4
> > > > > >     %bf.clear = and i8 %bf.load, 1
> > > > > >     %bf.cast = zext i8 %bf.clear to i32
> > > > > >
> > > > > > With preserve_static_offset the getelementptr+load are replaced by a
> > > > > > single statement which is preserved as-is till code generation,
> > > > > > thus load with align 4 is preserved.
> > > > > >
> > > > > > On the other hand, I'm not sure that clang guarantees that load or
> > > > > > stores used for bitfield access would be always aligned according to
> > > > > > verifier expectations.
> > > > > >
> > > > > > I think we should check if there are some clang knobs that prevent
> > > > > > generation of unaligned memory access. I'll take a look.
> > > > > Is there a reason to prefer fixing in compiler? I'm not opposed to it,
> > > > > but the downside to compiler fix is it takes years to propagate and
> > > > > sprinkles ifdefs into the code.
> > > > >
> > > > > Would it be possible to have an analogue of BPF_CORE_READ_BITFIELD()?
> > > > Well, the contraption below passes verification, tunnel selftest
> > > > appears to work. I might have messed up some shifts in the macro,
> > > > though.
> > >
> > > I didn't test it. But from high level it should work.
> > >
> > > >
> > > > Still, if clang would peek unlucky BYTE_{OFFSET,SIZE} for a particular
> > > > field access might be unaligned.
> > >
> > > clang should pick a sensible BYTE_SIZE/BYTE_OFFSET to meet
> > > alignment requirement.
> > > This is also required for BPF_CORE_READ_BITFIELD.
> > >
> > > >
> > > > ---
> > > >
> > > > diff --git a/tools/testing/selftests/bpf/progs/test_tunnel_kern.c
> > > > b/tools/testing/selftests/bpf/progs/test_tunnel_kern.c
> > > > index 3065a716544d..41cd913ac7ff 100644
> > > > --- a/tools/testing/selftests/bpf/progs/test_tunnel_kern.c
> > > > +++ b/tools/testing/selftests/bpf/progs/test_tunnel_kern.c
> > > > @@ -9,6 +9,7 @@
> > > >   #include "vmlinux.h"
> > > >   #include
> > > >   #include
> > > > +#include
> > > >   #include "bpf_kfuncs.h"
> > > >   #include "bpf_tracing_net.h"
> > > >   @@ -144,6 +145,38 @@ int ip6gretap_get_tunnel(struct __sk_buff *skb)
> > > >       return TC_ACT_OK;
> > > >   }
> > > >   +#define BPF_CORE_WRITE_BITFIELD(s, field, new_val) ({            \
> > > > +    void *p = (void *)s + __CORE_RELO(s, field, BYTE_OFFSET);    \
> > > > +    unsigned byte_size = __CORE_RELO(s, field, BYTE_SIZE);       \
> > > > +    unsigned lshift = __CORE_RELO(s, field, LSHIFT_U64);         \
> > > > +    unsigned rshift = __CORE_RELO(s, field, RSHIFT_U64);         \
> > > > +    unsigned bit_size = (rshift - lshift);                       \
> > > > +    unsigned long long nval, val, hi, lo;                        \
> > > > +                                                                 \
> > > > +    asm volatile("" : "=r"(p) : "0"(p));                         \
> > >
> > > Use asm volatile("" : "+r"(p)) ?
> > >
> > > > +                                                                 \
> > > > +    switch (byte_size) {                                         \
> > > > +    case 1: val = *(unsigned char *)p; break;                    \
> > > > +    case 2: val = *(unsigned short *)p; break;                   \
> > > > +    case 4: val = *(unsigned int *)p; break;                     \
> > > > +    case 8: val = *(unsigned long long *)p; break;               \
> > > > +    }                                                            \
> > > > +    hi = val >> (bit_size + rshift);                             \
> > > > +    hi <<= bit_size + rshift;                                    \
> > > > +    lo = val << (bit_size + lshift);                             \
> > > > +    lo >>= bit_size + lshift;                                    \
> > > > +    nval = new_val;                                              \
> > > > +    nval <<= lshift;                                             \
> > > > +    nval >>= rshift;                                             \
> > > > +    val = hi | nval | lo;                                        \
> > > > +    switch (byte_size) {                                         \
> > > > +    case 1: *(unsigned char *)p      = val; break;               \
> > > > +    case 2: *(unsigned short *)p     = val; break;               \
> > > > +    case 4: *(unsigned int *)p       = val; break;               \
> > > > +    case 8: *(unsigned long long *)p = val; break;               \
> > > > +    }                                                            \
> > > > +})
> > >
> > > I think this should be put in libbpf public header files but not sure
> > > where to put it. bpf_core_read.h although it is core write?
> > >
> > > But on the other hand, this is a uapi struct bitfield write,
> > > strictly speaking, CORE write is really unnecessary here. It
> > > would be great if we can relieve users from dealing with
> > > such unnecessary CORE writes. In that sense, for this particular
> > > case, I would prefer rewriting the code by using byte-level
> > > stores...
> > or preserve_static_offset to clearly mean to undo bitfield CORE ...
>
> Ok, I will do byte-level rewrite for next revision.

[...]

This patch seems to work: https://pastes.dxuuu.xyz/0glrf9 . But I don't
think it's very pretty. I'm also reading that the exact layout of
bitfields is compiler dependent, so I am wondering whether these
byte-sized writes are correct. For that matter, I am wondering how the
GCC-generated bitfield accesses line up with the clang-generated BPF
bytecode. Or why uapi contains a bitfield at all.

WDYT, should I send v2 with this, or should I take one of the other
approaches in this thread? I am ok with any of them.

Thanks,
Daniel
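
For reference, a byte-level store to a bitfield boils down to a masked
read-modify-write of the single byte that holds the field. Below is a
minimal standalone sketch of that idea. The helper names store_bits_u8()
and load_bits_u8() are hypothetical (they are not from libbpf, the kernel,
or the patch linked above), and the caller is assumed to already know the
field's bit position. C leaves the allocation order of bit-fields within a
storage unit implementation-defined, so the shift for a given field comes
from the target ABI and endianness; the sketch takes it as a parameter
rather than guessing it.

```c
#include <assert.h>
#include <stdint.h>

/*
 * Hypothetical helper (illustration only): overwrite `width` bits of
 * *byte, starting at bit `shift` (0 = least significant bit), with the
 * low `width` bits of new_val. All other bits are left untouched.
 */
static inline void store_bits_u8(uint8_t *byte, unsigned shift,
				 unsigned width, uint8_t new_val)
{
	/* The field must fit inside the one byte being accessed. */
	assert(width >= 1 && shift + width <= 8);

	uint8_t mask = (uint8_t)(((1u << width) - 1u) << shift);

	/* Clear the field, then merge in the (truncated) new value. */
	*byte = (uint8_t)((*byte & ~mask) | ((new_val << shift) & mask));
}

/* Hypothetical counterpart: read the same field back out of the byte. */
static inline uint8_t load_bits_u8(const uint8_t *byte, unsigned shift,
				   unsigned width)
{
	assert(width >= 1 && shift + width <= 8);

	return (uint8_t)((*byte >> shift) & ((1u << width) - 1u));
}
```

Calling store_bits_u8(&b, 2, 3, 0x6) on b = 0xA5 rewrites only bits 2..4
and leaves the other five bits as they were. Because every access is a
one-byte load or store, it cannot be misaligned, which is the property the
verifier discussion above is about. Whether (shift, width) matches what GCC
and clang each pick for a particular uapi field is exactly the open
question, and this sketch does not answer it.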