Date: Fri, 20 Dec 2019 15:33:39 +0100
User-Agent: K-9 Mail for Android
In-Reply-To: <20191220140328.20907-2-cyphar@cyphar.com>
References: <20191220140328.20907-1-cyphar@cyphar.com>
 <20191220140328.20907-2-cyphar@cyphar.com>
Subject: Re: [PATCH v2 1/2] openat2: drop open_how->__padding field
To: containers@lists.linux-foundation.org, Aleksa Sarai, Alexander Viro,
 Jeff Layton, "J. Bruce Fields", Shuah Khan, Arnd Bergmann
CC: libc-alpha@sourceware.org, linux-api@vger.kernel.org,
 linux-kernel@vger.kernel.org, dev@opencontainers.org, David Laight,
 linux-kselftest@vger.kernel.org, linux-fsdevel@vger.kernel.org
From: Christian Brauner
Message-ID: <88F5533A-9C91-41D1-A2E6-3622FCEEEDB1@ubuntu.com>
X-Mailing-List: linux-kernel@vger.kernel.org

[Cc Arnd for struct layout sanity checking]

On December 20, 2019 3:03:27 PM GMT+01:00, Aleksa Sarai wrote:
>The purpose of explicit padding was to allow us to use the space in the
>future (C provides no guarantee about the value of padding bytes and
>thus userspace could've provided garbage).
>
>However, the downside of explicit padding is that any extension we wish
>to add should fit the space exactly (otherwise we may end up with a u16
>which will never be used). In addition, the correct error to return for
>non-zero padding is not clear (-EINVAL doesn't imply "you're using an
>extension field unsupported by this kernel", but -E2BIG seems a bit odd
>if the structure size isn't different).
>
>The simplest solution is to just match the design of clone3(2) -- use
>u64s for all fields. The extra few-bytes cost of extra fields is not
>significant (it's unlikely configuration structs will ever be extremely
>large) and it allows for more flag space if necessary.
>There is also no need to align the u64s because we will not permit any
>padding in the structure.
>
>As openat2(2) is not yet in Linus's tree, we can iron out these minor
>warts before we commit to this as a stable ABI.
>
>Acked-by: Christian Brauner
>Suggested-by: David Laight
>Signed-off-by: Aleksa Sarai
>---
> fs/open.c                                     |  2 --
> include/uapi/linux/fcntl.h                    | 17 +++++++------
> tools/testing/selftests/openat2/helpers.h     |  7 +++---
> .../testing/selftests/openat2/openat2_test.c  | 24 +++++++------------
> 4 files changed, 19 insertions(+), 31 deletions(-)
>
>diff --git a/fs/open.c b/fs/open.c
>index 50a46501bcc9..8cdb2b675867 100644
>--- a/fs/open.c
>+++ b/fs/open.c
>@@ -993,8 +993,6 @@ static inline int build_open_flags(const struct open_how *how,
> 		return -EINVAL;
> 	if (how->resolve & ~VALID_RESOLVE_FLAGS)
> 		return -EINVAL;
>-	if (memchr_inv(how->__padding, 0, sizeof(how->__padding)))
>-		return -EINVAL;
>
> 	/* Deal with the mode. */
> 	if (WILL_CREATE(flags)) {
>diff --git a/include/uapi/linux/fcntl.h b/include/uapi/linux/fcntl.h
>index d886bdb585e4..5aaadfd79dd5 100644
>--- a/include/uapi/linux/fcntl.h
>+++ b/include/uapi/linux/fcntl.h
>@@ -101,22 +101,21 @@
> #define AT_RECURSIVE		0x8000	/* Apply to the entire subtree */
>
> /*
>- * Arguments for how openat2(2) should open the target path. If @resolve is
>- * zero, then openat2(2) operates very similarly to openat(2).
>+ * Arguments for how openat2(2) should open the target path. If only @flags and
>+ * @mode are non-zero, then openat2(2) operates very similarly to openat(2).
>  *
>- * However, unlike openat(2), unknown bits in @flags result in -EINVAL rather
>- * than being silently ignored. @mode must be zero unless one of {O_CREAT,
>- * O_TMPFILE} are set.
>+ * However, unlike openat(2), unknown or invalid bits in @flags result in
>+ * -EINVAL rather than being silently ignored. @mode must be zero unless one of
>+ * {O_CREAT, O_TMPFILE} are set.
>  *
>  * @flags: O_* flags.
>  * @mode: O_CREAT/O_TMPFILE file mode.
>  * @resolve: RESOLVE_* flags.
>  */
> struct open_how {
>-	__aligned_u64 flags;
>-	__u16 mode;
>-	__u16 __padding[3]; /* must be zeroed */
>-	__aligned_u64 resolve;
>+	__u64 flags;
>+	__u64 mode;
>+	__u64 resolve;
> };
>
> #define OPEN_HOW_SIZE_VER0	24 /* sizeof first published struct */
>diff --git a/tools/testing/selftests/openat2/helpers.h b/tools/testing/selftests/openat2/helpers.h
>index 43ca5ceab6e3..a6ea27344db2 100644
>--- a/tools/testing/selftests/openat2/helpers.h
>+++ b/tools/testing/selftests/openat2/helpers.h
>@@ -36,10 +36,9 @@
>  * @resolve: RESOLVE_* flags.
>  */
> struct open_how {
>-	__aligned_u64 flags;
>-	__u16 mode;
>-	__u16 __padding[3]; /* must be zeroed */
>-	__aligned_u64 resolve;
>+	__u64 flags;
>+	__u64 mode;
>+	__u64 resolve;
> };
>
> #define OPEN_HOW_SIZE_VER0	24 /* sizeof first published struct */
>diff --git a/tools/testing/selftests/openat2/openat2_test.c b/tools/testing/selftests/openat2/openat2_test.c
>index 0b64fedc008b..b386367c606b 100644
>--- a/tools/testing/selftests/openat2/openat2_test.c
>+++ b/tools/testing/selftests/openat2/openat2_test.c
>@@ -40,7 +40,7 @@ struct struct_test {
> 	int err;
> };
>
>-#define NUM_OPENAT2_STRUCT_TESTS 10
>+#define NUM_OPENAT2_STRUCT_TESTS 7
> #define NUM_OPENAT2_STRUCT_VARIATIONS 13
>
> void test_openat2_struct(void)
>@@ -57,20 +57,6 @@ void test_openat2_struct(void)
> 		  .arg.inner.flags = O_RDONLY,
> 		  .size = sizeof(struct open_how_ext) },
>
>-		/* Normal struct with broken padding. */
>-		{ .name = "normal struct (non-zero padding[0])",
>-		  .arg.inner.flags = O_RDONLY,
>-		  .arg.inner.__padding = {0xa0, 0x00, 0x00},
>-		  .size = sizeof(struct open_how_ext), .err = -EINVAL },
>-		{ .name = "normal struct (non-zero padding[1])",
>-		  .arg.inner.flags = O_RDONLY,
>-		  .arg.inner.__padding = {0x00, 0x1a, 0x00},
>-		  .size = sizeof(struct open_how_ext), .err = -EINVAL },
>-		{ .name = "normal struct (non-zero padding[2])",
>-		  .arg.inner.flags = O_RDONLY,
>-		  .arg.inner.__padding = {0x00, 0x00, 0xef},
>-		  .size = sizeof(struct open_how_ext), .err = -EINVAL },
>-
> 		/* TODO: Once expanded, check zero-padding. */
>
> 		/* Smaller than version-0 struct. */
>@@ -169,7 +155,7 @@ struct flag_test {
> 	int err;
> };
>
>-#define NUM_OPENAT2_FLAG_TESTS 21
>+#define NUM_OPENAT2_FLAG_TESTS 23
>
> void test_openat2_flags(void)
> {
>@@ -214,9 +200,15 @@ void test_openat2_flags(void)
> 		{ .name = "invalid how.mode and O_CREAT",
> 		  .how.flags = O_CREAT,
> 		  .how.mode = 0xFFFF, .err = -EINVAL },
>+		{ .name = "invalid (very large) how.mode and O_CREAT",
>+		  .how.flags = O_CREAT,
>+		  .how.mode = 0xC000000000000000ULL, .err = -EINVAL },
> 		{ .name = "invalid how.mode and O_TMPFILE",
> 		  .how.flags = O_TMPFILE | O_RDWR,
> 		  .how.mode = 0x1337, .err = -EINVAL },
>+		{ .name = "invalid (very large) how.mode and O_TMPFILE",
>+		  .how.flags = O_TMPFILE | O_RDWR,
>+		  .how.mode = 0x0000A00000000000ULL, .err = -EINVAL },
>
> 		/* ->resolve must only contain RESOLVE_* flags. */
> 		{ .name = "invalid how.resolve and O_RDONLY",