Received: by 2002:a05:6a10:1d13:0:0:0:0 with SMTP id pp19csp46610pxb; Wed, 18 Aug 2021 15:35:53 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzulgI+qHaUnfKOSgiH9zIEQQbrDP5ET+jgPbQwTYgUsLojgezVaFm49mBoPuIV3vZQi3uQ X-Received: by 2002:a17:906:c087:: with SMTP id f7mr11765439ejz.487.1629326152820; Wed, 18 Aug 2021 15:35:52 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1629326152; cv=none; d=google.com; s=arc-20160816; b=0M3w0YQHgkRNh5z+SOUzBTkOTqAGAyHM22R2e9Sujr58K3UJUSLc+Q+JLGt68zVKl1 ElZSc3v07GK9SQZKyUaUKPEK2e5tXFHe8b1RCG00yNNanaqt9IDGZb1Srfy+aI6b5Ikp 4VgmMtP9HdVX61NTaWCiMTusa72BfuD6pkApxGNps7v3j6JZ3wmgs52X0L2WXVFzl2sZ i4EJuwu/q9D7R2irj+V6UWQ2X+jyvu2aZdvtlPLfQZcRNHFplA+zo0K+Sa2yTuvetVVN AzV0zHnH6ZMgAals/mtdax2aUEwXobcyOWjGTaOM7+B+MlNara9smgLxmdNhkX6i/a7v CRCw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=P+5mSEtJrjnggVNveih68RddTImV6HnC03hWsATd0wA=; b=En/7gfOo6g6wNCufH4TBdUzN1W+uTEOrCzz4Zwkx0G/4c4w8JL7/1F03BXfX8Cx7jh 0yTgQxmAIzpQVgPEmKCNF5RXmOBHCcajljOjx40j0Ww/VC9oGdsb1rLH1W1bkkNOyLnR rhyDx+0hd+f+sYTzTHlnHcy4rAx82xMr4vTrSUyvTmhPv4ieFjhCCfhS2ic/ZQ8aZ/P6 ZE1mKqio+WULHILGocZuWZ/p4G6TfLeWyhzrELuqL935vtAFLzlC20pPKKL2U7jfE8vm N7bdLm/Ae9uzNJrgUVvfjTNEqbo1PEzYjvMRrVvMkHcasrC4FjTfvIuA/Dd1YEdtHyIt p8cA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel-com.20150623.gappssmtp.com header.s=20150623 header.b=kRIj8JIq; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id jz19si1415048ejb.37.2021.08.18.15.35.29; Wed, 18 Aug 2021 15:35:52 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@intel-com.20150623.gappssmtp.com header.s=20150623 header.b=kRIj8JIq; spf=pass (google.com: domain of linux-wireless-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-wireless-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232456AbhHRWgA (ORCPT + 99 others); Wed, 18 Aug 2021 18:36:00 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50388 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234246AbhHRWgA (ORCPT ); Wed, 18 Aug 2021 18:36:00 -0400 Received: from mail-pg1-x532.google.com (mail-pg1-x532.google.com [IPv6:2607:f8b0:4864:20::532]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 1E018C0613D9 for ; Wed, 18 Aug 2021 15:35:25 -0700 (PDT) Received: by mail-pg1-x532.google.com with SMTP id c17so3891374pgc.0 for ; Wed, 18 Aug 2021 15:35:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=intel-com.20150623.gappssmtp.com; s=20150623; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=P+5mSEtJrjnggVNveih68RddTImV6HnC03hWsATd0wA=; b=kRIj8JIqpPRaKtf6CIkL46zXyN+TG7NSk6bw8gu7+8srWnLtPpMsCTfSAl6mGxQkPO AO2tJAjBgrQqg48z/R0w1gBkmv7qgCaYMxOgKctRX1l4WVxZaQ6vNnvX5LAGRKfFudjG PsQYY9oWNpYKfElRnHpaNeEhFchqOLvvFftCGuTzj/AyqbjlyvdzlVVTJ1IB4Ngv7/Ms u4r7kwHlTNVtr67hCv3yPPJX5ARYvjGYympkKth1dgkHgrQuh2AbLYKgiGUMqV8NtvDX +pRCiBA2g2BclAPoPSaP8V7pu3JHnBNUDDtuik7qEFxuxYTic4rAadkAFa3uqJjCsduq ulwQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=P+5mSEtJrjnggVNveih68RddTImV6HnC03hWsATd0wA=; b=kcNIw51ma+InZY/38jXB/Se2T7qt/UXcvzlW04w+UNapQNyP3fLKuiHR1S3lifwnm6 xGeSObPWDU7mais3szpc5zc1WnIHEIcwgzBQ6GVk1ao6BxH3YQ5apu5XdVAOHhaP0bcp AWhcPSiD91U0OkwxVnzl6hI6dO2iuy3NFvWRUj4BiX2auZLQ16fxZEbiccVHNRELRD9K 82aFcTFmuQOdZU4LG+xa4TAXpztrFa76zyf+OcJhnE6lxskcS/znVe77bbRISlYO9xB9 +xp13o+fMfW/b1hJmfRIPFY1iZffq8HL0ow68gcQEMGLF0u1ZAQ7IeQIiRY9OtxbZDfq VJ/Q== X-Gm-Message-State: AOAM53039oXABCH487g8S40BbfIfKwUWPdx+aM+57IBJsV4b4OdVLV33 l6uEEFmLRQ6XxrWaa92gAR8tub/sVjaJ+dW5rpuUzQ== X-Received: by 2002:a63:311:: with SMTP id 17mr10873581pgd.450.1629326124479; Wed, 18 Aug 2021 15:35:24 -0700 (PDT) MIME-Version: 1.0 References: <20210818060533.3569517-1-keescook@chromium.org> <20210818060533.3569517-6-keescook@chromium.org> In-Reply-To: <20210818060533.3569517-6-keescook@chromium.org> From: Dan Williams Date: Wed, 18 Aug 2021 15:35:13 -0700 Message-ID: Subject: Re: [PATCH v2 05/63] stddef: Introduce struct_group() helper macro To: Kees Cook Cc: Linux Kernel Mailing List , Keith Packard , "Gustavo A . R . Silva" , Rasmus Villemoes , Daniel Vetter , Greg Kroah-Hartman , Andrew Morton , Linux Wireless List , Netdev , Maling list - DRI developers , linux-staging@lists.linux.dev, linux-block@vger.kernel.org, Linux Kbuild mailing list , clang-built-linux , linux-hardening@vger.kernel.org Content-Type: text/plain; charset="UTF-8" Precedence: bulk List-ID: X-Mailing-List: linux-wireless@vger.kernel.org On Tue, Aug 17, 2021 at 11:06 PM Kees Cook wrote: > > Kernel code has a regular need to describe groups of members within a > structure usually when they need to be copied or initialized separately > from the rest of the surrounding structure. The generally accepted design > pattern in C is to use a named sub-struct: > > struct foo { > int one; > struct { > int two; > int three, four; > } thing; > int five; > }; > > This would allow for traditional references and sizing: > > memcpy(&dst.thing, &src.thing, sizeof(dst.thing)); > > However, doing this would mean that referencing struct members enclosed > by such named structs would always require including the sub-struct name > in identifiers: > > do_something(dst.thing.three); > > This has tended to be quite inflexible, especially when such groupings > need to be added to established code which causes huge naming churn. > Three workarounds exist in the kernel for this problem, and each have > other negative properties. > > To avoid the naming churn, there is a design pattern of adding macro > aliases for the named struct: > > #define f_three thing.three > > This ends up polluting the global namespace, and makes it difficult to > search for identifiers. > > Another common work-around in kernel code avoids the pollution by avoiding > the named struct entirely, instead identifying the group's boundaries using > either a pair of empty anonymous structs of a pair of zero-element arrays: > > struct foo { > int one; > struct { } start; > int two; > int three, four; > struct { } finish; > int five; > }; > > struct foo { > int one; > int start[0]; > int two; > int three, four; > int finish[0]; > int five; > }; > > This allows code to avoid needing to use a sub-struct named for member > references within the surrounding structure, but loses the benefits of > being able to actually use such a struct, making it rather fragile. Using > these requires open-coded calculation of sizes and offsets. The efforts > made to avoid common mistakes include lots of comments, or adding various > BUILD_BUG_ON()s. Such code is left with no way for the compiler to reason > about the boundaries (e.g. the "start" object looks like it's 0 bytes > in length), making bounds checking depend on open-coded calculations: > > if (length > offsetof(struct foo, finish) - > offsetof(struct foo, start)) > return -EINVAL; > memcpy(&dst.start, &src.start, offsetof(struct foo, finish) - > offsetof(struct foo, start)); > > However, the vast majority of places in the kernel that operate on > groups of members do so without any identification of the grouping, > relying either on comments or implicit knowledge of the struct contents, > which is even harder for the compiler to reason about, and results in > even more fragile manual sizing, usually depending on member locations > outside of the region (e.g. to copy "two" and "three", use the start of > "four" to find the size): > > BUILD_BUG_ON((offsetof(struct foo, four) < > offsetof(struct foo, two)) || > (offsetof(struct foo, four) < > offsetof(struct foo, three)); > if (length > offsetof(struct foo, four) - > offsetof(struct foo, two)) > return -EINVAL; > memcpy(&dst.two, &src.two, length); > > In order to have a regular programmatic way to describe a struct > region that can be used for references and sizing, can be examined for > bounds checking, avoids forcing the use of intermediate identifiers, > and avoids polluting the global namespace, introduce the struct_group() > macro. This macro wraps the member declarations to create an anonymous > union of an anonymous struct (no intermediate name) and a named struct > (for references and sizing): > > struct foo { > int one; > struct_group(thing, > int two; > int three, four; > ); > int five; > }; > > if (length > sizeof(src.thing)) > return -EINVAL; > memcpy(&dst.thing, &src.thing, length); > do_something(dst.three); > > There are some rare cases where the resulting struct_group() needs > attributes added, so struct_group_attr() is also introduced to allow > for specifying struct attributes (e.g. __align(x) or __packed). > Additionally, there are places where such declarations would like to > have the struct be typed, so struct_group_typed() is added. > > Given there is a need for a handful of UAPI uses too, the underlying > __struct_group() macro has been defined in UAPI so it can be used there > too. > > Co-developed-by: Keith Packard > Signed-off-by: Keith Packard > Signed-off-by: Kees Cook > Acked-by: Gustavo A. R. Silva > Link: https://lore.kernel.org/lkml/20210728023217.GC35706@embeddedor > Enhanced-by: Rasmus Villemoes > Link: https://lore.kernel.org/lkml/41183a98-bdb9-4ad6-7eab-5a7292a6df84@rasmusvillemoes.dk > Enhanced-by: Dan Williams Acked-by: Dan Williams