Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp1149155ybl; Wed, 28 Aug 2019 10:18:55 -0700 (PDT) X-Google-Smtp-Source: APXvYqySVaN470nrTadjqUZliXzLBGc3NsM6vqsd06vf8hqYyJ2MkAc2pKdfxymS2OlPQ/MMVTBF X-Received: by 2002:a17:902:b582:: with SMTP id a2mr5404920pls.199.1567012735265; Wed, 28 Aug 2019 10:18:55 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1567012735; cv=none; d=google.com; s=arc-20160816; b=oMl2Il0iGzKQyw1zIX0mGQ5PMmFVsxd9HKIE0XxnRm6/PwaKYtkdyZAumc1UBrJeho 9xh+mmLtPhCuoA3CBBomtjmPkmBsOpwIUKIK6RBK4KIonSKJfxMl5hcAP9SyUzN9p7+Y KVZrTdwXz1hy3YGC4FBwk9/i1ERXXF+rPPdF/LMZPgXMNLmWcvz6Wcghy6EuIcJdGfM0 mRN93K6W/xzxuGtBqPXEOvnIbXX5TOgD/flQ1cXQHgO81nyovemAPQRyIOTVH6SxSPQ/ BpM8d9cGc0WboAqUubkOL6E2v9CxBCA27oYD4Jb5iAFwkxovYhGhFD4U7wt+9+y8ZZH9 LCog== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=eaCLGK9FXrF5m526qncRT9MCFot6dMXXZVurplrH24k=; b=MKkGF2rMYjExCAR1Speubm3PqsLUhDPNNBYnAmzbtqE6eIvJmpY/K/8NjElUSee06Q X4xTCzKAE+xduMWmLMgiZz1pBuAITGJihWcY8QnrdLi45I9EizvGbXP8WiGDNBzeWIe5 6mvmkGi3jss5QA9nMmApE081yua0WczSVlqFXouX01tBTWTvarANWLwMKrnECFHsm+OI leLbGYGxMmm69wWfp9+M4bWAAn0Kfudrr3gndukedgBoOrZvMAhHSi12N2oP072T3lzN cBzFd8Z1CgCLD+JJYTcJwGhMFUmzJoLWYRi5T0vRO8DNIoKO5TW+h+d6V7Lip4C3Ajyb NhwA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=Kx6LzpL4; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a127si3188058pfa.151.2019.08.28.10.18.38; Wed, 28 Aug 2019 10:18:55 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@google.com header.s=20161025 header.b=Kx6LzpL4; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=REJECT sp=REJECT dis=NONE) header.from=google.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726605AbfH1RRk (ORCPT + 99 others); Wed, 28 Aug 2019 13:17:40 -0400 Received: from mail-pg1-f196.google.com ([209.85.215.196]:34108 "EHLO mail-pg1-f196.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726315AbfH1RRj (ORCPT ); Wed, 28 Aug 2019 13:17:39 -0400 Received: by mail-pg1-f196.google.com with SMTP id n9so92591pgc.1 for ; Wed, 28 Aug 2019 10:17:39 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=eaCLGK9FXrF5m526qncRT9MCFot6dMXXZVurplrH24k=; b=Kx6LzpL4IVpTiL/ZUYhKdYOjHsHia7ypBR/g3SYArnZfCLpub/KjdVvucucMjRdQ/v mF9FRIo1fYghuXp+7S3eRGP1DPggB1Gxv7ZdBkEzQot4rYuYQbSTzmSBt2lzISWEiwwk 9Vyi0dFYnaD4ozub6xKO7uR23Opqc/Bm0ZMugtYWpGtCvFU8O/WAGrGTaUGxBjP7gxAE qV0ybQVJWUmCEOCDUq8Q2OJR1uaw1YswQ79daDiZFpM2kLMm2S2quEMi2bMe6BJWWBOJ SgpOH5F13jsDPHUC7Jnm7S845UkX21L73Oe+YSdLpf6+mjQkpUWBLT44uc+FDEoTHBG7 0hqw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=eaCLGK9FXrF5m526qncRT9MCFot6dMXXZVurplrH24k=; b=A9xqCr9pOLNV97M4Oo+b4E2VBK+LwCeDD40ZGlmbgj7w+gRsl3dpc2kZYAivWl+wKl +XOmo4A//e4+QDDPSPxaA2z0x29OBpIfy9jhL0YPEZ28tK/+8orrCJg1vAoVuxtE/oT/ baX9UYVGMBHC5amqJRNInTEUUQKwuEQCDhYxk0R6PqYNr7hRBFtNJdD6huMc6+X1Sfw9 Qwvg24YozJ1l+5Vpl2+//AoniqkcgsR/aeFBx3krQB8bdO0eQ1tuVxHiglqDhlWse6os efS6HqURImvaoCZpZ+BE1/14/W1bu6L+M7MOrxippT43JiVkdhq4qENi2PSDWHF3qQj/ SMrA== X-Gm-Message-State: APjAAAWpIUrNGTj1UAnsGedvAIfosUrDzobtZtHgzLXgRmeuj+SmtWGf owlLenkwK+4p/EvPToxyopK6V0kt5MrY04Y7dEG2GA== X-Received: by 2002:a62:cec4:: with SMTP id y187mr5940897pfg.84.1567012658389; Wed, 28 Aug 2019 10:17:38 -0700 (PDT) MIME-Version: 1.0 References: <1566899033.o5acyopsar.astroid@bobo.none> <1566908344.dio7j9zb2h.astroid@bobo.none> <1566955930.uir50f8wen.astroid@bobo.none> In-Reply-To: <1566955930.uir50f8wen.astroid@bobo.none> From: Nick Desaulniers Date: Wed, 28 Aug 2019 10:17:27 -0700 Message-ID: Subject: Re: a bug in genksysms/CONFIG_MODVERSIONS w/ __attribute__((foo))? To: Nicholas Piggin Cc: Ben Hutchings , Masahiro Yamada , Ard Biesheuvel , Arnd Bergmann , Debian kernel maintainers , Greg KH , LKML , Michal Marek , Linus Torvalds , Will Deacon , Sandeep Patil , =?UTF-8?Q?Matthias_M=C3=A4nnich?= , Steve Muckle , Alessio Balsini , Stephen Hines Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Aug 27, 2019 at 7:26 PM Nicholas Piggin wrote: > > Ben Hutchings's on August 28, 2019 1:34 am: > > On Tue, 2019-08-27 at 22:42 +1000, Nicholas Piggin wrote: > >> Masahiro Yamada's on August 27, 2019 8:49 pm: > >> > Hi. > >> > > >> > On Tue, Aug 27, 2019 at 6:59 PM Nicholas Piggin wrote: > >> > > Nick Desaulniers's on August 27, 2019 8:57 am: > >> > > > On Mon, Aug 26, 2019 at 2:22 PM Nick Desaulniers > >> > > > wrote: > >> > > > > I'm looking into a linkage failure for one of our device kernels, and > >> > > > > it seems that genksyms isn't producing a hash value correctly for > >> > > > > aggregate definitions that contain __attribute__s like > >> > > > > __attribute__((packed)). > >> > > > > > >> > > > > Example: > >> > > > > $ echo 'struct foo { int bar; };' | ./scripts/genksyms/genksyms -d > >> > > > > Defn for struct foo == > >> > > > > Hash table occupancy 1/4096 = 0.000244141 > >> > > > > $ echo 'struct __attribute__((packed)) foo { int bar; };' | > >> > > > > ./scripts/genksyms/genksyms -d > >> > > > > Hash table occupancy 0/4096 = 0 > >> > > > > > >> > > > > I assume the __attribute__ part isn't being parsed correctly (looks > >> > > > > like genksyms is a lex/yacc based C parser). > >> > > > > > >> > > > > The issue we have in our out of tree driver (*sadface*) is basically a > >> > > > > EXPORT_SYMBOL'd function whose signature contains a packed struct. > >> > > > > > >> > > > > Theoretically, there should be nothing wrong with exporting a function > >> > > > > that requires packed structs, and this is just a bug in the lex/yacc > >> > > > > based parser, right? I assume that not having CONFIG_MODVERSIONS > >> > > > > coverage of packed structs in particular could lead to potentially > >> > > > > not-fun bugs? Or is using packed structs in exported function symbols > >> > > > > with CONFIG_MODVERSIONS forbidden in some documentation somewhere I > >> > > > > missed? > >> > > > > >> > > > Ah, looks like I'm late to the party: > >> > > > https://lwn.net/Articles/707520/ > >> > > > >> > > Yeah, would be nice to do something about this. > >> > > >> > modversions is ugly, so it would be great if we could dump it. > >> > > >> > > IIRC (without re-reading it all), in theory distros would be okay > >> > > without modversions if they could just provide their own explicit > >> > > versioning. They take care about ABIs, so they can version things > >> > > carefully if they had to change. > > > > Debian doesn't currently have any other way of detecting ABI changes > > (other than eyeballing diffs). > > > > I know there have been proposals of using libabigail for this instead, > > but I'm not sure how far those progressed. > > > >> > We have not provided any alternative solution for this, haven't we? > >> > > >> > In your patch (https://lwn.net/Articles/707729/), > >> > you proposed CONFIG_MODULE_ABI_EXPLICIT. > >> > >> Right, that was just my first proposal, but I am not confident that I > >> understood everybody's requirements. I don't think the distro people > >> had much time to to test things out. > >> > >> One possible shortcoming with that patch is no per-symbol version. The > >> distro may break an ABI for a security fix, but they don't want to break > >> all out of tree modules if it's an obscure ABI. > > > > Right, for example the KVM kABI is only meant for in-tree modules (like > > kvm_intel) and in Debian we do not change the "ABI version" and require > > rebuilding out-of-tree modules just because that ABI changes. > > Currently we maintain explicit lists of exported symbols and exporting > > modules for which we ignore ABI changes at build time. > > > >> The counter argument to > >> that is they should just rename the symbol in their kernel for such > >> cases, so I didn't implement it without somebody describing a good > >> requirement. > > [...] > > > > Sometimes it is just a single function that changes, but often a > > structure change can affect large numbers of functions. For example, > > if KVM adds a member to an operations struct that can indirectly change > > the ABI for most of its exported functions. We wouldn't want to change > > the ABI version but would still want to prevent loading mismatched kvm > > and kvm_intel versions. It would be a lot more work to change all of > > the affected function names. > > You could change just a single symbol name though :) > > > An alternative to symbol version matching that I think would work for > > us is: if a module's exports or imports match the "changes ignored" > > list then the module can only be loaded on the exact version of the > > kernel, otherwise it only needs to match the ABI version. I think that > > would avoid the need for carrying symbol versions, but we would still > > need a build-time ABI check and a way of flagging which symbols need > > the tighter version match. > > Just trying to think how best to express that. > > [ Aside, the whole symbol name resolution linking stuff does matching on > on any number of ~arbitrary strings that you can generate as you like, > and symbol tables are something that all existing tools and libs > understand. > > So I strongly favour using that as the back end for our "version" > resolution system _if at all possible_ rather than adding these extra > bits of crud that really just do the same thing. At least for a first > pass, I don't want to over-engineer things. > > Then it hopefully becomes a matter of adding some helper macros and > build facilities on top of that which can contain everyone's > requirements mostly within .config and perhaps a very small patch. > A bit more work with preprocessor macros etc is far preferable to > linking and loading "features" IMO] > > Back to your case, is it sufficient to have just an internal and an > external module version where the kernel provides both and your in-tree > modules match on the internal, others match on external? > > Thanks, > Nick > + some Android folks who are looking into libabigail (root thread for context: https://lore.kernel.org/lkml/CAKwvOdnJAApaUhTQs7w_VjSeYBQa0c-TNxRB4xPLi0Y0sOQMMQ@mail.gmail.com/) I'm only roughly aware of their issue/work, but hopefully they can collaborate and add their insights to a list of requirements we could collect, for any kind of replacement to CONFIG_MODVERSIONS/genksyms. I'm not sure if they were planning on presenting this work at Linux Plumber's Conference coming up or not, but I hope so. Hopefully that work will be a general solution for all kernel developers and distros, not just Android. This issue is now blocking my compiler upgrade, so I'm going to focus on just fixing the yacc based parser to understand __attribute__'s on struct/enum/union declarations. Speaking with Arnd about this issue, he came up with the test case: $ echo 'struct __attribute__((packed)) foo { char bar; int baz; };' | ./scripts/genksyms/genksyms -d Hash table occupancy 0/4096 = 0 $ echo 'struct __attribute__((packed)) foo { char bar; };' | ./scripts/genksyms/genksyms -d Hash table occupancy 0/4096 = 0 The implication being that because attributes don't parse correctly, these symbols get a hash value of 0x00 in Module.symvers, meaning we currently can't detect size changes of packed structs (or any other attribute-qualified type) and happily proceed loading modules with different ABIs than the kernel. I'll probably add some kind of script for unit tests for genksyms, too. Feels like overkill for something known to be broken where active replacements are being discussed, but it's blocking my compiler upgrade for our products, so I'm just going to fix it for now. -- Thanks, ~Nick Desaulniers