From: Ard Biesheuvel
Date: Wed, 2 Nov 2022 10:21:35 +0100
Subject: Re: [RFC PATCH] kbuild: pass objects instead of archives to linker
To: Masahiro Yamada
Cc: linux-kbuild@vger.kernel.org, Jiri Slaby, Michael Matz, Kees Cook,
    Sami Tolvanen, Fangrui Song, Michal Marek, Nathan Chancellor,
    Nick Desaulniers, Tom Rix, linux-kernel@vger.kernel.org,
    llvm@lists.linux.dev
References: <20221102091308.11568-1-masahiroy@kernel.org>
In-Reply-To: <20221102091308.11568-1-masahiroy@kernel.org>
X-Mailing-List: linux-kernel@vger.kernel.org

Hello Masahiro,

On Wed, 2 Nov 2022 at 10:13, Masahiro Yamada wrote:
>
> This is an experimental patch, driven by the feedback from Jiri Slaby
> and Michael Matz. [1]
>
> Michael Matz says:
> "I know of no linker (outside LTO-like modes) that processes
> archives in a different order than first-to-last-member (under
> whole-archive), but that's not guaranteed anywhere. So relying on
> member-order within archives is always brittle."
>
> It is pretty easy to pass the list of objects instead of a thin archive
> because the linker supports the '@file' syntax, where command line
> arguments are read from 'file'.
>

Can you explain which problem is solved by doing this?

If a working kernel can only be produced when each object is linked in
the order it appears in the archive, I think we have bigger problems
that need solving regardless.

And for the .head.text objects that need to appear at the start of the
binary image, I think the reported issue with __head annotated C
functions on x86 needs to be addressed by getting rid of __head
entirely (which seems to have been introduced without proper
justification).

> Without this patch, the linker receives
>
>   --whole-archive vmlinux.a --no-whole-archive
>
> With this patch, the linker will receive
>
>   @vmlinux.order
>
> Here, vmlinux.order is a text file that lists built-in objects in the
> correct link order.
>
> I am not a toolchain expert. I just want to know if this makes any
> difference from the linker perspective and from (non-upstreamed) GCC-LTO
> perspective.
>
> (I know this patch does not work for Clang LTO because I did not touch
> scripts/generate_initcall_order.pl)
>
> This patch may be unneeded because more correct patches were submitted [2]
> but I am still curious about "thin archive vs direct object list".
>
> [1]: https://lore.kernel.org/linux-kbuild/alpine.LSU.2.20.2210251210140.29399@wotan.suse.de/
> [2]: https://lore.kernel.org/all/20221101161529.1634188-1-alexandr.lobakin@intel.com/
>
> Signed-off-by: Masahiro Yamada
> ---
>
>  Makefile                                    | 21 ++++++++++-----------
>  scripts/Makefile.modpost                    |  5 +++--
>  scripts/Makefile.vmlinux_o                  |  6 +++---
>  scripts/clang-tools/gen_compile_commands.py | 21 ++++++++++++++++++++-
>  scripts/link-vmlinux.sh                     |  8 ++++----
>  5 files changed, 40 insertions(+), 21 deletions(-)
>
> diff --git a/Makefile b/Makefile
> index e9e7eff906a5..511484a3dacb 100644
> --- a/Makefile
> +++ b/Makefile
> @@ -1213,19 +1213,18 @@ quiet_cmd_autoksyms_h = GEN $@
>  $(autoksyms_h):
>         $(call cmd,autoksyms_h)
>
> -# '$(AR) mPi' needs 'T' to workaround the bug of llvm-ar <= 14
> -quiet_cmd_ar_vmlinux.a = AR $@
> -      cmd_ar_vmlinux.a = \
> -       rm -f $@; \
> -       $(AR) cDPrST $@ $(KBUILD_VMLINUX_OBJS); \
> -       $(AR) mPiT $$($(AR) t $@ | sed -n 1p) $@ $$($(AR) t $@ | grep -F -f $(srctree)/scripts/head-object-list.txt)
> +quiet_cmd_vmlinux_order = GEN $@
> +      cmd_vmlinux_order = \
> +       { $(foreach m, $(KBUILD_VMLINUX_OBJS), $(AR) t $m;) :; } > $(tmp-target) ; \
> +       grep -F -f $(srctree)/scripts/head-object-list.txt $(tmp-target) > $@; \
> +       grep -F -f $(srctree)/scripts/head-object-list.txt $(tmp-target) -v >> $@
>
> -targets += vmlinux.a
> -vmlinux.a: $(KBUILD_VMLINUX_OBJS) scripts/head-object-list.txt autoksyms_recursive FORCE
> -       $(call if_changed,ar_vmlinux.a)
> +targets += vmlinux.order
> +vmlinux.order: $(KBUILD_VMLINUX_OBJS) scripts/head-object-list.txt autoksyms_recursive FORCE
> +       $(call if_changed,vmlinux_order)
>
>  PHONY += vmlinux_o
> -vmlinux_o: vmlinux.a $(KBUILD_VMLINUX_LIBS)
> +vmlinux_o: vmlinux.order $(KBUILD_VMLINUX_LIBS)
>         $(Q)$(MAKE) -f $(srctree)/scripts/Makefile.vmlinux_o
>
>  vmlinux.o modules.builtin.modinfo modules.builtin: vmlinux_o
> @@ -2037,7 +2036,7 @@ quiet_cmd_gen_compile_commands = GEN $@
>        cmd_gen_compile_commands = $(PYTHON3) $< -a $(AR) -o $@ $(filter-out $<, $(real-prereqs))
>
>  $(extmod_prefix)compile_commands.json: scripts/clang-tools/gen_compile_commands.py \
> -       $(if $(KBUILD_EXTMOD),, vmlinux.a $(KBUILD_VMLINUX_LIBS)) \
> +       $(if $(KBUILD_EXTMOD),, vmlinux.order $(KBUILD_VMLINUX_LIBS)) \
>         $(if $(CONFIG_MODULES), $(MODORDER)) FORCE
>         $(call if_changed,gen_compile_commands)
>
> diff --git a/scripts/Makefile.modpost b/scripts/Makefile.modpost
> index e41dee64d429..1d6847da39bd 100644
> --- a/scripts/Makefile.modpost
> +++ b/scripts/Makefile.modpost
> @@ -70,12 +70,13 @@ quiet_cmd_vmlinux_objs = GEN $@
>         for f in $(real-prereqs); do \
>                 case $${f} in \
>                 *libgcc.a) ;; \
> -               *) $(AR) t $${f} ;; \
> +               *.a) $(AR) t $${f} ;; \
> +               *) cat $${f} ;; \
>                 esac \
>         done > $@
>
>  targets += .vmlinux.objs
> -.vmlinux.objs: vmlinux.a $(KBUILD_VMLINUX_LIBS) FORCE
> +.vmlinux.objs: vmlinux.order $(KBUILD_VMLINUX_LIBS) FORCE
>         $(call if_changed,vmlinux_objs)
>
>  vmlinux.o-if-present := $(wildcard vmlinux.o)
> diff --git a/scripts/Makefile.vmlinux_o b/scripts/Makefile.vmlinux_o
> index 0edfdb40364b..6eb07f2bb39f 100644
> --- a/scripts/Makefile.vmlinux_o
> +++ b/scripts/Makefile.vmlinux_o
> @@ -18,7 +18,7 @@ quiet_cmd_gen_initcalls_lds = GEN $@
>         $(PERL) $(real-prereqs) > $@
>
>  .tmp_initcalls.lds: $(srctree)/scripts/generate_initcall_order.pl \
> -               vmlinux.a $(KBUILD_VMLINUX_LIBS) FORCE
> +               vmlinux.order $(KBUILD_VMLINUX_LIBS) FORCE
>         $(call if_changed,gen_initcalls_lds)
>
>  targets := .tmp_initcalls.lds
> @@ -48,7 +48,7 @@ quiet_cmd_ld_vmlinux.o = LD $@
>        cmd_ld_vmlinux.o = \
>         $(LD) ${KBUILD_LDFLAGS} -r -o $@ \
>         $(addprefix -T , $(initcalls-lds)) \
> -       --whole-archive vmlinux.a --no-whole-archive \
> +       @vmlinux.order \
>         --start-group $(KBUILD_VMLINUX_LIBS) --end-group \
>         $(cmd_objtool)
>
> @@ -57,7 +57,7 @@ define rule_ld_vmlinux.o
>         $(call cmd,gen_objtooldep)
>  endef
>
> -vmlinux.o: $(initcalls-lds) vmlinux.a $(KBUILD_VMLINUX_LIBS) FORCE
> +vmlinux.o: $(initcalls-lds) vmlinux.order $(KBUILD_VMLINUX_LIBS) FORCE
>         $(call if_changed_rule,ld_vmlinux.o)
>
>  targets += vmlinux.o
> diff --git a/scripts/clang-tools/gen_compile_commands.py b/scripts/clang-tools/gen_compile_commands.py
> index d800b2c0af97..c8ba9f084bd0 100755
> --- a/scripts/clang-tools/gen_compile_commands.py
> +++ b/scripts/clang-tools/gen_compile_commands.py
> @@ -57,7 +57,7 @@ def parse_arguments():
>      parser.add_argument('-a', '--ar', type=str, default='llvm-ar', help=ar_help)
>
>      paths_help = ('directories to search or files to parse '
> -                  '(files should be *.o, *.a, or modules.order). '
> +                  '(files should be *.o, *.a, or *.order). '
>                    'If nothing is specified, the current directory is searched')
>      parser.add_argument('paths', type=str, nargs='*', help=paths_help)
>
> @@ -124,6 +124,23 @@ def cmdfiles_for_a(archive, ar):
>          yield to_cmdfile(obj)
>
>
> +def cmdfiles_for_vmlinux_order(vmlinux_order):
> +    """Generate the iterator of .cmd files associated with the vmlinux.order.
> +
> +    Parse the given vmlinux.order, and yield every .cmd file used to build the
> +    contained modules.
> +
> +    Args:
> +        vmlinux_order: The vmlinux.order file to parse
> +
> +    Yields:
> +        The path to every .cmd file found
> +    """
> +    with open(vmlinux_order) as f:
> +        for line in f:
> +            yield to_cmdfile(line.rstrip())
> +
> +
>  def cmdfiles_for_modorder(modorder):
>      """Generate the iterator of .cmd files associated with the modules.order.
>
> @@ -203,6 +220,8 @@ def main():
>              cmdfiles = cmdfiles_in_dir(path)
>          elif path.endswith('.a'):
>              cmdfiles = cmdfiles_for_a(path, ar)
> +        elif path.endswith('vmlinux.order'):
> +            cmdfiles = cmdfiles_for_vmlinux_order(path)
>          elif path.endswith('modules.order'):
>              cmdfiles = cmdfiles_for_modorder(path)
>          else:
> diff --git a/scripts/link-vmlinux.sh b/scripts/link-vmlinux.sh
> index 918470d768e9..617ed443e377 100755
> --- a/scripts/link-vmlinux.sh
> +++ b/scripts/link-vmlinux.sh
> @@ -3,15 +3,15 @@
>  #
>  # link vmlinux
>  #
> -# vmlinux is linked from the objects in vmlinux.a and $(KBUILD_VMLINUX_LIBS).
> -# vmlinux.a contains objects that are linked unconditionally.
> +# vmlinux is linked from the objects in vmlinux.order and $(KBUILD_VMLINUX_LIBS).
> +# vmlinux.order is a text file that contains objects that are linked unconditionally.
>  # $(KBUILD_VMLINUX_LIBS) are archives which are linked conditionally
>  # (not within --whole-archive), and do not require symbol indexes added.
>  #
>  # vmlinux
>  #   ^
>  #   |
> -#   +--< vmlinux.a
> +#   +--< vmlinux.order
>  #   |
>  #   +--< $(KBUILD_VMLINUX_LIBS)
>  #   |    +--< lib/lib.a + more
> @@ -65,7 +65,7 @@ vmlinux_link()
>                 objs=vmlinux.o
>                 libs=
>         else
> -               objs=vmlinux.a
> +               objs=@vmlinux.order
>                 libs="${KBUILD_VMLINUX_LIBS}"
>         fi
>
> --
> 2.34.1
>
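
For reference, the ordering that the two grep invocations in the quoted
cmd_vmlinux_order rule produce looks like a stable partition: lines matching
any entry of scripts/head-object-list.txt come first, everything else follows,
and each group keeps its original relative order. A minimal Python sketch of
that idea is below. The function name order_objects and the sample object
paths are hypothetical and chosen only for illustration; patterns are treated
as plain substrings, which is how grep -F matches them.

```python
def order_objects(objects, head_patterns):
    """Return `objects` with head objects first.

    Mirrors `grep -F -f patterns list` followed by `grep -F -f patterns -v list`:
    an object counts as a head object if any pattern occurs in it as a
    substring. Relative order inside each group is preserved.
    """
    def is_head(obj):
        return any(pattern in obj for pattern in head_patterns)

    head = [obj for obj in objects if is_head(obj)]
    rest = [obj for obj in objects if not is_head(obj)]
    return head + rest


# Hypothetical sample input, not taken from a real build.
sample_objects = [
    "init/main.o",
    "arch/x86/kernel/head_64.o",
    "kernel/fork.o",
    "arch/x86/kernel/head64.o",
]
sample_patterns = ["arch/x86/kernel/head_64.o", "arch/x86/kernel/head64.o"]

ordered = order_objects(sample_objects, sample_patterns)

# One object per line, the shape a linker '@file' argument list would have.
response_file_text = "\n".join(ordered) + "\n"
print(response_file_text, end="")
```

Note that the head objects come out in the order they appear in the input
list, not in the order of the pattern file, so the result still depends on
the order of $(KBUILD_VMLINUX_OBJS).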