Received: by 2002:a05:6358:d09b:b0:dc:cd0c:909e with SMTP id jc27csp1356310rwb; Mon, 7 Nov 2022 00:14:23 -0800 (PST) X-Google-Smtp-Source: AMsMyM4xT0eJLwur7MZjOZWY1kKcCOzGJPSiy1xS7oyMt93ryD4/mQfuXFFkNeYOKKov++OwJpoe X-Received: by 2002:a62:2703:0:b0:56d:60f2:97 with SMTP id n3-20020a622703000000b0056d60f20097mr40991046pfn.10.1667808862902; Mon, 07 Nov 2022 00:14:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1667808862; cv=none; d=google.com; s=arc-20160816; b=yngzFU9Drt3FaQ0OckEa3YkDXls/sAdxlhejaV2Eh5gsDn6v+cB4KDCoyB3+y/37+T 9st9S9pKdhSAbJWVCVT3lxqyP41QiU/vJFga16TkJPyS8837xoPINGL31VMPA/eDB4PS PXifqcgN8lLG+VhEQjmLuZAsFRfunK9d+H5q9nymiChKuVbNsBg3h5cT+w+flssxNp53 IpoO2yn1DDRWFi6bHIBJnhqiaM8J90Ms0mhOlhSwtz4cNcBhh6VSzqPK29Emb4VaH9Rj 73fVKCUFF+ITskA4viRHKo/x24FABgXx5n0zQvaH/XONNtJrVAfZ/fzFBoduv2esX9Lb w78Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=0OUzueKP8neHZjpDNHhse6q9mUB1tcC9jucoTIaHS98=; b=QD/8+9QYevDtgN7MdfAkAapFX0F2S9MIwwMcqwbUWr5bRhKHK9l/AD8lTlFx0XaBs6 iphSpOkpk8lco5JgKuQk/nNutjI2Vl+odO8TaWHEQ8GkcqxKhrY9Nx2edzf9wjdswztU /twdbWWauQhL5PGX4SadsF/1/S+pIglJAOdala7f72DFKiKfk6lRs2nmpFX/kuSG7F3X JOOIxUFUqv/6U1XxNcTKf1HizSMgWW9wasHZtZHcmQigYyKMAEY9XjN/szAcKD8XrwZK 7TgIP1VwgMSXQ5C2AWAA3QBnpLEq+Vv9fNXUaTi2Whv7UydsTPpR9jDR1uzeqsdjDKuU wntQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=TPcmEahM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id u15-20020a63f64f000000b0043cad841eb1si9061176pgj.827.2022.11.07.00.14.10; Mon, 07 Nov 2022 00:14:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=TPcmEahM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231492AbiKGHzN (ORCPT + 95 others); Mon, 7 Nov 2022 02:55:13 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54288 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231327AbiKGHzM (ORCPT ); Mon, 7 Nov 2022 02:55:12 -0500 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id D7F9A13EA0; Sun, 6 Nov 2022 23:55:10 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 5D25FB80E1C; Mon, 7 Nov 2022 07:55:09 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 0E296C4347C; Mon, 7 Nov 2022 07:55:08 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1667807708; bh=a3rA/JgsiQE+foPTgW7Y1pBiyd1g7h75+8Rg2asoYGM=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=TPcmEahM0fLxuaUyKC2Eu32STAT7DbQdM0eLVaE4d+/dg+UR3ah5tzRDAsXTL0pmE Acy0JyEOiqty5VyPHnzS4ZEF2lUZvwC2KjofSBRb3YQHH+Wxk4Xr8wvX6Xg/xyGk0h D+8Gwe3DJgn8fj2oZrsU0seUDu2cxN6S76eytwf6MnhW4qpxaW2yxvRieg8aruIfYz Cg95pw9jX6LHeEKRO+vVWRdgzPBCIvR8XrhOZjyuIWIAi5A0rcVGZ9Hw5p5WtAEbUQ 9Wbf22iEk/yDU4FoOWbkn0RhFxT3R844zriiSI36xjR+K5vd4iJO0yZTjtPH6AUxif Gt3hJXAK0DAwQ== Received: by mail-lj1-f179.google.com with SMTP id d20so14911640ljc.12; Sun, 06 Nov 2022 23:55:07 -0800 (PST) X-Gm-Message-State: ACrzQf05bVBzDYNKu3g+AFg6rWZxoSZM4O5+NEUPffHmSz2HOAZnxxZA zAabpUTmA1Ja4shol8AxxJEChVaIYMpEsGx0Tuk= X-Received: by 2002:a05:651c:516:b0:277:2428:3682 with SMTP id o22-20020a05651c051600b0027724283682mr5412055ljp.291.1667807705949; Sun, 06 Nov 2022 23:55:05 -0800 (PST) MIME-Version: 1.0 References: <3acf1cc7a974cb4fb9b77b39311c6714@tfwno.gf> <2f43e1e583a92e1592c48a15a763e1f7@tfwno.gf> In-Reply-To: From: Ard Biesheuvel Date: Mon, 7 Nov 2022 08:54:54 +0100 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: Bug: kexec on Lenovo ThinkPad T480 disables EFI mode To: Dave Young Cc: ns@tfwno.gf, Baoquan He , Eric Biederman , kexec@lists.infradead.org, linux-kernel@vger.kernel.org, linux-efi@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-7.1 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_HI, SPF_HELO_NONE,SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 7 Nov 2022 at 08:40, Dave Young wrote: > > On Mon, 7 Nov 2022 at 15:36, Dave Young wrote: > > > > Hi Ard, > > > > On Mon, 7 Nov 2022 at 15:30, Ard Biesheuvel wrote: > > > > > > On Mon, 7 Nov 2022 at 07:55, Dave Young wrote: > > > > > > > > Hi, > > > > > > > > On Sat, 5 Nov 2022 at 22:16, wrote: > > > > > > > > > > On 2022-11-05 05:49, Dave Young wrote: > > > > > > Baoquan, thanks for cc me. > > > > > > > > > > > > On Sat, 5 Nov 2022 at 11:10, Baoquan He wrote: > > > > > >> > > > > > >> Add Dave to CC > > > > > >> > > > > > >> On 10/28/22 at 01:02pm, ns@tfwno.gf wrote: > > > > > >> > Greetings, > > > > > >> > > > > > > >> > I've been hitting a bug on my Lenovo ThinkPad T480 where kexecing will > > > > > >> > cause EFI mode (if that's the right term for it) to be unconditionally > > > > > >> > disabled, even when not using the --noefi option to kexec. > > > > > >> > > > > > > >> > What I mean by "EFI mode" being disabled, more than just EFI runtime > > > > > >> > services, is that basically nothing about the system's EFI is visible > > > > > >> > post-kexec. Normally you have a message like this in dmesg when the > > > > > >> > system is booted in EFI mode: > > > > > >> > > > > > > >> > [ 0.000000] efi: EFI v2.70 by EDK II > > > > > >> > [ 0.000000] efi: SMBIOS=0x7f98a000 ACPI=0x7fb7e000 ACPI 2.0=0x7fb7e014 > > > > > >> > MEMATTR=0x7ec63018 > > > > > >> > (obviously not the real firmware of the machine I'm talking about, but I > > > > > >> > can also send that if it would be of any help) > > > > > >> > > > > > > >> > No such message pops up in my dmesg as a result of this bug, & this > > > > > >> > causes some fallout like being unable to find the system's DMI > > > > > >> > information: > > > > > >> > > > > > > >> > <6>[ 0.000000] DMI not present or invalid. > > > > > >> > > > > > > >> > The efivarfs module also fails to load with -ENODEV. > > > > > >> > > > > > > >> > I've tried also booting with efi=runtime explicitly but it doesn't > > > > > >> > change anything. The kernel still does not print the name of the EFI > > > > > >> > firmware, DMI is still missing, & efivarfs still fails to load. > > > > > >> > > > > > > >> > I've been using the kexec_load syscall for all these tests, if it's > > > > > >> > important. > > > > > >> > > > > > > >> > Also, to make it very clear, all this only ever happens post-kexec. When > > > > > >> > booting straight from UEFI (with the EFI stub), all the aforementioned > > > > > >> > stuff that fails works perfectly fine (i.e. name of firmware is printed, > > > > > >> > DMI is properly found, & efivarfs loads & mounts just fine). > > > > > >> > > > > > > >> > This is reproducible with a vanilla 6.1-rc2 kernel. I've been trying to > > > > > >> > bisect it, but it seems like it goes pretty far back. I've got vanilla > > > > > >> > mainline kernel builds dating back to 5.17 that have the exact same > > > > > >> > issue. It might be worth noting that during this testing, I made sure > > > > > >> > the version of the kernel being kexeced & the kernel kexecing were the > > > > > >> > same version. It may not have been a problem in older kernels, but that > > > > > >> > would be difficult to test for me (a pretty important driver for this > > > > > >> > machine was only merged during v5.17-rc4). So it may not have been a > > > > > >> > regression & just a hidden problem since time immemorial. > > > > > >> > > > > > > >> > I am willing to test any patches I may get to further debug or fix > > > > > >> > this issue, preferably based on the current state of torvalds/linux.git. > > > > > >> > I can build & test kernels quite a few times per day. > > > > > >> > > > > > > >> > I can also send any important materials (kernel config, dmesg, firmware > > > > > >> > information, so on & so forth) on request. I'll also just mention I'm > > > > > >> > using kexec-tools 2.0.24 upfront, if it matters. > > > > > > > > > > > > Can you check the efi runtime in sysfs: > > > > > > ls /sys/firmware/efi/runtime-map/ > > > > > > > > > > > > If nothing then maybe you did not enable CONFIG_EFI_RUNTIME_MAP=y, it > > > > > > is needed for kexec UEFI boot on x86_64. > > > > > > > > > > Oh my, it really is that simple. > > > > > > > > > > Indeed, enabling this in the pre-kexec kernel fixes it all up. I had > > > > > blindly disabled it in my quest to downsize the pre-kexec kernel to > > > > > reduce boot time (it only runs a bootloader). In hindsight, the firmware > > > > > drivers section is not really a good section to tweak on a whim. > > > > > > > > > > I'm terribly sorry to have taken your time to "fix" this "bug". But I > > > > > must ask, is there any reason why this is a visible config option, or at > > > > > least not gated behind CONFIG_EXPERT? drivers/firmware/efi/runtime-map.c > > > > > is pretty tiny, & considering it depends on CONFIG_KEXEC_CORE, one > > > > > probably wants to have kexec work properly if they can even enable it. > > > > > > > > Glad to know it works with the .config tweaking. I can not recall any > > > > reason for that though. > > > > > > > > Since it sits in the efi code path, let's see how Ard thinks about > > > > your proposal. > > > > > > > > > > I don't understand why EFI_RUNTIME_MAP should depend on KEXEC_CORE at > > > all: it is documented as a feature that can be enabled for debugging > > > as well, and kexec does not work as expected without it. > > > > Probably debugging only mentioned in text, but not been considered in > > the kconfig logic :( > > > > > > > > Should we just change it like this perhaps? > > > > > > --- a/drivers/firmware/efi/Kconfig > > > +++ b/drivers/firmware/efi/Kconfig > > > @@ -28,8 +28,8 @@ config EFI_VARS_PSTORE_DEFAULT_DISABLE > > > > > > config EFI_RUNTIME_MAP > > > bool "Export efi runtime maps to sysfs" > > > - depends on X86 && EFI && KEXEC_CORE > > > - default y > > > + depends on X86 && EFI > > > + default KEXEC_CORE > > > help > > > > > > and maybe add an 'if EXPERT' so that the option is only visible to > > > modify when CONFIG_EXPERT=y ? > > > > Above changes look good to me. > > > > > > > > In any case, I intend to move this code into arch/x86 as well, so I'll > > > have a couple of patches out shortly. > > > > That would be better since it is X86 only. Thanks, Ard. > > Hmm, before doing that, do you think it is useful for debugging > purposes? That could be a reason to sit in efi code instead of x86 .. > This code was only ever enabled on x86, and on ARM/arm64, we can capture the memory map via efi=debug on any kernel build, and capture the virtual mappings using PTDUMP (which also gives us the exact attributes for each mapped region) So I don't think it has that much value on non-x86 tbh.