Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp670266ybb; Fri, 10 Apr 2020 07:49:01 -0700 (PDT) X-Google-Smtp-Source: APiQypKlthGBoiIUgv9DwuocNA955qotfprehDRa2zESPhSszllWGFr+6lexXKFu4grTxGzZ7DnV X-Received: by 2002:a05:6214:1933:: with SMTP id es19mr5449687qvb.186.1586530141655; Fri, 10 Apr 2020 07:49:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586530141; cv=none; d=google.com; s=arc-20160816; b=qSvi4r1xzz0LANVgby4pCdGvSmALqt650ccLpAIEZqm+Rv7Dzrd9ARzcxYUt2pYCZs stuWYt++r8t0oawNYhKEHuIUgzeUZafKarv3BPYjZiag3vuo74e3uCXqMl0m+gZKxOi3 mxFrYndP80HPB2TQGINFHn1BhDsoO2akBBpTlNL4sscfIiGAfFrcseF5BFw3kbuzef63 MK6VI8OtarXIc5m1vdw6QWcVPNEojGcgicQbRt6FiymkRilOx+wgy591Sp0/Ii+V3R7u kUljItCBKnR+G1u9AguUD0JyMBMXG4YHYU0dfpBmH6n9yV/XgJoA6+/kNW/ryycszhN+ yocQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:date:from :dkim-signature; bh=8vE9g+7G14NTraPEo7qqZPXqFetbj6StRkDYo8Di7S4=; b=wdQOlHC1oAjHmsemhXcgXC40nOPKxmlRi5sif5NqgkJ3jG7/TlpCOSiYOKKXFFdTzb NjS7LTnSjDBpDyxfLblXuSDhHqETy1R7HwUQvnzwZwT3ZS3Jz8q22+NKIKBzZtfVzam4 6mrOHMDUb7EJQZI86zpaIMAYJS/zGQVf40qGrNunwu6oQCSV3LqlQmwxPzVW88neEZbq HHqkPYnbg+sb4Vpq6YRkIOI67/YYUiGjO9o9pCq5Xv2v3lEJzUDtiGdfTGDicFUwZlwM GSMqr1xTpwFdGRQcyKwU0iwKUcKnZVOd5JhqsTow/hU6GOsAPZFXbrdWELWEKONRtGo/ xw1g== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=ItJsMQ8U; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id b24si1265938qtk.172.2020.04.10.07.48.43; Fri, 10 Apr 2020 07:49:01 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=ItJsMQ8U; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726626AbgDJOsC (ORCPT + 99 others); Fri, 10 Apr 2020 10:48:02 -0400 Received: from mail-qk1-f195.google.com ([209.85.222.195]:36476 "EHLO mail-qk1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726080AbgDJOsB (ORCPT ); Fri, 10 Apr 2020 10:48:01 -0400 Received: by mail-qk1-f195.google.com with SMTP id l25so2329164qkk.3; Fri, 10 Apr 2020 07:48:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:from:date:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to; bh=8vE9g+7G14NTraPEo7qqZPXqFetbj6StRkDYo8Di7S4=; b=ItJsMQ8UP9u/v19m5LXrQk14pb/p5hjHsk+q1dpzGdrKs5FxywhBjq6HADivqPY30M tbTBIj5r4WAkpSTP4ai3CxPQ8gEHp0FqsSh57HO9nWBPEKIe30MerjxWLLWX9JP4uf92 uVvoDr+aLoV1lFbhqRvZmYqgJXXNjD7gEwGIyGAQDWSqZ1cQP8oDy3bkPzRIMShT8YGt Wxp5rtDLOl1nyX1a0kjZGHTUJJ7eOJ6kwzqd0R7Ab+eXNm40/JVQ5NifVtp70g3fiz7Z 0UtXMnbvgz1JXRnl/GnW5toD8TfO3t3Vq6wXF97w0T4CpMCwBonpMhbC20V289cYRahA LYEg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:from:date:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to; bh=8vE9g+7G14NTraPEo7qqZPXqFetbj6StRkDYo8Di7S4=; b=i7hmW9klZr7p4syQXuEOYgx/njKsBsHlLnwJRM/VVVcWujBHa1/LGdeZ6xyc0EZICG i7YNlYRg9f4iA9vFuQBqRqqXYXrpZ53sexhycSG8XhLW2VrNXqrI7DqOe/lRBtv1ecXq UssRrflVuVI900PPntogqG0b0dHrwkyjJFvXrhPHxb2ebY0L073qbZT2LzX+tJqEQCD/ BpekSoGUUF5OkHASVAmqwaaK+NIQTHIQW6hzqriVzBtA/oIAP3cnAMF4qGG4umdr9S31 NJ1VrBMgRg0wunU+eZvW6il3QQWD0ONA8StE9bGZ8j49xkX0Pndx189Z/tTQX57b/WCs /ibA== X-Gm-Message-State: AGi0PuZqcDbQraGizEH/WV/N5oZA1vZBT+d7fImDNJLQ7BgPUpkLBrLa 5xzCZ76XCdNu6ceuZ45Ytx4= X-Received: by 2002:a37:a702:: with SMTP id q2mr4301032qke.150.1586530081092; Fri, 10 Apr 2020 07:48:01 -0700 (PDT) Received: from rani.riverdale.lan ([2001:470:1f07:5f3::b55f]) by smtp.gmail.com with ESMTPSA id d24sm1731426qkl.8.2020.04.10.07.48.00 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 10 Apr 2020 07:48:00 -0700 (PDT) From: Arvind Sankar X-Google-Original-From: Arvind Sankar Date: Fri, 10 Apr 2020 10:47:58 -0400 To: Arvind Sankar Cc: Ard Biesheuvel , Dave Young , pjones@redhat.com, daniel.kiper@oracle.com, Leif Lindholm , Borislav Petkov , Sergey Shatunov , hpa@zytor.com, Linux Kernel Mailing List , mingo@redhat.com, Thomas Gleixner , X86 ML , linux-efi , initramfs@vger.kernel.org, Donovan Tremura , Harald Hoyer Subject: Re: [PATCH 1/2] efi/x86: Move efi stub globals from .bss to .data Message-ID: <20200410144758.GC936997@rani.riverdale.lan> References: <20200406180614.429454-1-nivedita@alum.mit.edu> <20200408074334.GA21886@dhcp-128-65.nay.redhat.com> <20200409143910.GA727557@rani.riverdale.lan> <20200409163530.GA785575@rani.riverdale.lan> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20200409163530.GA785575@rani.riverdale.lan> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Apr 09, 2020 at 12:35:30PM -0400, Arvind Sankar wrote: > On Thu, Apr 09, 2020 at 04:47:55PM +0200, Ard Biesheuvel wrote: > > On Thu, 9 Apr 2020 at 16:39, Arvind Sankar wrote: > > > > > > On Wed, Apr 08, 2020 at 09:49:15AM +0200, Ard Biesheuvel wrote: > > > > (add Peter, Leif and Daniel) > > > > > > > > On Wed, 8 Apr 2020 at 09:43, Dave Young wrote: > > > > > > > > > > On 04/06/20 at 02:06pm, Arvind Sankar wrote: > > > > > > Commit > > > > > > > > > > > > 3ee372ccce4d ("x86/boot/compressed/64: Remove .bss/.pgtable from > > > > > > bzImage") > > > > > > > > > > > > removed the .bss section from the bzImage. > > > > > > > > > > > > However, while a PE loader is required to zero-initialize the .bss > > > > > > section before calling the PE entry point, the EFI handover protocol > > > > > > does not currently document any requirement that .bss be initialized by > > > > > > the bootloader prior to calling the handover entry. > > > > > > > > > > > > When systemd-boot is used to boot a unified kernel image [1], the image > > > > > > is constructed by embedding the bzImage as a .linux section in a PE > > > > > > executable that contains a small stub loader from systemd together with > > > > > > additional sections and potentially an initrd. As the .bss section > > > > > > within the bzImage is no longer explicitly present as part of the file, > > > > > > it is not initialized before calling the EFI handover entry. > > > > > > Furthermore, as the size of the embedded .linux section is only the size > > > > > > of the bzImage file itself, the .bss section's memory may not even have > > > > > > been allocated. > > > > > > > > > > I did not follow up the old report, maybe I missed something. But not > > > > > sure why only systemd-boot is mentioned here. I also have similar issue > > > > > with early efi failure. With these two patches applied, it works well > > > > > then. > > > > > > > > > > BTW, I use Fedora 31 + Grub2 > > > > > > > > > > > > > OK, so I take it this means that GRUB's PE/COFF loader does not > > > > zero-initialize BSS either? Does it honor the image size in memory if > > > > it exceeds the file size? > > > > > > Dave, that comment was because the previous report was for systemd-boot > > > stub. > > > > > > Ard, should I revise the commit message to make it clear it's not > > > restricted to systemd-boot but anything using handover entry may be > > > affected? Maybe just a "for example, when systemd-boot..." and then a > > > line to say grub2 with the EFI stub patches is also impacted? > > > > > > > Well, the fact the /some/ piece of software is used in production that > > relies on the ill-defined EFI handover protocol is sufficient > > justification, so I don't think it is hugely important to update it. > > > > > https://src.fedoraproject.org/rpms/grub2/blob/f31/f/0001-Add-support-for-Linux-EFI-stub-loading.patch#_743 > > > > > > + kernel_mem = grub_efi_allocate_pages_max(lh.pref_address, > > > + BYTES_TO_PAGES(lh.init_size)); > > > > > > Looking at this, grub does allocate init_size for the image, but it > > > doesn't zero it out. > > > > > > This call also looks wrong to me though. It allocates at max address of > > > pref_address, which, if it succeeds, will guarantee that the kernel gets > > > loaded entirely below pref_address == LOAD_PHYSICAL_ADDR. In native > > > mode, if it weren't for the EFI stub copying the kernel again, this > > > would cause the startup code to relocate the kernel into unallocated > > > memory. On a mixed-mode boot, this would cause the early page tables > > > setup prior to transitioning to 64-bit mode to be in unallocated memory > > > and potentially get clobbered by the EFI stub. > > > > > > The first try to allocate pref_address should be calling > > > grub_efi_allocate_fixed instead. > > > > Thanks Arvind. I'm sure the Fedora/RedHat folks on cc should be able > > to get these logged somewhere. > > Ok. For dracut, the process for building the unified kernel image needs > a check to make sure the kernel can fit in the space provided for it -- > there is 16MiB of space and the distro bzImage's are up to 10-11MiB in > size, so there's some slack left at present. > > Additionally, in mixed-mode, the unified kernel images are quite likely > to end up with early pgtables from startup_32 clobbering the initrd, > independently of the recent kernel changes. Hopefully no-one actually > uses these in mixed-mode. The grub EFI handover entry patch is busted in mixed-mode for another reason -- while it allocates init_size, it doesn't use the correct alignment. I tested on a Debian buster VM in mixed-mode (that was the one I was able to get to install/boot with mixed-mode), and the early pagetable from startup_32 ends up in unallocated memory due to the rounding up of the bzImage address to account for kernel alignment. This would be an existing problem prior to these patches. Should we try to handle this in the kernel? At some point KASLR is going to pick that memory for the kernel and overwrite the pagetables I would think, resulting in sporadic crashes that are almost unreproducible.