Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp865331ybv; Wed, 5 Feb 2020 16:13:27 -0800 (PST) X-Google-Smtp-Source: APXvYqwiwyiur4Mf6HPHEuk+Fwj4hVKdDdQbv/K4e/fC/MEHQBxL359SwjSVlzqwfWS8ho+gqu51 X-Received: by 2002:aca:3109:: with SMTP id x9mr5137308oix.107.1580948007386; Wed, 05 Feb 2020 16:13:27 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1580948007; cv=none; d=google.com; s=arc-20160816; b=YUro/qg7GdDUC8L20gWA7685Sxg+z9+iVMqYcVQO9ez4untHWm6OL3ZKXRzJettDyw VUIprJVN1zQZfXpd3i6uSV0gGrgvAwJqjg3CNDVzZJSEoEoD8pWPSrp5zDEPreBkZQ6N TCjlGZaOLafq4JcXbaYaXKCnXe9npESBoMhEk6wvIct4PPPLpirMBil7T5dqf72vyz+9 I/evQgsuHQm4lZ+qgo6q+dUP0fFY6EPRyYPlU388uPr59NS5fVDz4CUQ05n73n1BayMm mawxnLfRcxtuqa73l8S46OWBG7+JbaYEei0qbKuqUTC+cWkVF8vGoDmV86FmRLUixwWP ANlQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:date:from:dkim-signature; bh=lgPcWN6p4BNWezBrExW07TYU0pjpDkLuT/nBxzOZrKA=; b=bIO6BhajlqwNfipxQt3V3MCnHV4ibAksSBXWnOBhibWnDyQrx2dM+ASKlUNs49sGxf ahIGFmsPBYH7TvxReC6/o2OJBDOZ/V52sKkpEWUxdtZLAChwqp5vbQTKQ7pR6B4wm45/ 2OhaHyHP3O6TFNlc7L1yLrKERaoBl6Q4lbIdAR+JkyTnfZubjxD5dv2QyvYWkgcM36pV UCvZQ728ZPwuI9FEErOh2PiV9I0i5F11Z1fWI3ehWy19LY3WLTdZDCQ6r1P14TNCyaqo appn70uzF+heDrypD8bxBEWtFZ48W5MVHMoxl6GrLKme2KaZGb1a8NewQ8vZbHYIiMXD BfdA== ARC-Authentication-Results: i=1; mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=FZm3loSm; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e8si1038821oie.96.2020.02.05.16.13.12; Wed, 05 Feb 2020 16:13:27 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=fail header.i=@gmail.com header.s=20161025 header.b=FZm3loSm; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727570AbgBFALJ (ORCPT + 99 others); Wed, 5 Feb 2020 19:11:09 -0500 Received: from mail-qt1-f195.google.com ([209.85.160.195]:40478 "EHLO mail-qt1-f195.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727149AbgBFALJ (ORCPT ); Wed, 5 Feb 2020 19:11:09 -0500 Received: by mail-qt1-f195.google.com with SMTP id v25so3118843qto.7 for ; Wed, 05 Feb 2020 16:11:08 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:from:date:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=lgPcWN6p4BNWezBrExW07TYU0pjpDkLuT/nBxzOZrKA=; b=FZm3loSmtEtdI+DdMSAgDRDm+Jd48ZowxczIdHgOH5dA0c83sDS1FoMpCvBqy0NUqA EmRu9QAJnbtcLKy7nXG5UD53hxuphg34JWubJG0bvLpAPGflkb/c5J7A1jKIrAMaYy5Z RsY0Ye5PqCWkfct+8jOZ4kZ0L0D9A34dXt9oIU7RLz5QoWJgAc6LRew87bvMWfeQlFu2 20r3KwqcBwuytwRx7bZwHh9Yb0yuuIR4L5JUS/7UTjfrAKv3YbHxvtdS3lAExTFejDOO JvIzkW1ETXFyEhZcdyNT/7SsUScT9+vr6+8K+DNWkYVh3EmBBFZKFsIR2zWILfvxTZv6 Nq6A== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:from:date:to:cc:subject:message-id :references:mime-version:content-disposition:in-reply-to:user-agent; bh=lgPcWN6p4BNWezBrExW07TYU0pjpDkLuT/nBxzOZrKA=; b=EAzpj1IjU6XPKzfuOs5R/6qEqLoV52tC3lHlSpJZtlGiYWIJsnB6raw+tW1KaBCKpr B7k1zmeJz4D/bKjiPpX7zjle96yOiPuOAe7NvXm5XWNnuFRNg77x8ziEnaqVnIasKjl8 Um0SJArZbpJSyZNR4Jz1xMnkC+FS6E96hwOLkdO0++WH+ckz76hNs1EqNXARYnrBtVnO vcmP1U2A2ub/5Ep2+WUwgzFdc+CyCwyoM5scDplbmeF0rLQueKLNAIhSau6VSQEcHKaS OguoNCBdALArslF5FemxR2bQQu+viSSWln5YIRbszakSSwkyupJtJBMKwV02xLJKz275 SLpQ== X-Gm-Message-State: APjAAAXKnZX8jDcOX3qgQgnXM9gtXC05tElv2hA65f6qiXQApKTI0WyZ QuhBRgn1v/1wWrVMIsrQHoY= X-Received: by 2002:ac8:7443:: with SMTP id h3mr342700qtr.202.1580947867401; Wed, 05 Feb 2020 16:11:07 -0800 (PST) Received: from rani.riverdale.lan ([2001:470:1f07:5f3::b55f]) by smtp.gmail.com with ESMTPSA id g77sm611599qke.129.2020.02.05.16.11.06 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 05 Feb 2020 16:11:07 -0800 (PST) From: Arvind Sankar X-Google-Original-From: Arvind Sankar Date: Wed, 5 Feb 2020 19:11:05 -0500 To: Kristen Carlson Accardi Cc: tglx@linutronix.de, mingo@redhat.com, bp@alien8.de, hpa@zytor.com, arjan@linux.intel.com, keescook@chromium.org, rick.p.edgecombe@intel.com, x86@kernel.org, linux-kernel@vger.kernel.org, kernel-hardening@lists.openwall.com Subject: Re: [RFC PATCH 11/11] x86/boot: Move "boot heap" out of .bss Message-ID: <20200206001103.GA220377@rani.riverdale.lan> References: <20200205223950.1212394-1-kristen@linux.intel.com> <20200205223950.1212394-12-kristen@linux.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <20200205223950.1212394-12-kristen@linux.intel.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 05, 2020 at 02:39:50PM -0800, Kristen Carlson Accardi wrote: > From: Kees Cook > > Currently the on-disk decompression image includes the "dynamic" heap > region that is used for malloc() during kernel extraction, relocation, > and decompression ("boot_heap" of BOOT_HEAP_SIZE bytes in the .bss > section). It makes no sense to balloon the bzImage with "boot_heap" > as it is zeroed at boot, and acts much more like a "brk" region. > > This seems to be a trivial change because head_{64,32}.S already only > copies up to the start of the .bss section, so any growth in the .bss > area was already not meaningful when placing the image in memory. The > .bss size is, however, reflected in the boot params "init_size", so the > memory range calculations included the "boot_heap" region. Instead of > wasting the on-disk image size bytes, just account for this heap area > when identifying the mem_avoid ranges, and leave it out of the .bss > section entirely. For good measure, also zero initialize it, as this > was already happening for when zeroing the entire .bss section. > > While the bzImage size is dominated by the compressed vmlinux, the > difference removes 64KB for all compressors except bzip2, which removes > 4MB. For example, this is less than 1% under CONFIG_KERNEL_XZ: > > -rw-r--r-- 1 kees kees 7813168 Feb 2 23:39 arch/x86/boot/bzImage.stock-xz > -rw-r--r-- 1 kees kees 7747632 Feb 2 23:42 arch/x86/boot/bzImage.brk-xz > > but much more pronounced under CONFIG_KERNEL_BZIP2 (~27%): > > -rw-r--r-- 1 kees kees 15231024 Feb 2 23:44 arch/x86/boot/bzImage.stock-bzip2 > -rw-r--r-- 1 kees kees 11036720 Feb 2 23:47 arch/x86/boot/bzImage.brk-bzip2 > > For the future fine-grain KASLR work, this will avoid significant pain, > as the ELF section parser will use much more memory during boot and > filling the bzImage with megabytes of zeros seemed like a poor idea. :) > I'm not sure I follow this: the reason the bzImage currently contains .bss and a fix for it is in a patch I have out for review at https://lore.kernel.org/lkml/20200109150218.16544-1-nivedita@alum.mit.edu This alone shouldn't make much of a difference across compressors. The entire .bss is just stored uncompressed as 0's in bzImage currently. The only thing that gets compressed is the original kernel ELF file. Is the difference above just from this patch, or is it including the overhead of function-sections? It is not necessary for it to contain .bss to get the correct init_size. The latter is calculated (in x86/boot/header.S) based on the offset of the _end symbol in the compressed vmlinux, so storing the .bss is just a bug. https://git.kernel.org/pub/scm/linux/kernel/git/torvalds/linux.git/tree/arch/x86/boot/header.S#n559 From the cover letter: > Image Size > ---------- > Adding additional section headers as a result of compiling with > -ffunction-sections will increase the size of the vmlinux ELF file. In > addition, the vmlinux.bin file generated in arch/x86/boot/compressed by > objcopy grows significantly with the current POC implementation. This is > because the boot heap size must be dramatically increased to support shuffling > the sections and re-sorting kallsyms. With a sample kernel compilation using a > stock Fedora config, bzImage grew about 7.5X when CONFIG_FG_KASLR was enabled. > This is because the boot heap area is included in the image itself. > > It is possible to mitigate this issue by moving the boot heap out of .bss. > Kees Cook has a prototype of this working, and it is included in this > patchset. I am also confused by this -- the boot heap is not part of the vmlinux.bin in arch/x86/boot/compressed: that's a stripped copy of the decompressed kernel, just before we apply the selected compression to it and vmlinux.relocs. Do you mean arch/x86/boot/vmlinux.bin? That is an objcopy of compressed/vmlinux, and it grows in size with increasing .bss for the same reason as above (rather it's the cause of bzImage growing).