Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp730438imu; Fri, 9 Nov 2018 05:15:43 -0800 (PST) X-Google-Smtp-Source: AJdET5e+AyhgBuoKN7iT8DFSQHlRaaGL8BQ4FCko6VobgWbDx2+62TlMqa5p5E0lmSyC/gDDxXcJ X-Received: by 2002:a62:507:: with SMTP id 7-v6mr9021783pff.80.1541769343069; Fri, 09 Nov 2018 05:15:43 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1541769343; cv=none; d=google.com; s=arc-20160816; b=lx8xR2FTybQ685u2hIQfiQ8+jTdQIAHHTWf61uzMx8lkbY2ES53DA6FzN9T68gEpk4 5FYh9QQKerTEGiE4mfXCGPqLwWqnr8cRtphbuw4SBdVLSCofFI71z6sQS++JOdTZV3M7 750Xg5indVLEIUobgV6VXHXuqlcAgcV48+qFLtAx+tBBVncSBS3jMy2x9RGYYF2Xh5P4 Cz5nKDzcTCiWR23VMmUXc6L1UuIcP2etfQ2fKLkLHPZAp/ZZ//ppAzga+R/LOAitWRhQ 3SQAml2yC+92Q2iKw3u8MrnhhiWsB3UNIMLSxObfmUO0hnv5B3u/1GgZVSE/HnCqdfhU 8zag== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=NCnTwS9dwTIWUrcCTvbE8Oup43euQRZb3OTCN50FW2Q=; b=rDFuDMpOWPumqzL1fBjc5AdMAOa1FZOmGb4gJw7rS5EvCQjBVRImV6VTvUHxCnKSwn 5hwDrNTuiQy8Abo0hQiaVL0RAW6yhw+akZAMN+9gENiciam64kgfjCUgI5LTMoQJx7T0 ZOrnMqO54RyZVmlxgTGZJxUu8V2YalJRqCvuGpXDprNbufJeJCEwxUKn3tk3P48Vhn01 n0EsuCHL3FZkQq27jp/7LeB5Bf0GWQ3CWCErj0Mxv16C5paMaFSbBQgSpE9sRbQF5ava LZnvq/9Zy0opEqkyrOJ9fGik4zDrd3gnRFpQd7Q8lHj2JYzhp82K79TKGI72U7MER4JK zcCg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f5si7242584pgr.411.2018.11.09.05.15.26; Fri, 09 Nov 2018 05:15:43 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727827AbeKIWze (ORCPT + 99 others); Fri, 9 Nov 2018 17:55:34 -0500 Received: from mga14.intel.com ([192.55.52.115]:46851 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727667AbeKIWze (ORCPT ); Fri, 9 Nov 2018 17:55:34 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga007.jf.intel.com ([10.7.209.58]) by fmsmga103.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 09 Nov 2018 05:14:59 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.54,483,1534834800"; d="scan'208";a="88012441" Received: from mke-mobl.ccr.corp.intel.com (HELO [192.168.4.139]) ([10.255.30.228]) by orsmga007.jf.intel.com with ESMTP; 09 Nov 2018 05:11:21 -0800 Subject: Re: [Qemu-devel] [RFC/PoC PATCH 1/3] i386: set initrd_max to 4G - 1 to allow up to 4G initrd To: Juergen Gross , Ingo Molnar , Li Zhijian Cc: Peter Maydell , x86@kernel.org, hpa@zytor.com, bp@alien8.de, mingo@redhat.com, tglx@linutronix.de, QEMU Developers , Philip Li , linux-kernel@vger.kernel.org, Linus Torvalds , Peter Zijlstra , Kees Cook References: <1541674784-25936-1-git-send-email-lizhijian@cn.fujitsu.com> <1541674784-25936-2-git-send-email-lizhijian@cn.fujitsu.com> <20181109072015.GA86700@gmail.com> <38905d35-29af-b522-1629-b13e98a47a42@intel.com> From: Li Zhijian Message-ID: <01ae76b5-bc70-7859-91e3-7e60e3cca247@intel.com> Date: Fri, 9 Nov 2018 21:11:17 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit Content-Language: en-US Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Just noticed that there is a field xloadflags at recent protocol 60 Protocol 2.12: (Kernel 3.8) Added the xloadflags field and extension fields 61 to struct boot_params for loading bzImage and ramdisk 62 above 4G in 64bit. [snip] 617 Field name: xloadflags 618 Type: read 619 Offset/size: 0x236/2 620 Protocol: 2.12+ 621 622 This field is a bitmask. 623 624 Bit 0 (read): XLF_KERNEL_64 625 - If 1, this kernel has the legacy 64-bit entry point at 0x200. 626 627 Bit 1 (read): XLF_CAN_BE_LOADED_ABOVE_4G 628 - If 1, kernel/boot_params/cmdline/ramdisk can be above 4G. 629 maybe we can reuse this field and append a new Bit 5 XLF_INITRD_SIZE_4G or such thanks Zhijian On 11/9/2018 6:04 PM, Juergen Gross wrote: > On 09/11/2018 10:57, Li Zhijian wrote: >> On 11/9/2018 3:20 PM, Ingo Molnar wrote: >>> * Li Zhijian wrote: >>> >>>>> If the kernel initrd creation process creates an initrd which >>>>> is larger than 2GB and also claims that it can't be placed >>>>> with any part of it above 2GB, then that sounds like a bug >>>>> in the initrd creation process... >>>> Exactly, it's a real problem. >>>> >>>> Add x86 maintainers and LKML: >>>> >>>> The background is that QEMU want to support up to 4G initrd. but >>>> linux header ( >>>> initrd_addr_max field) only allow 2G-1. >>>> Is one of the below approaches reasonable: >>>> 1) change initrd_addr_max to 4G-1 directly >>>> simply(arch/x86/boot/header.S)? >>>> 2) lie QEMU bootloader the initrd_addr_max is 4G-1 even though header >>>> said 2G-1 >>>> 3) any else >>> A 10 years old comment from hpa says: >>> >>>    initrd_addr_max: .long 0x7fffffff >>>                                          # (Header version 0x0203 or >>> later) >>>                                          # The highest safe address for >>>                                          # the contents of an initrd >>>                                          # The current kernel allows >>> up to 4 GB, >>>                                          # but leave it at 2 GB to avoid >>>                                          # possible bootloader bugs. >>> >>> To avoid the potential of bugs lurking in dozens of major and hundreds of >>> minor iterations of various Linux bootloaders I'd prefer a real solution >>> and extend it - because if there's a 2GB initrd for some weird reason >>> today there might be a 4GB one in two years. >> thank a lots. that's amazing. >> >> >>> The real solution would be to: >>> >>>   - Extend the boot protocol with a 64-bit field, named initrd_addr64_max >>>     or such. >>>   - We don't change the old field - but if the new field is set by new >>>     kernels then new bootloaders can use that as a new initrd_addr64_max >>>     value. (or reject to load the kernel if the address is too high.) >>> >>>   - The kernel build should also emit a warning when building larger than >>>     2GB initrds, with a list of bootloaders that support the new >>> protocol. >> Actually i just knew QEMU(Seabios + optionrom(linuxboot_dma.bin)) can >> support ~4GB initrd so far. >> >> i just drafted at patch to add this field. could you have a look. >> another patch which is to document initrd_addr64_max is ongoing. >> >> commit db463ac9c1975f115d1ce2acb82d530c2b63b888 >> Author: Li Zhijian >> Date:   Fri Nov 9 17:24:14 2018 +0800 >> >>     x86: Add header field initrd_addr64_max >>         Years ago, kernel had support load ~4GB initrd. But for some >> weird reasons ( >>     avoid possible bootloader bugs), it only allow leave initrd under >> 2GB address >>     space(see initrd_addr_max fild at arch/x86/boot/header.S). >>         So modern bootloaders have not chance to load >=2G initrd >> previously. >>         To avoid the potential of bugs lurking in dozens of major and >> hundreds of >>     minor iterations of various Linux bootloaders. Ingo suggests to add >> a new field >>     initrd_addr64_max. If bootloader believes that it can load initrd to >>> =2G >>     address space, it can use initrd_addr64_max as the maximum loading >> address in >>     stead of the old field initrd_addr_max. >> >> diff --git a/arch/x86/boot/header.S b/arch/x86/boot/header.S >> index 4c881c8..5fc3ebe 100644 >> --- a/arch/x86/boot/header.S >> +++ b/arch/x86/boot/header.S >> @@ -300,7 +300,7 @@ _start: >>         # Part 2 of the header, from the old setup.S >> >>                 .ascii  "HdrS"          # header signature >> -               .word   0x020e          # header version number (>= 0x0105) >> +               .word   0x020f          # header version number (>= 0x0105) >>                                         # or else old loadlin-1.5 will >> fail) >>                 .globl realmode_swtch >>  realmode_swtch:        .word   0, 0            # default_switch, SETUPSEG >> @@ -562,6 +562,12 @@ acpi_rsdp_addr:            .quad 0 >> # 64-bit physical pointer to the >>                                                 # ACPI RSDP table, added >> with >>                                                 # version 2.14 >> >> +#ifdef CONFIG_INITRD_SIZE_4GB >> +initrd_addr64_max:     .quad 0xffffffff        # allow ~4G initrd since >> 2.15 >> +#else >> +initrd_addr64_max:     .quad 0 > Shouldn't this be 0x7fffffff? > > And please update Documentation/x86/boot.txt > > > Juergen