Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp869704imu; Fri, 25 Jan 2019 12:31:07 -0800 (PST) X-Google-Smtp-Source: ALg8bN4uGv4u0mww4F0HHOhUC2grnSdxMWTFq0HwBrMC+nyjQPU43OD9cuTy42G79WNBRBetxgPI X-Received: by 2002:a63:e20a:: with SMTP id q10mr10951139pgh.206.1548448267276; Fri, 25 Jan 2019 12:31:07 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1548448267; cv=none; d=google.com; s=arc-20160816; b=DpiDPMyLiTQFs8pYJGsBqbmn44k6BryHFKhDNBep1H3sdmODl2FPwkDuu/ZpqmAx5a Qa4zsMdpAkvPNdiIPojUaq0Vm6dCPwfsL60FqWPlfREOLaLCiMF6fQ5FgNYxw4VG4kPp Gbj9y87o1+8pDGRs2P+yPFaOZC0a71QHTsD9yvcx3fcs2ccqOeRA9AeRsbw2oWNvvi96 i46N7iP++pX1DTZCgrPNN43xJFOV/wnloMlYZnmacDNK6aA8DG04wUq7x43QWj/TLy49 aI/p+5dWZnxqI47ckFNQsC2LhSyAu5X2eONOQRSdGr2jDxkkSTIWoFfQ7U/AUmpStEWw oSCQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:from:cc:to:subject :content-transfer-encoding:mime-version:references:in-reply-to :user-agent:date; bh=ddOJp5oXaNQZj5IhScp2qoOi3YQ4BUB6ec9Yva1rY/Y=; b=td6VkgL/bMBG/LKoymiH9sX9jqyBzHJtQzZqgOrt2uMF66novAFgcy09FGkBxv5Two me25wldhjwQcOrbzdhH55/pZI3IQWw6AXPsIdsK0iO6BIIy6HN4lmLDR82uAwyVt6LEm Bukn+ftI8oC7wuoDnqWnGEBmVSK5pNLGWCKBmFe0SqEwpLCaq/Gl8yYy06iuu5Mwv9Oh 3J+XNEgk5zsEu/fDI+mgOIq9gRP3IbfotoOKwpugPdaU/Lv+0wQMMFZfNFpJlf5C7pHr ylDujpJnG4C34Z5tbSZbuqhQIVZSSoCwc/bgCWIHfMSaTR6EHgDP+NDq5OLlR0Jwb1ih GL8A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id n125si149957pga.179.2019.01.25.12.30.51; Fri, 25 Jan 2019 12:31:07 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729035AbfAYU3Q convert rfc822-to-8bit (ORCPT + 99 others); Fri, 25 Jan 2019 15:29:16 -0500 Received: from terminus.zytor.com ([198.137.202.136]:40999 "EHLO mail.zytor.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726044AbfAYU3Q (ORCPT ); Fri, 25 Jan 2019 15:29:16 -0500 Received: from wld62.hos.anvin.org (c-24-5-245-234.hsd1.ca.comcast.net [24.5.245.234] (may be forged)) (authenticated bits=0) by mail.zytor.com (8.15.2/8.15.2) with ESMTPSA id x0PKS61d267658 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NO); Fri, 25 Jan 2019 12:28:07 -0800 Date: Fri, 25 Jan 2019 11:51:52 -0800 User-Agent: K-9 Mail for Android In-Reply-To: References: <20190119103606.GA17400@kroah.com> <8BD4CB7A-944C-4EC5-A198-1D85C9EE28D6@zytor.com> <20190120161003.GB23827@google.com> <20190121014553.GD23827@google.com> <20190122133901.GA189736@google.com> <117d2f96-b0e9-2376-69b7-836fa0c52539@opersys.com> <6f82d99a-d877-aeae-3e63-95dff260c445@opersys.com> <20190124205929.GA141510@google.com> <50511D3D-7193-4B1C-952E-CCC37FA71388@zytor.com> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8BIT Subject: Re: [RFC] Provide in-kernel headers for making it easy to extend the kernel To: Daniel Colascione CC: Joel Fernandes , Karim Yaghmour , Greg KH , Christoph Hellwig , linux-kernel , Andrew Morton , ast@kernel.org, atish patra , Borislav Petkov , Ingo Molnar , Jan Kara , Jonathan Corbet , Kees Cook , kernel-team@android.com, "open list:DOCUMENTATION" , Manoj Rao , Masahiro Yamada , Paul McKenney , "Peter Zijlstra (Intel)" , Randy Dunlap , rostedt@goodmis.org, Thomas Gleixner , "maintainer:X86 ARCHITECTURE (32-BIT AND 64-BIT)" , yhs@fb.com From: hpa@zytor.com Message-ID: <48D020BD-D344-41CC-9D32-033F592605EB@zytor.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On January 25, 2019 11:15:52 AM PST, Daniel Colascione wrote: >On Fri, Jan 25, 2019 at 11:01 AM wrote: >> >> On January 24, 2019 12:59:29 PM PST, Joel Fernandes > wrote: >> >On Thu, Jan 24, 2019 at 07:57:26PM +0100, Karim Yaghmour wrote: >> >> >> >> On 1/23/19 11:37 PM, Daniel Colascione wrote: >> >[..] >> >> > > Personally I advocated a more aggressive approach with Joel in >> >private: >> >> > > just put the darn headers straight into the kernel image, it's >> >the >> >> > > *only* artifact we're sure will follow the Android device >> >whatever >> >> > > happens to it (like built-in ftrace). >> >> > >> >> > I was thinking along similar lines. Ordinarily, we make loadable >> >> > kernel modules. What we kind of want here is a non-loadable >kernel >> >> > module --- or a non-loadable section in the kernel image proper. >> >I'm >> >> > not familiar with early-stage kernel loader operation: I know >it's >> >> > possible to crease discardable sections in the kernel image, but >> >can >> >> > we create sections that are never slurped into memory in the >first >> >> > place? If not, maybe loading and immediately discarding the >header >> >> > section is good enough. >> >> >> >> Interesting. Maybe just append it to the image but have it not >loaded >> >and >> >> have a kernel parameter than enables a "/proc/kheaders" driver to >> >know where >> >> the fetch the appended headers from storage at runtime. There >would >> >be no >> >> RAM loading whatsoever of the headers, just some sort of >> >> "kheaders=/dev/foobar:offset:size" parameter. If you turn the >option >> >on, you >> >> get a fatter kernel image size to store on permanent storage, but >no >> >impact >> >> on what's loaded at boot time. >> > >> >Embedding anything into the kernel image does impact boot time >though >> >because >> >it increase the time spent by bootloader. A module OTOH would not >have >> >such >> >overhead. >> > >> >Also a kernel can be booted in any number of ways other than mass >> >storage so >> >it is not a generic Linux-wide solution to have a kheaders= option >like >> >that. >> >If the option is forgotten, then the running system can't use the >> >feature. >> >The other issue is it requires a kernel command line option / >> >bootloader >> >changes for that which adds more configuration burden, which not be >> >needed >> >with a module. >> > >> >> > Would such a thing really do better than LZMA? LZMA already has >> >very >> >> > clever techniques for eliminating long-range redundancies in >> >> > compressible text, including redundancies at the sub-byte level. >I >> >can >> >> > certainly understand the benefit of stripping comments, since >> >removing >> >> > comments really does decrease the total amount of information >the >> >> > compressor has to preserve, but I'm not sure how much the >encoding >> >> > scheme you propose below would help, since it reminds me of the >> >> > encoding scheme that LZMA would discover automatically. >> >> >> >> I'm no compression algorithm expert. If you say LZMA would do the >> >> same/better than what I suggested then I have no reason to contest >> >that. My >> >> goal is to see the headers as part of the kernel image that's >> >distributed on >> >> devices so that they don't have to be chased around. I'm just >trying >> >to make >> >> it as palatable as possible. >> > >> >I believe LZMA is really good at that sort of thing too. >> > >> >Also at 3.3MB of module size, I think we are really good size-wise. >But >> >Dan >> >is helping look at possibly reducing further if he gets time. Many >> >modules in >> >my experience are much bigger. amdgpu.ko on my Linux machine is >6.1MB. >> > >> >I really think making it a module is the best way to make sure this >is >> >bundled with the kernel on the widest number of Android and other >Linux >> >systems, without incurring boot time overhead, or any other command >> >line >> >configuration burden. >> > >> >I spoke to so many people at LPC personally with other kernel >> >contributors, >> >and many folks told me one word - MODULE :D. Even though I >hesitated >> >at >> >first, now it seems the right solution. >> > >> >If no one seriously objects, I'll clean this up and post a v2 and >with >> >the >> >RFC tag taken off. Thank you! >> > >> > - Joel >> >> So let me throw in a different notion. >> >> A kernel module really is nothing other than a kernel build system >artifact stored in the filesystem. >> >> I really don't at any reason whatsoever why this is direct from just >producing an archive and putting it in the module directory, except >that the latter is far simpler. >> >> I see literally *no* problem, social or technical, you are solvin by >actually making it a kernel ELF object. > >Joel does have a point. Suppose we're on Android and we're testing a >random local kernel we've built. We can command the device to boot >into the bootloader, then send our locally-built kernel to the device >with "fastboot boot mykernelz". Having booted the device this way, >there's no on-disk artifact corresponding to mykernelz: we just sent >the boot kernel directly to the device's memory. Now, suppose I want >to attach DCTV or some other fancy ftrace-based analysis tool to the >device in order to study how mykernelz acts in some scenario I care >about. With Joel's approach, DCTV would be able to grab the kernel >headers from the running kernel, compile whatever kprobe or BPF >incantations needed, and have everything Just Work. If we put the >headers only on disk without any way to retrieve them at runtime, we'd >need a different path for kernel self-description in the case where a >running kernel doesn't have an on-disk representation, and that adds a >lot of complexity for everyone everywhere. By providing >guaranteed-correct kernel headers via some runtime interface, we make >a lot of things Just Work, and that has value. Joel specifically is talking about using a module, which *does* have to be in the filesystem. You can't have it both ways, unfortunately. -- Sent from my Android device with K-9 Mail. Please excuse my brevity.