Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Mon, 8 Apr 2002 10:29:56 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Mon, 8 Apr 2002 10:29:56 -0400 Received: from e1.ny.us.ibm.com ([32.97.182.101]:13707 "EHLO e1.ny.us.ibm.com") by vger.kernel.org with ESMTP id ; Mon, 8 Apr 2002 10:29:55 -0400 Message-ID: <3CB1A9A8.1155722E@in.ibm.com> Date: Mon, 08 Apr 2002 20:01:04 +0530 From: Suparna Bhattacharya Organization: IBM X-Mailer: Mozilla 4.75 [en] (X11; U; Linux 2.2.16-22 i686) X-Accept-Language: en MIME-Version: 1.0 To: "Eric W. Biederman" CC: "Martin J. Bligh" , linux-kernel@vger.kernel.org Subject: Re: Faster reboots (and a better way of taking crashdumps?) In-Reply-To: <1759496962.1018114339@[10.10.2.3]> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org "Eric W. Biederman" wrote: > > "Martin J. Bligh" writes: > > > >> 2. Things that are reset by reboot that we don't reset during > > >> normal kernel boot? > > > > > > A sane BIOS will toggle the board level reset line on reboot. > > > The all don't but that makes it look like a fresh boot, with > > > a negligible speed penalty. > > > > I know that, but what I mean is that I'm *not* going to get > > this reset if I just jump back to the init point ... I was > > trying to work out what kind of trouble that would cause. > > I didn't mean to obfuscate the issue. There is some trouble you can > get into, but most of it is when asking the BIOS to do things for > you. Beyond that there a few rare devices that the kernel doesn't > reset right. Turning off bus master is the primary issue. There > is work in the linux power management infrastructure that will handle > most of this long term. > > > > Seriously check out my code it should just work unless there are > > > > OK, I took a very brief scan of just the descriptions of your > > patches - looks like the main thing you're doing is creating > > a 32 bit kernel entry point, right? So above and beyond that > > I'd have to rework the LILO code to work in 32 bit, which > > probably isn't that hard now I think about it ... all the hard > > stuff is actually done by the command line binary, so maybe ... > > Sorry wrong set of patches. I'm not pushing my kexec stuff quite as > hard. It is a little lower on my priority list. Basically the work > is to allow Linux to boot Linux directly. I have been trying look through this in terms of how it compares with alternate projects (bootimg, monte etc). As I mentioned in an earlier mail, crash dump (mcore) relies on bootimg, and I'm trying to decide if there could be advantages in using your kexec stuff. My main concern of course is with regard to these BIOS dependent/related issues since at the time of a crash dump we may not be in quite a "friendly state". Guess some the linux power mgmt infrastructure or driverfs should help with sane resets etc (I'm not saying its straightforward :)). in the long run. As such how far does your implementation address some of this BIOS/h/w state handling better ? BTW, some of your other boot enhancements like being able to find out which memory areas were used or overwritten during bootup sound useful to me, in being able to estimate the footprint of early boot and avoiding using those portions of memory for saving any state (because boot could stomp over them). Its good to be able to do this in a generic way, rather than have the dump code be aware of the ranges for every architecture. > > ftp://download.lnxi.com/pub/src/linux-kernel-patches/linux-2.5.7.kexec.diff > ftp://download.lnxi.com/pub/src/linux-kernel-patches/kexec-2.5.7.kexec.log > ftp://download.lnxi.com/pub/src/mkelfImage/elfboottools-2.0.tar.gz > type make and see objdir/build/sbin/kexec (work with bzImages) I don't seem to be able to access these urls. The patches I downloaded were from ftp://download.lnxi.com/pub/src/kexec/* (with the same names). Are these the right ones ? (your last note mentioned those, but you are saying that these are the wrong set ... so now I'm a little confused) Is there one single grand rollup patch with all of the function which I should look through or try out ? > > The basic kernel interface that is added is: > > struct segment { > void *buffer; > void *dest_addr; > size_t len; > }; > int kexec(void *start, int nr_segments, struct segment *segments); > Yes, its a good idea to split up the load stage and actual boot/exec stage. Crash dump needs to have it that way too (the second image preloaded in advance since we don't want to do any i/o at that point). > This appears to be an interface I can support on every linux platform. > it works on x86, and I have a proof of concept alpha port. > > The initial transfer of control happens on the bootstrap processor, in > 32bit protected mode with paging disabled. All of the segment > registers are set to a flat 32bit segment with a base address of zero. > And %esp points to an area that is good for a small stack. > > The rest is left up the controlling program. The major discussion > with this happened a while ago. There was a basic agreement on what > the system call interface should look like, but it really hasn't > progressed much since then. There are not a lot of booting experts I > can bang heads with :) > > > > special apic shutdown rules for NUMAQ machines. > > > > The APICs should be OK ... the interconnect firmware sets them > > up, and Linux never changes them, so everything *should* be OK > > in theory. Of course if it ever gets screwed up, reboot won't > > fix it, but then I can't reboot at all right now, so ... ;-) > > I have code in there to fix things of for the SMP case, you might want > to double check that doesn't mess up the NUMAQ case. > > > On the other hand, I don't reset the processors fully (I have > > to use NMI to boot rather than the INIT, INIT, STARTUP sequence), > > which seems to be asking for trouble ;-( > > Well even the INIT assert, INIT deassert, STARTUP sequence doesn't > reset the processors fully. Things like the MTRR's remain setup so > I wouldn't worry about it to much. Assuming the NMI can reliably > get the processor to a kernel entry point. > > Eric > - > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at http://vger.kernel.org/majordomo-info.html > Please read the FAQ at http://www.tux.org/lkml/ - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/