Message-ID: <534C8D64.2070108@hitachi.com>
Date: Tue, 15 Apr 2014 10:37:40 +0900
From: Masami Hiramatsu
Organization: Hitachi, Ltd., Japan
To: Vivek Goyal
Cc: linux-kernel@vger.kernel.org, Satoru MORIYA, Yoshihiro YUNOMAE, Takenori Nagano, Eric Biederman, Motohiro Kosaki, Andrew Morton
Subject: Re: [PATCH] kernel/panic: Add "late_kdump" option for kdump in unstable condition
References: <20140414045158.10846.35462.stgit@ltc230.yrl.intra.hitachi.co.jp> <20140414193153.GC4281@redhat.com>
In-Reply-To: <20140414193153.GC4281@redhat.com>

Hi Vivek,

(2014/04/15 4:31), Vivek Goyal wrote:
> On Mon, Apr 14, 2014 at 01:51:58PM +0900, Masami Hiramatsu wrote:
>> Add a "late_kdump" option to run kdump after running panic
>> notifiers and dump kmsg. This can help rare situations which
>> kdump drops in failure because of unstable crashed kernel
>> or hardware failure (memory corruption on critical data/code),
>> or the 2nd kernel is broken by the 1st kernel (it's a broken
>> behavior, but who can guarantee that the "crashed" kernel
>> works correctly?).
>>
>> Usage: add "late_kdump" to kernel boot option. That's all.
>>
>> Note that this actually increases risks of the failure of
>> kdump. This option should be set only if you worry about
>> the rare case of kdump failure rather than increasing the
>> chance of success.
>>
>
> Masami,
>
> So what's the alternative to kdump which is more reliable? IOW, what
> action you are planning to take through kmsg_dump() or through
> panic_notifiers?
>
> I have seen that many a times developers have tried to make the case
> to save kernel buffers to NVRAM. Does it work well? Has it been proven
> to be more reliable than kdump?

Yeah, one possible option is NVRAM, but even with the serial console
there are other reasons to kick the notifiers, e.g.
 - dump to ipmi, which has a very small amount of non-volatile memory
 - ftrace_dump() to dump the "flight recorder" log to serial
 - pvpanic, which notifies the host of the panic

Anyway, I think the most important reason for Linux developers is that
we get a chance to make such horrible notifiers safer, or at least to
clarify which notifier or behavior makes kdump unstable. :-)

Thank you,

-- 
Masami HIRAMATSU
Software Platform Research Dept. Linux Technology Center
Hitachi, Ltd., Yokohama Research Laboratory
E-mail: masami.hiramatsu.pt@hitachi.com
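
The ordering change that "late_kdump" makes can be sketched as a small user-space C model. This is an illustration only, not the patch and not kernel code: `struct panic_trace`, `record()`, `model_panic()` and the other `model_*` functions are hypothetical stand-ins, and the model assumes a crash kernel is loaded, so the kdump step never returns. The notifier and kmsg dump steps are listed in the order the patch description names them; the sketch does not assert their relative order in any real kernel.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical user-space model of the panic path; not kernel code. */
#define MAX_STEPS 4

struct panic_trace {
    const char *steps[MAX_STEPS]; /* names of the steps that ran, in order */
    size_t count;
};

static void record(struct panic_trace *t, const char *step)
{
    if (t->count < MAX_STEPS)
        t->steps[t->count++] = step;
}

/*
 * Stand-in for the kdump step. Assumption: a crash kernel is loaded,
 * so control passes to it and never comes back (modelled as "true").
 */
static bool model_crash_kexec(struct panic_trace *t)
{
    record(t, "crash_kexec");
    return true;
}

/* Stand-ins for the two steps that may touch unstable state. */
static void model_panic_notifiers(struct panic_trace *t)
{
    record(t, "panic_notifiers");
}

static void model_kmsg_dump(struct panic_trace *t)
{
    record(t, "kmsg_dump");
}

/*
 * Default: kdump runs first, so notifiers and kmsg dump never run.
 * late_kdump: notifiers and kmsg dump run first, then kdump.
 */
void model_panic(struct panic_trace *t, bool late_kdump)
{
    t->count = 0;

    if (!late_kdump && model_crash_kexec(t))
        return; /* the crash kernel has taken over */

    model_panic_notifiers(t);
    model_kmsg_dump(t);

    if (late_kdump)
        model_crash_kexec(t);
}
```

Calling `model_panic(&t, false)` records only the kdump step, while `model_panic(&t, true)` records the notifiers and the kmsg dump before it. That is the trade-off discussed in the thread: the extra steps get a chance to run, but anything they break can also break the later kdump.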