Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755302AbZFOGo6 (ORCPT ); Mon, 15 Jun 2009 02:44:58 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751851AbZFOGov (ORCPT ); Mon, 15 Jun 2009 02:44:51 -0400 Received: from cantor2.suse.de ([195.135.220.15]:56114 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751390AbZFOGov (ORCPT ); Mon, 15 Jun 2009 02:44:51 -0400 Date: Mon, 15 Jun 2009 08:44:47 +0200 From: Nick Piggin To: Wu Fengguang Cc: Balbir Singh , Andrew Morton , LKML , Ingo Molnar , Mel Gorman , Thomas Gleixner , "H. Peter Anvin" , Peter Zijlstra , Hugh Dickins , Andi Kleen , "riel@redhat.com" , "chris.mason@oracle.com" , "linux-mm@kvack.org" Subject: Re: [PATCH 00/22] HWPOISON: Intro (v5) Message-ID: <20090615064447.GA18390@wotan.suse.de> References: <20090615024520.786814520@intel.com> <4A35BD7A.9070208@linux.vnet.ibm.com> <20090615042753.GA20788@localhost> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20090615042753.GA20788@localhost> User-Agent: Mutt/1.5.9i Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2115 Lines: 50 On Mon, Jun 15, 2009 at 12:27:53PM +0800, Wu Fengguang wrote: > On Mon, Jun 15, 2009 at 11:18:18AM +0800, Balbir Singh wrote: > > Wu Fengguang wrote: > > > Hi all, > > > > > > Comments are warmly welcome on the newly introduced uevent code :) > > > > > > I hope we can reach consensus in this round and then be able to post > > > a final version for .31 inclusion. > > > > Isn't that too aggressive? .31 is already in the merge window. > > Yes, a bit aggressive. This is a new feature that involves complex logics. > However it is basically a no-op when there are no memory errors, > and when memory corruption does occur, it's better to (possibly) panic > in this code than to panic unconditionally in the absence of this > feature (as said by Rik). > > So IMHO it's OK for .31 as long as we agree on the user interfaces, > ie. /proc/sys/vm/memory_failure_early_kill and the hwpoison uevent. > > It comes a long way through numerous reviews, and I believe all the > important issues and concerns have been addressed. Nick, Rik, Hugh, > Ingo, ... what are your opinions? Is the uevent good enough to meet > your request to "die hard" or "die gracefully" or whatever on memory > failure events? Uevent? As in, send a message to userspace? I don't think this would be ideal for a fail-stop/failover situation. I can't see a good reason to rush to merge it. IMO the userspace-visible changes have maybe not been considered too thoroughly, which is what I'd be most worried about. I probably missed seeing documentation of exact semantics and situations where admins should tune things one way or the other. Did we verify with filesystem maintainers (eg. btrfs) that the !ISREG test will be enough to prevent oopses? I hope it is going to be merged with an easy-to-use fault injector, because that is the only way Joe kernel developer is ever going to test it. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/