Message-ID: <516E446B.5060006@gmail.com>
Date: Wed, 17 Apr 2013 14:42:51 +0800
From: Simon Jeons
To: Naoya Horiguchi
CC: Andi Kleen, Mitsuhiro Tanino, linux-kernel, linux-mm
Subject: Re: [RFC Patch 0/2] mm: Add parameters to make kernel behavior at memory error on dirty cache selectable
References: <51662D5B.3050001@hitachi.com> <20130411134915.GH16732@two.firstfloor.org> <1365693788-djsd2ymu-mutt-n-horiguchi@ah.jp.nec.com>
In-Reply-To: <1365693788-djsd2ymu-mutt-n-horiguchi@ah.jp.nec.com>

Hi Naoya,

On 04/11/2013 11:23 PM, Naoya Horiguchi wrote:
> On Thu, Apr 11, 2013 at 03:49:16PM +0200, Andi Kleen wrote:
>>> As a result, if the dirty cache includes user data, the data is lost,
>>> and data corruption occurs if an application uses old data.
>> The application cannot use old data, the kernel code kills it if it
>> would do that. And if it's IO data there is an EIO triggered.
>>
>> iirc the only concern in the past was that the application may miss
>> the asynchronous EIO because it's cleared on any fd access.
>>
>> This is a general problem not specific to memory error handling,
>> as these asynchronous IO errors can happen due to other reason
>> (bad disk etc.)
>>
>> If you're really concerned about this case I think the solution
>> is to make the EIO more sticky so that there is a higher chance
>> that it gets returned. This will make your data much more safe,
>> as it will cover all kinds of IO errors, not just the obscure memory
>> errors.
> I'm interested in this topic, and in previous discussion, what I was told
> is that we can't expect user applications to change their behaviors when
> they get EIO, so globally changing EIO's stickiness is not a great approach.

Will user applications get EIO first, or SIGKILL first?

> I'm working on a new pagecache tag based mechanism to solve this.
> But it needs time and more discussions.
> So I guess Tanino-san suggests giving up on dirty pagecache errors
> as a quick solution.
>
> Thanks,
> Naoya
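
For readers following the point about an application missing the asynchronous EIO: below is a minimal userspace sketch, assuming an application that wants the best chance of seeing a writeback error checks the result of fsync() and close() and not only write(). The helper name write_durably() is hypothetical and does not come from this thread, and the sketch says nothing about how the kernel handles the memory error itself. It only uses standard POSIX calls.

```c
#include <errno.h>
#include <fcntl.h>
#include <stddef.h>
#include <sys/types.h>
#include <unistd.h>

/*
 * Hypothetical helper (not from the thread): write buf to path and
 * report errors from every step, including ones that only surface at
 * fsync() or close(). Returns 0 on success or a positive errno value.
 */
static int write_durably(const char *path, const void *buf, size_t len)
{
    int fd = open(path, O_WRONLY | O_CREAT | O_TRUNC, 0644);
    if (fd < 0)
        return errno;

    const char *p = buf;
    size_t left = len;
    while (left > 0) {
        ssize_t n = write(fd, p, left);
        if (n < 0) {
            if (errno == EINTR)
                continue;       /* interrupted before writing, retry */
            int err = errno;
            close(fd);
            return err;
        }
        p += n;
        left -= (size_t)n;
    }

    /* A failed writeback (for example EIO) is reported here, not by write(). */
    if (fsync(fd) < 0) {
        int err = errno;
        close(fd);
        return err;
    }

    /* close() can also report an error, so its result is checked too. */
    if (close(fd) < 0)
        return errno;

    return 0;
}
```

A caller would treat any nonzero return, EIO in particular, as "the data may not be on disk" and rewrite it from its own copy. This depends on the application checking the error at all, which is the limitation Naoya points out above.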