Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751508AbVJLTsr (ORCPT ); Wed, 12 Oct 2005 15:48:47 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751512AbVJLTsq (ORCPT ); Wed, 12 Oct 2005 15:48:46 -0400 Received: from locomotive.csh.rit.edu ([129.21.60.149]:62317 "EHLO locomotive.unixthugs.org") by vger.kernel.org with ESMTP id S1751123AbVJLTsp (ORCPT ); Wed, 12 Oct 2005 15:48:45 -0400 Message-ID: <434D6932.1040703@suse.com> Date: Wed, 12 Oct 2005 15:51:14 -0400 From: Jeff Mahoney User-Agent: Mozilla Thunderbird 1.0.6 (X11/20050715) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Anton Altaparmakov Cc: Mikulas Patocka , Glauber de Oliveira Costa , linux-kernel@vger.kernel.org, linux-fsdevel@vger.kernel.org, ext2-devel@lists.sourceforge.net, hirofumi@mail.parknet.co.jp, linux-ntfs-dev@lists.sourceforge.net, aia21@cantab.net, hch@infradead.org, viro@zeniv.linux.org.uk, akpm@osdl.org Subject: Re: [PATCH] Use of getblk differs between locations References: <20051010204517.GA30867@br.ibm.com> <20051010214605.GA11427@br.ibm.com> <1129017155.12336.4.camel@imp.csi.cam.ac.uk> In-Reply-To: <1129017155.12336.4.camel@imp.csi.cam.ac.uk> X-Enigmail-Version: 0.92.1.0 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2578 Lines: 63 -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Anton Altaparmakov wrote: > On Tue, 2005-10-11 at 00:49 +0200, Mikulas Patocka wrote: >> On Mon, 10 Oct 2005, Anton Altaparmakov wrote: >>> On Mon, 10 Oct 2005, Mikulas Patocka wrote: >>>> On Mon, 10 Oct 2005, Glauber de Oliveira Costa wrote: >>>> As comment in buffer.c says, getblk will deadlock if the machine is out of >>>> memory. It is questionable whether to deadlock or return NULL and corrupt >>>> filesystem in this case --- deadlock is probably better. >>> What do you mean corrupt filesystem? If a filesystem is written so badly >>> that it will cause corruption when a NULL is returned somewhere, I >>> certainly don't want to have anything to do with it. >> What should a filesystem driver do if it can't suddenly read or write any >> blocks on media? > > Two clear choices: > > 1) Switch to read-only and use the cached data to fulfil requests and > fail all others. > > 2) Ask the user to insert the media/plug the device back in (this is by > far the most likely cause of all requests suddenly failing) and then > continue where they left off. > > It is unfortunate that Linux does not allow for 2) so you need to do 1). I recently looked into 2) a bit and the dm multipath code is almost enough to do exactly this. If you configure your block device as an mpath device that queues on path failure, and change the table to the new device location on device re-attach, the queued up i/o will be flushed out. Almost. Right now, when you change the table and resume the dm mapping, it does a suspend which attempts to write out the data to a device which is no longer there, causing it to just be dropped on the floor. If this were changed not to do that, and perhaps set a timer so that the dirty data wouldn't be left around forever if the device wasn't reattached, 2) would definitely be possible. I realize that the userspace intervention required may involve a bit of dark magic, but my point is most of the code required on the kernel side is already implemented. - -Jeff - -- Jeff Mahoney SUSE Labs -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.2 (GNU/Linux) Comment: Using GnuPG with Thunderbird - http://enigmail.mozdev.org iD8DBQFDTWkyLPWxlyuTD7IRAg0lAJ4yoTfJPgBFPgrT7LaIOdKfN7hFCACfXVSC 685SGzxL2pGMnAySooeNaxk= =9Wfm -----END PGP SIGNATURE----- - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/