Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934167Ab3CSUyP (ORCPT ); Tue, 19 Mar 2013 16:54:15 -0400 Received: from cantor2.suse.de ([195.135.220.15]:57714 "EHLO mx2.suse.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933289Ab3CSUyO (ORCPT ); Tue, 19 Mar 2013 16:54:14 -0400 Date: Tue, 19 Mar 2013 21:54:03 +0100 From: Jan Kara To: Al Viro Cc: Jan Kara , David Howells , Miklos Szeredi , torvalds@linux-foundation.org, linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, hch@infradead.org, akpm@linux-foundation.org, apw@canonical.com, nbd@openwrt.org, neilb@suse.de, jordipujolp@gmail.com, ezk@fsl.cs.sunysb.edu, sedat.dilek@googlemail.com, hooanon05@yahoo.co.jp, mszeredi@suse.cz Subject: Re: [PATCH 2/9] vfs: export do_splice_direct() to modules Message-ID: <20130319205403.GG5222@quack.suse.cz> References: <1363184193-1796-3-git-send-email-miklos@szeredi.hu> <1363184193-1796-1-git-send-email-miklos@szeredi.hu> <1944.1363525619@warthog.procyon.org.uk> <20130318153936.GB28508@quack.suse.cz> <20130318215333.GE21522@ZenIV.linux.org.uk> <20130318230103.GF21522@ZenIV.linux.org.uk> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20130318230103.GF21522@ZenIV.linux.org.uk> User-Agent: Mutt/1.5.20 (2009-06-14) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2346 Lines: 50 On Mon 18-03-13 23:01:03, Al Viro wrote: > On Mon, Mar 18, 2013 at 09:53:34PM +0000, Al Viro wrote: > > On Mon, Mar 18, 2013 at 04:39:36PM +0100, Jan Kara wrote: > > > IMO the deadlock is real. In freeze_super() we wait for all writers to > > > the filesystem to finish while blocking beginning of any further writes. So > > > we have a deadlock scenario like: > > > > > > THREAD1 THREAD2 THREAD3 > > > mnt_want_write() mutex_lock(&inode->i_mutex); > > > ... freeze_super() > > > block on mutex_lock(&inode->i_mutex) > > > sb_wait_write(sb, SB_FREEZE_WRITE); > > > block in sb_start_write() > > > > The bug is on fsfreeze side and this is not the only problem related to it. > > I've missed the implications when I applied "fs: Add freezing handling > > to mnt_want_write() / mnt_drop_write()" last June ;-/ > > > > The thing is, until then mnt_want_write() used to be a counter; it could be > > nested. Now any such nesting is a deadlock you've just described. This > > is seriously wrong, IMO. > > > > BTW, having sb_start_write() buried in individual ->splice_write() is > > asking for trouble; could you describe the rules for that? E.g. where > > does it nest wrt filesystem-private locks? XFS iolock, for example... > > I'm looking at the existing callers and I really wonder if we ought to > push sb_start_write() from ->splice_write()/->aio_write()/etc. into the > callers. Yeah, that should be OK. > Something like file_start_write()/file_end_write(), with check for file > being regular one might be a good starting point. As it is, copyup is > really fucked both in unionmount and overlayfs... Makes sense. I can do the patch. BTW, for months I'm trying to push to you a patch which creates a function like file_start_write() which returns EAGAIN if the file is open with O_NONBLOCK and fs is frozen (this allows me to solve a deadlock with bsd process accounting to frozen fs). After this change the patch will become trivial so I'll add it to the series and hopefully it won't get ignored. Honza -- Jan Kara SUSE Labs, CR -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/