Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753725AbXFKWrS (ORCPT ); Mon, 11 Jun 2007 18:47:18 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753551AbXFKWrK (ORCPT ); Mon, 11 Jun 2007 18:47:10 -0400 Received: from rtr.ca ([64.26.128.89]:1784 "EHLO mail.rtr.ca" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753383AbXFKWrH (ORCPT ); Mon, 11 Jun 2007 18:47:07 -0400 Message-ID: <466DD0E9.1060506@rtr.ca> Date: Mon, 11 Jun 2007 18:47:05 -0400 From: Mark Lord User-Agent: Thunderbird 2.0.0.0 (X11/20070326) MIME-Version: 1.0 To: Jan Kara Cc: Andrew Morton , Chuck Ebbert , Stephen Tweedie , "Theodore Ts'o" , Linux Kernel Subject: Re: ext3fs: umount+sync not enough to guarantee metadata-on-disk References: <46680BB8.50404@rtr.ca> <20070607084142.42583639.akpm@linux-foundation.org> <46682E4E.1070303@redhat.com> <20070607124507.79d8da54.akpm@linux-foundation.org> <466A175C.1000906@rtr.ca> <20070611111457.GB26561@atrey.karlin.mff.cuni.cz> In-Reply-To: <20070611111457.GB26561@atrey.karlin.mff.cuni.cz> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2468 Lines: 57 Jan Kara wrote: >> Andrew Morton wrote: >>> On Thu, 07 Jun 2007 12:11:58 -0400 >>> Chuck Ebbert wrote: >>> >>>> On 06/07/2007 11:41 AM, Andrew Morton wrote: >>>>>> mount /var/lib/mythtv -oremount,ro >>>>>> sync >>>>>> umount /var/lib/mythtv >>>>> Did this succeed? If the application is still truncating that file, the >>>>> umount should have failed. >>>> Shouldn't sync should wait for truncate to finish? >>> I can't think of anything in there at present which would cause that to >>> happen, and it's not immediately obvious how we _could_ make it happen - we >>> have an inode which potentially has no dirty pages and which is itself >>> clean. The truncate can span multiple journal commits, so forcing a >>> journal commit in sync() won't necessarily block behind the truncate. >>> >>> I guess we could ask sync to speculatively take and release every inode's >>> i_mutex or something. But even that would involve quite some hoop-jumping >>> due to those infuriating spinlock-protected list_heads on the superblock. >>> >>> hmm. >> Okay, I added more instrumentation and retested today. >> >> Good and Bad. >> The umount does indeed fail while the massive unlink is happening, >> so I can just loop on that a few times before giving up. >> >> But.. the earlier "remount,ro".. well.. I don't know what it does. >> I did get it to lock up solid, though.. hung on the "remount,ro" >> when issued during an unlink of a 15GB file. The disk I/O eventually >> completes, and drives go idle, but the system remains hung inside >> the remount,ro call. >> >> Alt-sysrq-T was functioning, so I have some screen shots (.jpg) here: >> >> http://rtr.ca/remount_ro/ > Thanks for the traces. > >> That's definitely a bug. > Yes. We have a nice lock inversion there. ext3_remount() is called > with sb->s_lock held and waits for transaction to finish in > journal_lock_updates(). On the other hand ext3_orphan_del() is called > inside a transaction and tries to do lock_super()... Bad luck. > Peachy. Do you have enough knowledge here to generate a fix for this? Maybe just have the remount break out, releasing all locks, and then loop and retry (or return -EBUSY?) when this happens? Cheers - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/