Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S262268AbVCSAPV (ORCPT ); Fri, 18 Mar 2005 19:15:21 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S262296AbVCSANs (ORCPT ); Fri, 18 Mar 2005 19:13:48 -0500 Received: from fire.osdl.org ([65.172.181.4]:52370 "EHLO smtp.osdl.org") by vger.kernel.org with ESMTP id S262339AbVCSANB (ORCPT ); Fri, 18 Mar 2005 19:13:01 -0500 Date: Fri, 18 Mar 2005 16:11:26 -0800 From: Andrew Morton To: rweight@us.ibm.com Cc: dev@sw.ru, linux-kernel@vger.kernel.org Subject: Re: RFC: Bug in generic_forget_inode() ? Message-Id: <20050318161126.7e72e8f3.akpm@osdl.org> In-Reply-To: <1111190260.7102.78.camel@russw.beaverton.ibm.com> References: <1111183051.7102.33.camel@russw.beaverton.ibm.com> <20050318141716.494ab02a.akpm@osdl.org> <1111190260.7102.78.camel@russw.beaverton.ibm.com> X-Mailer: Sylpheed version 0.9.7 (GTK+ 1.2.10; i386-redhat-linux-gnu) Mime-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2566 Lines: 55 Russ Weight wrote: > > On Fri, 2005-03-18 at 14:17 -0800, Andrew Morton wrote: > > Russ Weight wrote: > > > > > > generic_forget_inode() is eventually called (within the context of > > > iput), the inode is placed on the unused list, and the inode_lock is > > > dropped. > > > > > > kswapd calls prune_icache(), locks the inode_lock, and pulls the same > > > inode off of the unused list. Upon completion, prune_icache() calls > > > dispose_list() for the inodes that it has collected. > > > > > > generic_forget_inode() calls write_inode_now(), which calls > > > __writeback_single_inode() which calls __sync_single_inode(). > > > __sync_single_inode() panics when attempting to move the inode onto the > > > unused list (the last call to list_move). This is due to the poison > > > values that were previously loaded into the next and prev list pointers > > > by list_del(). > > > > It's not clear what the actual bug is here. When you say that > > __sync_single_inode() panics over the list pointers, who was it that > > poisoned them? dispose_list()? > > > > Certainly isofs_fill_super() could trivially be rewritten to not do the > > iget()/iput() but we should be sure that that's really the bug. The inode > > lifetime management is rather messy, I'm afraid. > > The pointers are poisoned by dispose_list(). The race condition is > between prune_icache() and generic_forget_inode(). > > When I suggested that a change to isofs_fill_super() might be > considered, I was speculating that isofs_fill_super() might be creating > an unexpected state by doing something unconventional in its usage of > iget() and iput(). This is something I had not investigated. > > The problem is more likely in generic_forget_inode(). It releases the > inode_lock and then locks it again without doing anything to prevent the > inode from being stolen, and without checking to see if it has been > stolen. Likewise, write_inode_now() does not do any checks to see if the > inode has been stolen. > I dunno. This is really fiddly code and does like to blow up in your face. I doubt if the problem is in such a well-trodden path as generic_forget_inode(). Perhaps isofs is breaking the rules by running iget() prior to setting MS_ACTIVE. (What rules, you ask? hah.) - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/