Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751538AbWCIHtA (ORCPT ); Thu, 9 Mar 2006 02:49:00 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751048AbWCIHtA (ORCPT ); Thu, 9 Mar 2006 02:49:00 -0500 Received: from e33.co.us.ibm.com ([32.97.110.151]:27623 "EHLO e33.co.us.ibm.com") by vger.kernel.org with ESMTP id S1750778AbWCIHs7 (ORCPT ); Thu, 9 Mar 2006 02:48:59 -0500 Message-ID: <440FDF3E.8060400@in.ibm.com> Date: Thu, 09 Mar 2006 13:24:38 +0530 From: Suzuki Organization: IBM Software Labs User-Agent: Mozilla Thunderbird 1.0 (X11/20041206) X-Accept-Language: en-us, en MIME-Version: 1.0 To: linux-fsdevel@vger.kernel.org, "linux-aio kvack.org" , lkml CC: suparna , akpm@osdl.org Subject: [RFC] Badness in __mutex_unlock_slowpath with XFS stress tests Content-Type: multipart/mixed; boundary="------------080105000700070900030208" Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2911 Lines: 90 This is a multi-part message in MIME format. --------------080105000700070900030208 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Missed out linux-aio & linux-fs-devel lists. Forwarding. Comments ? Suzuki K P Linux Technology Center, IBM Software Labs, India. --------------080105000700070900030208 Content-Type: message/rfc822; name="[RFC] Badness in __mutex_unlock_slowpath with XFS stress tests.eml" Content-Transfer-Encoding: 7bit Content-Disposition: inline; filename="[RFC] Badness in __mutex_unlock_slowpath with XFS stress tests.eml" Message-ID: <440FD66D.6060308@in.ibm.com> Date: Thu, 09 Mar 2006 12:47:01 +0530 From: Suzuki Organization: IBM Software Labs User-Agent: Mozilla Thunderbird 1.0 (X11/20041206) X-Accept-Language: en-us, en MIME-Version: 1.0 To: lkml CC: suparna , akpm@osdl.org Subject: [RFC] Badness in __mutex_unlock_slowpath with XFS stress tests Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Hi all, I was working on an issue with getting "Badness in __mutex_unlock_slowpath" and hence a stack trace, while running FS stress tests on XFS on 2.6.16-rc5 kernel. The dmesg looks like : Badness in __mutex_unlock_slowpath at kernel/mutex.c:207 [] show_trace+0x20/0x22 [] dump_stack+0x1e/0x20 [] __mutex_unlock_slowpath+0x12a/0x23b [] mutex_unlock+0xb/0xd [] xfs_read+0x230/0x2d9 [] linvfs_aio_read+0x8d/0x98 [] do_sync_read+0xb8/0x107 [] vfs_read+0xc9/0x19b [] sys_read+0x47/0x6e [] sysenter_past_esp+0x54/0x75 This happens with XFS DIO reads. xfs_read holds the i_mutex and issues a __generic_file_aio_read(), which falls into __blockdev_direct_IO with DIO_OWN_LOCKING flag (since xfs uses own_locking ). Now __blockdev_direct_IO releases the i_mutex for READs with DIO_OWN_LOCKING.When it returns to xfs_read, it tries to unlock the i_mutex ( which is now already unlocked), causing the "Badness". The possible solution which we can think of, is not to unlock the i_mutex for DIO_OWN_LOCKING. This will only affect the DIO_OWN_LOCKING users (as of now, only XFS ) with concurrent DIO sync read requests. AIO read requests would not suffer this problem since they would just return once the DIO is submitted. Another work around for this can be adding a check "mutex_is_locked" before trying to unlock i_mutex in xfs_read. But this seems to be an ugly hack. :( Comments ? thanks, Suzuki --------------080105000700070900030208-- - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/