Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1030557AbWJ3QQs (ORCPT ); Mon, 30 Oct 2006 11:16:48 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1030556AbWJ3QQr (ORCPT ); Mon, 30 Oct 2006 11:16:47 -0500 Received: from pentafluge.infradead.org ([213.146.154.40]:41666 "EHLO pentafluge.infradead.org") by vger.kernel.org with ESMTP id S1030555AbWJ3QQr (ORCPT ); Mon, 30 Oct 2006 11:16:47 -0500 Subject: Re: 2.6.19-rc3-git7: scsi_device_unbusy: inconsistent lock state From: Arjan van de Ven To: Jens Axboe Cc: Mark Lord , IDE/ATA development list , Linux Kernel , mingo@elte.hu In-Reply-To: <20061030154444.GH4563@kernel.dk> References: <45460D52.3000404@rtr.ca> <20061030144315.GG4563@kernel.dk> <1162220239.2948.27.camel@laptopd505.fenrus.org> <20061030154444.GH4563@kernel.dk> Content-Type: text/plain Organization: Intel International BV Date: Mon, 30 Oct 2006 17:16:42 +0100 Message-Id: <1162225002.2948.45.camel@laptopd505.fenrus.org> Mime-Version: 1.0 X-Mailer: Evolution 2.8.0 (2.8.0-7.fc6) Content-Transfer-Encoding: 7bit X-SRS-Rewrite: SMTP reverse-path rewritten from by pentafluge.infradead.org See http://www.infradead.org/rpr.html Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2207 Lines: 60 > > > so to me it looks like lockdep at least has the appearance of moaning > > about a reasonably fishy situation... > > To me it looks more about lockdep complaining because it doesn't grok > the full picture. The question is how to shut it up. ok that is quite possible. But I do think you read the original output incorrectly so let me at least phrase it in english: __queue_lock is used in softirq context like this: [] _spin_lock+0x29/0x40 [] scsi_device_unbusy+0x64/0x90 [] scsi_finish_command+0x1c/0xa0 [] blk_done_softirq+0x62/0x70 [] __do_softirq+0x87/0x100 [] do_softirq+0x55/0x60 [] ksoftirqd+0x7c/0xd0 [] kthread+0xf6/0x100 which means that it always has to be taken _irq / _irqsave and one never can enable interrupts while holding this lock. This backtrace is from the first time the lock was taken in irq context. Now a new situation has arisen that violates this constraint, and it looks like this: [] cfq_set_request+0x351/0x3b0 [] elv_set_request+0x1c/0x40 [] get_request+0x23f/0x270 [] get_request_wait+0x27/0x120 [] __make_request+0x5a/0x350 [] generic_make_request+0x16f/0x220 [] submit_bio+0x64/0x110 now cfq_set_request() uses several inlines which muddies the situation, but lockdep claims one of them is not done correctly. (eg either it takes the lock incorrectly or something does spin_unlock_irq while the lock is held) I get the impression you assumed lockdep was complaining about scsi_device_unbusy; but it's not; that function is only referenced since it's the first place since boot where the lock was taken in softirq context... not because the violation is occuring there. -- if you want to mail me at work (you don't), use arjan (at) linux.intel.com Test the interaction between Linux and your BIOS via http://www.linuxfirmwarekit.org - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/