Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753789AbYKGPAo (ORCPT ); Fri, 7 Nov 2008 10:00:44 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752242AbYKGPAc (ORCPT ); Fri, 7 Nov 2008 10:00:32 -0500 Received: from iolanthe.rowland.org ([192.131.102.54]:44740 "HELO iolanthe.rowland.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with SMTP id S1752045AbYKGPAb (ORCPT ); Fri, 7 Nov 2008 10:00:31 -0500 Date: Fri, 7 Nov 2008 10:00:29 -0500 (EST) From: Alan Stern X-X-Sender: stern@iolanthe.rowland.org To: Brian Kysela cc: Andrew Morton , , Jens Axboe , , Subject: Re: [BUG] usb-storage: Error in queuecommand: us->srb = ffff88006a338480 In-Reply-To: <20081107005438.GA26198@ezra.hsd1.tn.comcast.net> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2264 Lines: 53 On Thu, 6 Nov 2008, Brian Kysela wrote: > On 2008-11-06 at 14:36 -0500, Alan Stern wrote: > > On Thu, 6 Nov 2008, Andrew Morton wrote: > > > > > > The error message: > > > > > > > > > [ 152.701070] usb-storage: Error in queuecommand: us->srb = ffff88007dfd0680 > > > > > > > > means that the SCSI layer has told usb-storage to start a new command > > > > before the old one completed. This could happen, for example, if the > > > > SCSI layer had gotten confused and tried to cancel a command that > > > > wasn't running -- then it would think usb-storage was idle when in fact > > > > it was still busy. (This was the failure mechanism resulting from the > > > > timeout changes.) > > > > > > > > > > OK, thanks. I'll cc linux-scsi, but I have this-wont-be-getting-fixed > > > feelings on this one :( > > > > > > I wonder what's special about Brian's setup. > > > > Brian, if you can use usbmon to monitor what happens during the file > > copy and subsequent error, it might provide a clue. Instructions are > > in the kernel source file Documentation/usb/usbmon.txt. > > Since it's a large file, I have posted the content of usbmon here: > > http://www.kysela.org/pub/1.mon.out > > I also got another kernel bug and system freeze on that copy, after the > copy process hung, when I pulled the drive out: This is exactly the failure mechanism I described earlier. The usbmon log shows that a communications error occurred and usb-storage tried to carry out error recovery. The drive apparently did not reset itself, because when the READ was reissued no data was received. In fact, the log _ends_ at the point where the READ was reissued; no more data was transferred in either direction. Presumably the next thing to occur was a command timeout. And since the block-layer timeout code in 2.6.28-rc2 is known to be buggy, it's not surprising that the copy process hung at that point. It would be better to run this test under 2.6.27; that way we might learn something new. Alan Stern -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/