Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1760409AbXKOQ0W (ORCPT ); Thu, 15 Nov 2007 11:26:22 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755113AbXKOQ0K (ORCPT ); Thu, 15 Nov 2007 11:26:10 -0500 Received: from empbedex1.empirix.com ([12.38.203.54]:9696 "EHLO empbedex1.empirix.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752724AbXKOQ0J convert rfc822-to-8bit (ORCPT ); Thu, 15 Nov 2007 11:26:09 -0500 X-MimeOLE: Produced By Microsoft Exchange V6.5 Content-class: urn:content-classes:message MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT Subject: RE: 2.6.23.1 - sata_mv (7042) hang with large file operations Date: Thu, 15 Nov 2007 11:26:02 -0500 Message-ID: In-Reply-To: <473B76E0.7010500@rtr.ca> X-MS-Has-Attach: X-MS-TNEF-Correlator: Thread-Topic: 2.6.23.1 - sata_mv (7042) hang with large file operations Thread-Index: AcgnDfCDcFTk2S3PRzW9Z2OXUgvbiQAkZOhA References: <45ED682A.9040408@garzik.org> <4728A816.8020608@garzik.org> <473B36D7.8000205@rtr.ca> <473B44CB.6010209@rtr.ca> <473B76E0.7010500@rtr.ca> From: "Morrison, Tom" To: "Mark Lord" Cc: "Jeff Garzik" , , X-OriginalArrivalTime: 15 Nov 2007 16:26:07.0709 (UTC) FILETIME=[39C068D0:01C827A4] Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1997 Lines: 54 Additional information - the ~file size this is caused is somewhere close to 260Mbytesfiles. If I create a ~260Mbytes file - my program finishes creating the file - but ~5 seconds later (I timed this by hitting enter on the console every second after completion of the command that should have done a fsync of the created file before exiting)... It hangs... I did a little playing around with /proc/sys/dev/scsi/logging_level (set to 0x7) - and it seems that the kernel/box locks up after this line: >> scsi_add_timer: scmd: efca83c0, time: 7500, (c0160660) >> scsi_delete_timer: scmd: efca83c0, rtn: 1 >> scsi_add_timer: scmd: efca8840, time: 7500, (c0160660) Further analysis (setting logging level to 65535 (0xFFFF) Has the following behavior down low) - >> scsi_add_timer: scmd: efca8960, time: 7500, (c0160660) >> sd 0:0:0:0: [sda] Send: 0xefca8960 >> sd 0:0:0:0: [sda] CDB: Write(10): 2a 00 00 47 92 27 00 01 48 00 >> buffer = 0xc0553040, bufflen = 167936, done = 0xc016b194, queuecommand 0xc017ed34 >> leaving scsi_dispatch_cmnd() >> scsi_delete_timer: scmd: efca8960, rtn: 1 >> sd 0:0:0:0: [sda] Done: 0xefca8960 SUCCESS >> sd 0:0:0:0: [sda] Result: hostbyte=DID_OK driverbyte=DRIVER_OK,SUGGEST_OK >> sd 0:0:0:0: [sda] CDB: Write(10): 2a 00 00 47 92 27 00 01 48 00 >> sd 0:0:0:0: [sda] scsi host busy 1 failed 0 >> sd 0:0:0:0: Notifying upper driver of completion (result 0) >> >> scsi_add_timer: scmd: efca82a0, time: 7500, (c0160660) >> sd 0:0:0:0: [sda] Send: 0xefca82a0 >> sd 0:0:0:0: [sda] CDB: Write(10): 2a 00 00 47 93 6f 00 02 40 00 >> buffer = 0xef5734c0, bufflen = 294912, done = 0xc016b194, queuecommand 0xc017ed34 >> leaving scsi_dispatch_cmnd() Nothing more - it hangs! This is really a nasty problem!!!! - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/