2003-06-06 04:22:57

by Grant

[permalink] [raw]
Subject: 2.4.21-rc7-ac1: zip ppa 'device set offline...' problem in scsi_error.c?

Hi there,

My system is AMD K6-2/400 with IDE HD & CDROM with Redhat 9
& updates installed.

I'm trying to track down why zip-disk ppa driver doesn't want to
work for me on 2.4.21-rc7-ac1, while working okay on 2.4.21-rc7.

The error is reported in response to the 'insmod ppa' command,
afterwards, 'mount /mnt/zip' or 'fdisk /dev/sda' fail with '...
/dev/sda4 is not a valid block device'.

Using 'echo "scsi log all" > /proc/scsi/scsi', both kernels
scan IDs 0 thru 7 and find the zip drive on ID 6. But the
AC kernel then takes the zip drive offline.

2.4.21-rc7-ac1 kernel taking zip ppa offline:

Jun 6 12:51:56 two kernel: Host status for host c74db8e0:
Jun 6 12:51:56 two kernel: Device 6 c7cb8800:
Jun 6 12:51:56 two kernel: Attached scsi removable disk sda at scsi0,
channel 0, id 6, lun 0
Jun 6 12:51:56 two kernel: scsi_do_req (host = 0, channel = 0 target = 6,
buffer =c02c4000, bufflen = 0, done = c01d3d40, timeout = 3000, retries = 5)
Jun 6 12:51:56 two kernel: command : 00 00 00 00 00 00
Jun 6 12:51:56 two kernel: Activating command for device 6 (1)
Jun 6 12:51:56 two kernel: Leaving scsi_init_cmd_from_req()
Jun 6 12:51:56 two kernel: Adding timer for command c7cb8600 at 3000
(c01d8940)
Jun 6 12:51:57 two kernel: scsi_dispatch_cmnd (host = 0, channel = 0, target
= 6, command = c7cb8658, buffer = c02c4000,
Jun 6 12:51:57 two kernel: bufflen = 0, done = c01d3d40)
Jun 6 12:51:57 two kernel: queuecommand : routine at c886c420
Jun 6 12:51:57 two kernel: leaving scsi_dispatch_cmnd()
Jun 6 12:51:57 two kernel: Leaving scsi_do_req()
Jun 6 12:51:57 two kernel: Clearing timer for command c7cb8600 1
Jun 6 12:51:57 two kernel: Command failed c7cb8600 2 active=1 busy=1
failed=0
Jun 6 12:51:57 two kernel: bh00:00: old sense key None
Jun 6 12:51:57 two kernel: Non-extended sense class 0 code 0x0
Jun 6 12:51:57 two kernel: Waking error handler thread (-1)
Jun 6 12:51:57 two kernel: Error handler waking up
Jun 6 12:51:57 two kernel: scsi_unjam_host: Checking to see if we need to
request sense
Jun 6 12:51:57 two kernel: scsi_unjam_host: Requesting sense for 6
Jun 6 12:51:57 two kernel: Adding timer for command c7cb8600 at 1000
(c01d8b00)
Jun 6 12:51:57 two kernel: In eh_done c7cb8600 result:0
Jun 6 12:51:57 two kernel: send_eh_cmnd: c7cb8600 eh_state:2002
Jun 6 12:51:57 two kernel: scsi_send_eh_cmnd: scsi_eh_completed_normally
2002
Jun 6 12:51:57 two kernel: Sense requested for c7cb8600 - result 2
Jun 6 12:51:57 two kernel: bh00:00: old sense key None
Jun 6 12:51:57 two kernel: Non-extended sense class 0 code 0x0
Jun 6 12:51:57 two kernel: Command to ID 6 failed
Jun 6 12:51:57 two kernel: Total of 1+0 commands on 1 devices require eh
work
Jun 6 12:51:57 two kernel: scsi_unjam_host: Checking to see if we want to
try abort
Jun 6 12:51:57 two kernel: scsi_unjam_host: Checking to see if we want to
try BDR
Jun 6 12:51:57 two kernel: scsi_unjam_host: Try hard bus reset
Jun 6 12:51:57 two kernel: Sleeping for timer tics 500
Jun 6 12:51:59 two kernel: Adding timer for command c7cb8600 at 1000
(c01d8b00)
Jun 6 12:51:59 two kernel: In eh_done c7cb8600 result:2
Jun 6 12:51:59 two kernel: send_eh_cmnd: c7cb8600 eh_state:2002
Jun 6 12:51:59 two kernel: scsi_send_eh_cmnd: scsi_eh_completed_normally
2003
Jun 6 12:51:59 two kernel: scsi_test_unit_ready: SCpnt c7cb8600 eh_state
2003
Jun 6 12:51:59 two kernel: scsi: device set offline - not ready or command
retry failed after bus reset: host 0 channel 0 id 6 lun 0


what 2.4.21-rc7 does at the same point:


Jun 6 13:37:39 two kernel: Host status for host c73aff40:
Jun 6 13:37:39 two kernel: Device 6 c1168e60:
Jun 6 13:37:39 two kernel: Attached scsi removable disk sda at scsi0,
channel 0, id 6, lun 0
Jun 6 13:37:39 two kernel: scsi_do_req (host = 0, channel = 0 target = 6,
buffer =c02e9000, bufflen = 0, done = c01d0aa0, timeout = 3000, retries = 5)
Jun 6 13:37:39 two kernel: command : 00 00 00 00 00 00
Jun 6 13:37:39 two kernel: Activating command for device 6 (1)
Jun 6 13:37:39 two kernel: Leaving scsi_init_cmd_from_req()
Jun 6 13:37:39 two kernel: Adding timer for command c5d6da00 at 3000
(c01d5600)
Jun 6 13:37:39 two kernel: scsi_dispatch_cmnd (host = 0, channel = 0, target
= 6, command = c5d6da58, buffer = c02e9000,
Jun 6 13:37:39 two kernel: bufflen = 0, done = c01d0aa0)
Jun 6 13:37:39 two kernel: queuecommand : routine at c8850420
Jun 6 13:37:39 two kernel: leaving scsi_dispatch_cmnd()
Jun 6 13:37:39 two kernel: Leaving scsi_do_req()
Jun 6 13:37:39 two kernel: Clearing timer for command c5d6da00 1
Jun 6 13:37:39 two kernel: Command failed c5d6da00 2 active=1 busy=1
failed=0
Jun 6 13:37:39 two kernel: bh00:00: old sense key None
Jun 6 13:37:39 two kernel: Non-extended sense class 0 code 0x0
Jun 6 13:37:39 two kernel: Waking error handler thread (-1)
Jun 6 13:37:39 two kernel: Error handler waking up
Jun 6 13:37:39 two kernel: scsi_unjam_host: Checking to see if we need to
request sense
Jun 6 13:37:39 two kernel: scsi_unjam_host: Requesting sense for 6
Jun 6 13:37:39 two kernel: Adding timer for command c5d6da00 at 1000
(c01d57c0)
Jun 6 13:37:39 two kernel: In eh_done c5d6da00 result:0
Jun 6 13:37:39 two kernel: send_eh_cmnd: c5d6da00 eh_state:2002
Jun 6 13:37:39 two kernel: scsi_send_eh_cmnd: scsi_eh_completed_normally
2002
Jun 6 13:37:39 two kernel: Sense requested for c5d6da00 - result 2
Jun 6 13:37:39 two kernel: Current bh00:00: sense key Unit Attention
Jun 6 13:37:39 two kernel: Additional sense indicates Power on,reset,or bus
device reset occurred
Jun 6 13:37:39 two kernel: Total of 0+0 commands on 0 devices require eh
work



Attached are the full log sections in case the excerpts above
don't give enough information.

Anybody shed some light on this? I'm not sure what to try next.

Thanks,
Grant.



Attachments:
grc_zip_ppa_logs.tar.gz (3.38 kB)