Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Wed, 4 Sep 2002 12:21:05 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Wed, 4 Sep 2002 12:20:58 -0400 Received: from users.linvision.com ([62.58.92.114]:39556 "EHLO abraracourcix.bitwizard.nl") by vger.kernel.org with ESMTP id ; Wed, 4 Sep 2002 12:20:54 -0400 Date: Wed, 4 Sep 2002 18:25:05 +0200 From: Rogier Wolff To: Alan Cox Cc: Andries Brouwer , James Bottomley , "Justin T. Gibbs" , linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org Subject: Re: aic7xxx sets CDR offline, how to reset? Message-ID: <20020904182505.B20614@bitwizard.nl> References: <20020903171321.A12201@redhat.com> <200209032148.g83LmeP09177@localhost.localdomain> <20020904103737.GA9936@win.tue.nl> <1031138593.2788.40.camel@irongate.swansea.linux.org.uk> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1031138593.2788.40.camel@irongate.swansea.linux.org.uk> User-Agent: Mutt/1.3.22.1i Organization: BitWizard.nl Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2044 Lines: 43 On Wed, Sep 04, 2002 at 12:23:13PM +0100, Alan Cox wrote: > On Wed, 2002-09-04 at 11:37, Andries Brouwer wrote: > > The scsi error recovery has many bad properties, but one is its slowness. > > Once it gets triggered on a machine with SCSI disks it is common to > > have a dead system for several minutes. I have not yet met a situation > > in which rebooting was not preferable above scsi error recovery, > > especially since the attempt to recover often fails. > > Well I for one prefer the scsi timeout/abort sequence on a CD getting > confused badly by a bad block (as at least some of my drives do) to a > reboot everytime I get bad media Reboot is bad. Retries are bad. Errors should be returned to an upper layer, with an error code: "may retry", or "will never work". (Like in SMTP) I will most likely set the "retry count" to 0: Never retry. Almost never works anyway. And the disk already retried manytimes, so retrying in software is only "taking time". We do datarecovery around here. We get bad disks on a dayly basis. We are currently reading a drive that gets over 10Mb per second while spitting out bad block reports! Thing is: those blocks that didn't work first time, may work on our second retry. However, we need userspace control over that retry. We prefer to get the 18G worth of data off the disk first, and only then retry the blocks that happened to be bad first time around. Roger. -- ** R.E.Wolff@BitWizard.nl ** http://www.BitWizard.nl/ ** +31-15-2137555 ** *-- BitWizard writes Linux device drivers for any device you may have! --* * The Worlds Ecosystem is a stable system. Stable systems may experience * * excursions from the stable situation. We are currenly in such an * * excursion: The stable situation does not include humans. *************** - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/