Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1946067AbXBBU4g (ORCPT ); Fri, 2 Feb 2007 15:56:36 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1946068AbXBBU4f (ORCPT ); Fri, 2 Feb 2007 15:56:35 -0500 Received: from avexch1.qlogic.com ([198.70.193.115]:52420 "EHLO avexch1.qlogic.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1946067AbXBBU4f (ORCPT ); Fri, 2 Feb 2007 15:56:35 -0500 Date: Fri, 2 Feb 2007 12:56:30 -0800 From: Andrew Vasquez To: Andrew Morton Cc: Linux Kernel Mailing List , Linux-SCSI Mailing List , Nigel Kirkland Subject: Re: [BUG] Unable to handle kernel NULL pointer dereference...as_move_to_dispatch+0x11/0x135 Message-ID: <20070202205630.GI3737@andrew-vasquezs-computer.local> References: <20070122183510.GA19905@andrew-vasquezs-computer.local> <20070201232302.905962b6.akpm@linux-foundation.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20070201232302.905962b6.akpm@linux-foundation.org> Organization: QLogic Corporation User-Agent: Mutt/1.5.13 (2006-08-11) X-OriginalArrivalTime: 02 Feb 2007 20:56:34.0475 (UTC) FILETIME=[9F83E7B0:01C7470C] Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2352 Lines: 62 On Thu, 01 Feb 2007, Andrew Morton wrote: > On Mon, 22 Jan 2007 10:35:10 -0800 Andrew Vasquez wrote: > > Basically what is happening from the FC side is the initiator executes > > a simple dt test: > > > > dt of=/dev/raw/raw1 procs=8 oncerr=abort bs=16k disable=stats limit=2m passes=1000000 pattern=iot dlimit=2048 > > > > against a single lun (a very basic Windows target mode driver). > > During the test a port-enable, port-disable script is running agains > > the switch's port that is connected to the target (this occurs every > > sixty seconds (for a disabled duration of 2 seconds). Additionally, > > the target itself is set to LOGO (logout) or drop off the topology > > every 30 seconds. > > I don't understand what effect the port-enable/port-disable has upon the > system. Will it cause I/O errors, or what? No I/O errors should make there way to the upper-layers (block/FS). The system *should* be shielded from the fibre-channel fabric events. I just wanted to explain what the (basic sanity) test did. > > This test runs fine up to 2.6.19. > > One thing we did in there was to give direct-io-against-blockdevs some > special-case bio-preparation code. Perhaps this is tickling a bug somehow. > > We can revert that change like this: > > > diff -puN fs/block_dev.c~a fs/block_dev.c > --- a/fs/block_dev.c~a > +++ a/fs/block_dev.c > @@ -196,8 +196,47 @@ static void blk_unget_page(struct page * > pvec->page[--pvec->idx] = page; > } > > +static int > +blkdev_get_blocks(struct inode *inode, sector_t iblock, > + struct buffer_head *bh, int create) ... Hmm, with this patch we've noted two main differences: 1) I/O throughput with the basic 'dd' command used (above) is back to 60MB/s, rather than the appalling 20-22 MB/s we were seeing with 2.6.20-rcX. 2) No panics -- so far with 2+ hours of testing. With our vanilla system of 2.6.20-rc7, the test could trigger the panic within 15 to 20 minutes. We'll let this run over the weekend -- I'll certainly let you know if anything has changed (failures). -- Andrew Vasquez - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/