Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934568AbYCEMEp (ORCPT ); Wed, 5 Mar 2008 07:04:45 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1765878AbYCEMDh (ORCPT ); Wed, 5 Mar 2008 07:03:37 -0500 Received: from cdptpa-omtalb.mail.rr.com ([75.180.132.120]:45363 "EHLO cdptpa-omtalb.mail.rr.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933302AbYCEMDf (ORCPT ); Wed, 5 Mar 2008 07:03:35 -0500 Message-ID: <47CE8AEA.1060209@cfl.rr.com> Date: Wed, 05 Mar 2008 06:58:34 -0500 From: Mark Hounschell User-Agent: Thunderbird 2.0.0.9 (X11/20070801) MIME-Version: 1.0 To: Mike Christie CC: markh@compro.net, linux-scsi@vger.kernel.org, linux-kernel@vger.kernel.org, Tony Battersby Subject: Re: New 2.6.24.2 SG_IO SCSI problems References: <47BD9588.9080803@compro.net> <47BEFD4B.9060803@cs.wisc.edu> <47BEFF5F.9020002@cs.wisc.edu> <47BF0CE2.306@compro.net> <47BF40D7.5050301@compro.net> <47BF4BD0.7080008@cs.wisc.edu> In-Reply-To: <47BF4BD0.7080008@cs.wisc.edu> X-Enigmail-Version: 0.95.5 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3293 Lines: 94 Mike Christie wrote: > Mark Hounschell wrote: >> Mark Hounschell wrote: >>> Mike Christie wrote: >>>> Mike Christie wrote: >>>>> Mark Hounschell wrote: >>>>>> I seem to have run into some sort of regression in the SG_IO >>>>>> interface of 2.6.24.2. I have an application that up until 2.6.24 >>>>>> worked fine. The 2.6.23.16 kernel works fine. >>>>>> >>>>>> During reads I get these kernel messages. Writes and other functions >>>>>> _seem_ OK. Actually basic >>>>>> reads are working. Its with large BC reads using an io_vec list that >>>>>> the problem shows up. >>>>>> >>>>> Are you doing SG_IO to the sg device (/dev/sg*) or to the block device >>>>> (/dev/sdX)? >>>> If you are doing SG_IO to the sg device, then I know of one regression >>>> (well not regression exactly, but I fixed a bug but the patch got >>>> partially overwritten by another patch and that caused a new bug). Both >>>> bugs are fixed in 2.6.25-rc2. Could you try that out if you are doing >>>> SG_IO to the sg device. >>>> >>> Yes, I'm using /dev/sg*. And yes again I'll checkout 2.6.25-rc2 ASIC. >>> >>> Thanks >>> Mark >>> - >> >> 2.6.25-rc2 does fix the problem I'm having. I don't suppose there is a >> patch >> lying around for 2.6.24.2?? >> > > I attached a backport of the patch from Tony (added as cc) that is in > 2.6.25-rc2. Could you try it out against 2.6.24.2 just to make sure it > was this patch, then we can send it to stable. > >Mark Hounschell wrote: > >Sorry it took so long. This does fix my problem. I hope it's not to >late for 2.6.24.3 > Backport 76d78300a6eb8b7f08e47703b7e68a659ffc2053 to 2.6.24 >From Tony Battersby: When sending a SCSI command to a tape drive via the SCSI Generic (sg) driver, if the command has a data transfer length more than scatter_elem_sz (32 KB default) and not a multiple of 512, then I either hit BUG_ON(!valid_dma_direction(direction)) in dma_unmap_sg() or else the command never completes (depending on the LLDD). When constructing scatterlists, the sg driver rounds up the scatterlist element sizes to be a multiple of 512. This can result in sum(scatterlist lengths) > bufflen. In this case, scsi_req_map_sg() incorrectly sets bio->bi_size to sum(scatterlist lengths) rather than to bufflen. When the command completes, req_bio_endio() detects that bio->bi_size != 0, and so it doesn't call bio_endio(). This causes the command to be resubmitted, resulting in BUG_ON or the command never completing. This patch makes scsi_req_map_sg() set bio->bi_size to bufflen rather than to sum(scatterlist lengths), which fixes the problem. Signed-off-by: Mike Christie --- linux-2.6.24.2/drivers/scsi/scsi_lib.c 2008-02-10 23:51:11.000000000 -0600 +++ linux-2.6.24.2.work/drivers/scsi/scsi_lib.c 2008-02-22 16:20:09.000000000 -0600 @@ -298,7 +298,6 @@ static int scsi_req_map_sg(struct reques page = sg_page(sg); off = sg->offset; len = sg->length; - data_len += len; while (len > 0 && data_len > 0) { /* Did this ever get sent to the stable team? Regards Mark -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/