Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1765985AbYCDS2R (ORCPT ); Tue, 4 Mar 2008 13:28:17 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1761469AbYCDS16 (ORCPT ); Tue, 4 Mar 2008 13:27:58 -0500 Received: from accolon.hansenpartnership.com ([76.243.235.52]:51715 "EHLO accolon.hansenpartnership.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1759845AbYCDS14 (ORCPT ); Tue, 4 Mar 2008 13:27:56 -0500 Subject: Re: [PATCH] block: fix residual byte count handling From: James Bottomley To: Tejun Heo Cc: FUJITA Tomonori , efault@gmx.de, jens.axboe@oracle.com, fujita.tomonori@lab.ntt.co.jp, akpm@linux-foundation.org, linux-kernel@vger.kernel.org, linux-ide@vger.kernel.org, linux-scsi@vger.kernel.org, jgarzik@pobox.com, bzolnier@gmail.com In-Reply-To: <47CD7C05.1080707@gmail.com> References: <20080304093536.GH6704@kernel.dk> <1204634238.5997.5.camel@homer.simson.net> <47CD4355.4050401@gmail.com> <20080304223006A.tomof@acm.org> <47CD53BA.5040908@gmail.com> <47CD7611.2030606@gmail.com> <47CD7C05.1080707@gmail.com> Content-Type: text/plain Date: Tue, 04 Mar 2008 12:27:51 -0600 Message-Id: <1204655271.3091.58.camel@localhost.localdomain> Mime-Version: 1.0 X-Mailer: Evolution 2.12.3 (2.12.3-1.fc8) Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2535 Lines: 57 On Wed, 2008-03-05 at 01:42 +0900, Tejun Heo wrote: > Tejun Heo wrote: > > Tejun Heo wrote: > >> FUJITA Tomonori wrote: > >>>> Aiee... device going down after timing out on READ_DISC_INFO. That's > >>>> gruesome. Can you please try the other patches? > >>> Tejun, I thought that libata needs a fix for sum(sg) != rq->data_len. No? > >> The extra_len you added to qc->nbytes should be it. The only other > >> place to pay attention is the ATAPI transfer chunk size and your patch > >> seems to get it right. > >> > >>> Now Jens' git tree should work with all the non libata stuff, ide, > >>> firewire, bsg, etc. But I'm not sure about libata. > >> With the second patch, all others should be fine no matter what. I'll > >> go check libata part again. > > > > I can reproduce the problem here and it's very weird. I'll report back > > when I know more. > > Okay, I got it. Heh, it turns out SCSI and/or block layer is not > ready for rq->data_len != sum(sg). When adjusted command completes, > SCSI midlayer completes the command with rq->data_len for PC commands > which eventually ends up in __end_that_request_first(). As there are > extra sg area left after completing rq->data_len, blk layer says so to > SCSI layer and SCSI layer retries the command only with the appended > area. > > The following patch gets the writing going. I really think it's a > serious mistake to break rq->data_len == sum(sg). If we break > rq->data_len == requested size, the worst bugs are giving wrong size > when issuing commands to application layer of devices which is > relatively easy to spot and not all that command anyway. Breaking > rq->data_len == sum(sg), bugs will be in internal mechanics, DMA > engine programming and transport layer. Oh well... > > diff --git a/drivers/scsi/scsi.c b/drivers/scsi/scsi.c > index fecba05..32439ac 100644 > --- a/drivers/scsi/scsi.c > +++ b/drivers/scsi/scsi.c > @@ -757,7 +757,7 @@ void scsi_finish_command(struct scsi_cmnd *cmd) > "Notifying upper driver of completion " > "(result %x)\n", cmd->result)); > > - good_bytes = scsi_bufflen(cmd); > + good_bytes = scsi_bufflen(cmd) + cmd->request->data_len; This doesn't look right. scsi_bufflen(cmd) is req->data_len for PC commands ... did you mean to add extra_len here? James -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/