Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757432AbYBATjJ (ORCPT ); Fri, 1 Feb 2008 14:39:09 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1754080AbYBATi4 (ORCPT ); Fri, 1 Feb 2008 14:38:56 -0500 Received: from mx1.redhat.com ([66.187.233.31]:38512 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753845AbYBATiz (ORCPT ); Fri, 1 Feb 2008 14:38:55 -0500 Date: Fri, 01 Feb 2008 14:37:41 -0500 (EST) Message-Id: <20080201.143741.104030452.k-ueda@ct.jp.nec.com> To: petkovbb@gmail.com, petkovbb@googlemail.com Cc: jens.axboe@oracle.com, nai.xia@gmail.com, rdreier@cisco.com, bzolnier@gmail.com, flo@rfc822.org, linux-kernel@vger.kernel.org, j-nomura@ce.jp.nec.com, linux-ide@vger.kernel.org, k-ueda@ct.jp.nec.com Subject: Re: kernel BUG at ide-cd.c:1726 in 2.6.24-03863-g0ba6c33 && -g8561b089 From: Kiyoshi Ueda In-Reply-To: <20080201182909.GA7837@gollum.tnic> References: <20080201075117.GC4500@gollum.tnic> <20080201.123927.71085228.k-ueda@ct.jp.nec.com> <20080201182909.GA7837@gollum.tnic> X-Mailer: Mew version 4.2 on Emacs 21.4 / Mule 5.0 =?iso-2022-jp?B?KBskQjgtTFobKEIp?= Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2824 Lines: 61 Hi Boris, On Fri, 1 Feb 2008 19:29:09 +0100, Borislav Petkov wrote: > > > > end_that_request_last() is not called when __blk_end_reuqest() > > > > returns 1. Then, the issuer isn't waken up. > > > > So I think the BUG() or error messages should be there. > > > > > > you mean, end_that_request_last() isn't called when __end_that_request_first() > > > returns an error and this is the case only for fs and pc requests. > > > Otherwise it _is_ called, thus simulating somewhat the previous behavior. > > > However, we never BUG()'ged on residual byte counts before and > > > this driver has been in the kernel tree for ages, so what puzzles > > > me now is how is BUG()'ing here better than before and shouldn't we > > > simply issue a warning instead of killing the interrupt handler... > > > > The Jens' patch passes the residual byte counts to __blk_end_request(), > > so __end_that_reqeust_first() should never return 1 and we should never > > BUG() on the residual byte counts, unless inconsistency happens such as > > the size of remaining bios is bigger than the residual byte counts. > > yep. > > > So if __blk_end_request() returns 1 even with the Jens' patch, > > it means that the block layer or the driver really have a bug. > > And then, the request and the bios could leak or the issuer > > would wait forever because end_that_request_last() isn't called. > > > > The previous behavior might ignore such inconsistency and leak only > > the bios because it was calling end_that_request_last() anyway. > > I would like to BUG() in such cases personally, but I don't object > > strongly if you prefer not to BUG(). > > BUG() is definitely what we should do here to catch this case of sizeof(bios) > > rq->data_len. Putting a brown paper bag over the issue will never get it fixed > if it really leaks bios. Thanks for clarifying that. > > By the way, shouldn't we be doing a little branch prediction here: > > diff --git a/drivers/ide/ide-cd.c b/drivers/ide/ide-cd.c > index 74c6087..bee05a3 100644 > --- a/drivers/ide/ide-cd.c > +++ b/drivers/ide/ide-cd.c > @@ -1722,7 +1722,7 @@ static ide_startstop_t cdrom_newpc_intr(ide_drive_t *drive) > */ > if ((stat & DRQ_STAT) == 0) { > spin_lock_irqsave(&ide_lock, flags); > - if (__blk_end_request(rq, 0, 0)) > + if (unlikely(__blk_end_request(rq, 0, rq->data_len))) > BUG(); > HWGROUP(drive)->rq = NULL; > spin_unlock_irqrestore(&ide_lock, flags); I think it's reasonable. Thanks, Kiyoshi Ueda -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/