Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754883AbYJFRZN (ORCPT ); Mon, 6 Oct 2008 13:25:13 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753374AbYJFRY7 (ORCPT ); Mon, 6 Oct 2008 13:24:59 -0400 Received: from sh.osrg.net ([192.16.179.4]:58328 "EHLO sh.osrg.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752979AbYJFRY6 (ORCPT ); Mon, 6 Oct 2008 13:24:58 -0400 Date: Tue, 7 Oct 2008 02:24:29 +0900 To: jens.axboe@oracle.com Cc: James.Bottomley@HansenPartnership.com, knikanth@suse.de, linux-kernel@vger.kernel.org, linux-scsi@vger.kernel.org, fujita.tomonori@lab.ntt.co.jp Subject: Re: [PATCH] BUG: nr_phys_segments cannot be less than nr_hw_segments From: FUJITA Tomonori In-Reply-To: <20081002171356.GO19428@kernel.dk> References: <20081002165857.GL19428@kernel.dk> <1222967563.3222.30.camel@localhost.localdomain> <20081002171356.GO19428@kernel.dk> Mime-Version: 1.0 Content-Type: Text/Plain; charset=us-ascii Content-Transfer-Encoding: 7bit Message-Id: <20081007022358N.fujita.tomonori@lab.ntt.co.jp> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2280 Lines: 45 On Thu, 2 Oct 2008 19:13:57 +0200 Jens Axboe wrote: > On Thu, Oct 02 2008, James Bottomley wrote: > > On Thu, 2008-10-02 at 18:58 +0200, Jens Axboe wrote: > > > On Thu, Oct 02 2008, James Bottomley wrote: > > > > The bug would appear to be that we sometimes only look at q->max_sectors > > > > when deciding on mergability. Either we have to insist on max_sectors > > > > <= hw_max_sectors, or we have to start using min(q->max_sectors, > > > > q->max_hw_sectors) for this. > > > > > > q->max_sectors MUST always be <= q->max_hw_sectors, otherwise we could > > > be sending down requests that are too large for the device to handle. So > > > that condition would be a big bug. The sysfs interface checks for this, > > > and blk_queue_max_sectors() makes sure that is true as well. > > > > Yes, that seems always to be enforced. Perhaps there are other ways of > > tripping this problem ... I'm still sure if it occurs it's because we do > > a physical merge where a virtual merge is forbidden. > > > > > The fixes proposed still look weird. There is no phys vs hw segment > > > constraints, the request must adhere to the limits set by both. It's > > > mostly a moot point anyway, as 2.6.28 will get rid of the hw accounting > > > anyway. > > > > Agree all three proposed fixes look wrong. However, if it's what I > > think, just getting rid of hw accounting won't fix the problem because > > we'll still have the case where a physical merge is forbidden by iommu > > constraints ... this still needs to be accounted for. > > > > What we really need is a demonstration of what actually is going > > wrong ... > > Yep, IIRC we both asked for that the last time as well... Nikanth? Possibly, blk_phys_contig_segment might miscalculate q->max_segment_size? blk_phys_contig_segment does: req->biotail->bi_size + next_req->bio->bi_size > q->max_segment_size; But it's possible that req->biotail and the previous bio are supposed be merged into one segment? Then we could create too large segment here. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/