Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753032AbZKLMQo (ORCPT ); Thu, 12 Nov 2009 07:16:44 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752853AbZKLMQn (ORCPT ); Thu, 12 Nov 2009 07:16:43 -0500 Received: from [95.166.99.235] ([95.166.99.235]:54599 "EHLO kernel.dk" rhost-flags-FAIL-FAIL-OK-OK) by vger.kernel.org with ESMTP id S1752818AbZKLMQn (ORCPT ); Thu, 12 Nov 2009 07:16:43 -0500 Date: Thu, 12 Nov 2009 13:16:48 +0100 From: Jens Axboe To: Corrado Zoccolo Cc: Linux-Kernel , Jeff Moyer , aaronc@gelato.unsw.edu.au Subject: Re: [RFC, PATCH] cfq-iosched: remove redundant queuing detection code Message-ID: <20091112121648.GJ8742@kernel.dk> References: <200911101454.57522.czoccolo@gmail.com> <20091110151431.GI8742@kernel.dk> <4e5e476b0911100956u47bb3111jeba9fbcc3d992960@mail.gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4e5e476b0911100956u47bb3111jeba9fbcc3d992960@mail.gmail.com> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2523 Lines: 54 On Tue, Nov 10 2009, Corrado Zoccolo wrote: > On Tue, Nov 10, 2009 at 4:14 PM, Jens Axboe wrote: > > On Tue, Nov 10 2009, Corrado Zoccolo wrote: > >> The core block layer already has code to detect presence of command > >> queuing devices. We convert cfq to use that instead of re-doing the > >> computation. > > > > There's is the major difference that the CFQ variant is dynamic and the > > block layer one is not. This change came from Aaron some time ago IIRC, > > see commit 45333d5. It's a bit of a chicken and egg problem. > > The comment by Aaron: > CFQ's detection of queueing devices assumes a non-queuing device and detects > if the queue depth reaches a certain threshold. Under some workloads (e.g. > synchronous reads), CFQ effectively forces a unit queue depth, > thus defeating > the detection logic. This leads to poor performance on queuing hardware, > since the idle window remains enabled. > > makes me think that the dynamic-off detection in cfq may really be > buggy (BTW this could explain the bad results on SSD Jeff observed > before my patch set). > The problem is, that once the hw_tag is 0, it is difficult for it to > become 1 again, as explained by Aaron, since cfq will hardly send more > than 1 request at a time. My patch set fixes this for SSDs (the seeky > readers will still be sent without idling, and if they are enough, the > logic will see a large enough depth to reconsider the initial > decision). > > So the only sound way to do the detection is to start in an > indeterminate state, in which CFQ behaves as if hw_tag = 1, and then, > if for a long observation period we never saw large depth, we switch > to hw_tag = 0, otherwise we stick to hw_tag = 1, without reconsidering > it. That is probably the better way to do it, as I said earlier it is indeed a chicken and egg problem. Care to patch something like that up? > I think the correct logic could be pushed to the blk-core, by > introducing also an indeterminate bit. And I still don't think that is a good idea. The block layer case cares more about the capability side ("is this a good ssd?") where as the CFQ case incorporates process behaviour as well. I'll gladly take patches to improve the CFQ logic. -- Jens Axboe -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/