Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1758813AbZLGTbD (ORCPT ); Mon, 7 Dec 2009 14:31:03 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1758774AbZLGTbA (ORCPT ); Mon, 7 Dec 2009 14:31:00 -0500 Received: from g4t0014.houston.hp.com ([15.201.24.17]:37392 "EHLO g4t0014.houston.hp.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1758794AbZLGTa7 convert rfc822-to-8bit (ORCPT ); Mon, 7 Dec 2009 14:30:59 -0500 From: "Miller, Mike (OS Dev)" To: Jens Axboe , =?iso-8859-1?Q?Ozan_=C7a=3F=3Flayan?= CC: linux-kernel , "scameron@beardog.cce.hp.com" Date: Mon, 7 Dec 2009 19:30:37 +0000 Subject: RE: CCISS performance drop in buffered disk reads in newer kernels Thread-Topic: CCISS performance drop in buffered disk reads in newer kernels Thread-Index: Acp3bKf29YqxjjeQRE2QWZuVKELENQABE3Ig Message-ID: <0F5B06BAB751E047AB5C87D1F77A77886994120A79@GVW0547EXC.americas.hpqcorp.net> References: <4B1CDCE6.5040502@pardus.org.tr> <0F5B06BAB751E047AB5C87D1F77A778869941207A5@GVW0547EXC.americas.hpqcorp.net> <4B1D47C5.7030804@pardus.org.tr> <20091207183946.GE8742@kernel.dk> In-Reply-To: <20091207183946.GE8742@kernel.dk> Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: acceptlanguage: en-US Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2543 Lines: 76 > -----Original Message----- > From: Jens Axboe [mailto:jens.axboe@oracle.com] > Sent: Monday, December 07, 2009 12:40 PM > To: Ozan ?a??layan > Cc: Miller, Mike (OS Dev); linux-kernel; scameron@beardog.cce.hp.com > Subject: Re: CCISS performance drop in buffered disk reads in > newer kernels > > On Mon, Dec 07 2009, Ozan ?a??layan wrote: > > Miller, Mike (OS Dev) wrote: > > > Ozan, > > > I'm aware of the performance drop. Please see: > http://bugzilla.kernel.org/show_bug.cgi?id=13127. I removed > the huge read ahead value of 1024 that we used because users > were complaining about small writes being starved. That was > back around the 2.6.25 timeframe. Since that timeframe there > have no changes in the main i/o path. I'll get back on this > as time allows. > > > > > > Meanwhile, you can tweak some of the block layer tunables as such. > > > > > > echo 64 > /sys/block/cciss\!c0d1/queue/read_ahead_kb > > > OR > > > blockdev --setra 128 /dev/cciss/c0d1 > > > > > > These are just example values. There is also > max_hw_sectors_kb and max_sectors_kb that be adjusted. > > > > > > > Hi, > > > > Actually the "#define READ_AHEAD 1024" was removed on March > 2008 which > > was included in the 2.6.25.y tree so 2.6.25.20 has 128kB read_ahead > > value too. > > > > *But* setting read_ahead to 2048 increases buffered disk > read average > > from 60~MB/s to 190~MB/s hence the kernel compile time > drops to 2 minutes. > > > > So maybe the regression/change is in another place? > > > > The server is just a compile-farm so it's triggered by > hand, compiles > > distribution's packages and stays idle until the next > compilation queue. > > Is it safe/OK to use that 2048kB read_ahead value for such workload? > > Yes, it's definitely safe. I agree. > > > (max_hw_sectors_kb is 512 on my 2.6.25.20 setup and 1024 on > 2.6.30.9 > > but it seems that it's read-only) > > The *_hw_* values are the driver exported hardware limits, so > they are always read-only. Ahhh, I didn't know that. There is also an nr_requests attribute which to me implies limiting requests somewhere. The value of nr_request is 128 but the max commands to the cciss controllers exceed that value. What is nr_request supposed to do? -- mikem > > -- > Jens Axboe > > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/