Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752860AbZG3WsJ (ORCPT ); Thu, 30 Jul 2009 18:48:09 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752443AbZG3WsI (ORCPT ); Thu, 30 Jul 2009 18:48:08 -0400 Received: from smtp-out.google.com ([216.239.45.13]:61780 "EHLO smtp-out.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752304AbZG3WsH (ORCPT ); Thu, 30 Jul 2009 18:48:07 -0400 DomainKey-Signature: a=rsa-sha1; s=beta; d=google.com; c=nofws; q=dns; h=mime-version:in-reply-to:references:date:message-id:subject:from:to: cc:content-type:content-transfer-encoding:x-system-of-record; b=uhb5azQE1WFO8del9lWakaLpikyt/Nav6zA7w+dcDarDJi0YlkHQMffxstG9bvqoO shJtN07SPESQbiKjKQ6IQ== MIME-Version: 1.0 In-Reply-To: <20090730224308.GJ12579@kernel.dk> References: <1786ab030907281211x6e432ba6ha6afe9de73f24e0c@mail.gmail.com> <20090730213956.GH12579@kernel.dk> <33307c790907301501v4c605ea8oe57762b21d414445@mail.gmail.com> <20090730221727.GI12579@kernel.dk> <33307c790907301534v64c08f59o66fbdfbd3174ff5f@mail.gmail.com> <20090730224308.GJ12579@kernel.dk> Date: Thu, 30 Jul 2009 15:48:02 -0700 Message-ID: <33307c790907301548t2ef1bb72k4adbe81865d2bde9@mail.gmail.com> Subject: Re: Bug in kernel 2.6.31, Slow wb_kupdate writeout From: Martin Bligh To: Jens Axboe Cc: Chad Talbott , linux-kernel@vger.kernel.org, linux-mm@kvack.org, wfg@mail.ustc.edu.cn, Michael Rubin , sandeen@redhat.com, Andrew Morton , Peter Zijlstra Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-System-Of-Record: true Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2250 Lines: 44 On Thu, Jul 30, 2009 at 3:43 PM, Jens Axboe wrote: > On Thu, Jul 30 2009, Martin Bligh wrote: >> > The test case above on a 4G machine is only generating 1G of dirty data. >> > I ran the same test case on the 16G, resulting in only background >> > writeout. The relevant bit here being that the background writeout >> > finished quickly, writing at disk speed. >> > >> > I re-ran the same test, but using 300 100MB files instead. While the >> > dd's are running, we are going at ~80MB/sec (this is disk speed, it's an >> > x25-m). When the dd's are done, it continues doing 80MB/sec for 10 >> > seconds or so. Then the remainder (about 2G) is written in bursts at >> > disk speeds, but with some time in between. >> >> OK, I think the test case is sensitive to how many files you have - if >> we punt them to the back of the list, and yet we still have 299 other >> ones, it may well be able to keep the disk spinning despite the bug >> I outlined.Try using 30 1GB files? > > If this disk starts spinning, then we have bigger bugs :-) >> >> Though it doesn't seem to happen with just one dd streamer, and >> I don't see why the bug doesn't trigger in that case either. >> >> I believe the bugfix is correct independent of any bdi changes? > > Yeah I think so too, I'll run some more tests on this tomorrow and > verify it there as well. There's another issue I was discussing with Peter Z. earlier that the bdi changes might help with - if you look at where the dirty pages get to, they are capped hard at the average of the dirty and background thresholds, meaning we can only dirty about half the pages we should be able to. That does very slowly go away when the bdi limit catches up, but it seems to start at 0, and it's progess seems glacially slow (at least if you're impatient ;-)) This seems to affect some of our workloads badly when they have a sharp spike in dirty data to one device, they get throttled heavily when they wouldn't have before the per-bdi dirty limits. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/