From: Jeff Moyer
To: Vivek Goyal
Cc: Corrado Zoccolo, axboe@kernel.dk, Linux-Kernel
Subject: Re: [PATCH 0/2] cfq-iosched: fixing RQ_NOIDLE handling.
Date: Tue, 13 Jul 2010 16:30:23 -0400
In-Reply-To: <20100713195650.GA21044@redhat.com> (Vivek Goyal's message of "Tue, 13 Jul 2010 15:56:50 -0400")

Vivek Goyal writes:

> On Tue, Jul 13, 2010 at 03:38:11PM -0400, Jeff Moyer wrote:
>> Corrado Zoccolo writes:
>>
>> > Can you test the attached patch, where I also added your changes to
>> > make jbd(2) perform sync writes?
>>
>> I got new storage, so I have new numbers. I only re-ran deadline and
>> vanilla cfq for the fs_mark-only test. The average of 10 runs comes
>> out like so:
>>
>>   deadline:    571.98
>>   vanilla cfq: 107.42
>>   patched cfq: 460.9
>>
>> Mixed workload results with your suggested patch:
>>
>>   fs_mark: 15.65 files/sec
>>   fio:     132.5 MB/s
>>
>> So, again, not looking great for the mixed workload, but the patch
>> does improve the fs_mark-only case. Looking at the blktrace data
>> shows that the jbd2 thread preempts the fs_mark thread at all the
>> right times. The only thing holding throughput back is the whole
>> notion that we need to dispatch from only one queue (even though the
>> storage is capable of serving both the reads and writes
>> simultaneously).
>>
>> I added in the patch that allows simultaneous dispatch of both reads
>> and writes, and here are the results from that run:
>>
>>   fs_mark: 15.975 files/sec
>>   fio:     132.4 MB/s
>>
>> So, it looks like that didn't help. The reason this patch doesn't
>> come close to the yield patch in the mixed workload is that the
>> yield patch set allows the fs_mark process to continue to issue I/O.
>> With your patch, the fs_mark process does 64KB of I/O, the jbd2
>> thread does the journal commit, and then the fio process runs again.
>> Given that the fs_mark process typically only uses a small fraction
>> of its time slice, you end up with an unfair balance.
>
> Hi Jeff,
>
> This is a little strange. Given that both the fs_mark and jbd threads
> are now on the sync-noidle tree, we should have idled on the
> sync-noidle tree to provide fairness, and that should have made sure
> that fs_mark/jbd do more IO and the slice is not lost to the fio
> thread.
>
> Not sure what is happening in practice, though. Only you can look at
> the traces more closely and see if the timer is being armed or not.

Vivek, if you want to look at the traces, just ask. I'd be happy to
show them to you, upload them, whatever. I'm not sure why you think
otherwise (though I wouldn't blame you for not wanting to look at
them!).
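As for the timer itself: the arming decision lives in
cfq_arm_slice_timer() in block/cfq-iosched.c, and each arming is logged
via cfq_log as an "arm_idle" message, so it should be easy to spot in
blkparse output. Very roughly, the conditions come down to something
like the following (a simplified user-space sketch with invented names,
not the actual kernel code):

#include <stdbool.h>
#include <stdio.h>

/* simplified stand-in for the cfq_data/cfq_queue state that matters */
struct queue_state {
        bool sync;           /* sync queue (reads, sync writes)? */
        bool drained;        /* all of this queue's requests dispatched? */
        bool last_on_tree;   /* last queue on the sync-noidle tree? */
        unsigned slice_idle; /* the slice_idle tunable; 0 disables idling */
};

/* arm the idle timer only when waiting can actually buy us anything */
static bool arm_idle_timer(const struct queue_state *q)
{
        if (!q->slice_idle)  /* idling disabled outright */
                return false;
        if (!q->sync)        /* never idle for async queues */
                return false;
        if (!q->drained)     /* still work queued: keep dispatching */
                return false;
        return q->last_on_tree;
}

int main(void)
{
        struct queue_state fs_mark = {
                .sync = true, .drained = true,
                .last_on_tree = true, .slice_idle = 8,
        };

        printf("arm idle timer: %s\n",
               arm_idle_timer(&fs_mark) ? "yes" : "no");
        return 0;
}

So the timer is only armed once the active queue has drained and idling
is expected to pay off; the interesting question is what happens when a
barrier gets in first, which is exactly the case here.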
Now, to answer your question: the jbd2 thread runs and issues a
barrier, which causes a forced dispatch of requests. After that, a new
queue is selected, and since the fs_mark thread is blocked on the
journal commit, it is always the fio process that gets to run. This,
of course, raises the question of why the blk_yield patches didn't run
into the same problem. Looking back at some saved traces, I don't see
WBS (write barrier sync) requests, so I wonder whether barriers simply
weren't supported by my last storage system.

Cheers,
Jeff
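P.S. To make the queue-selection point concrete, here is a toy model
(plain user-space C with invented names; the real logic is spread
across cfq_select_queue() and friends) of why fio wins every round once
the barrier has drained everything and fs_mark has nothing queued:

#include <stdbool.h>
#include <stdio.h>

struct queue {
        const char *name;
        bool has_requests; /* anything queued at selection time? */
};

/*
 * After a forced dispatch the old slice is gone, so selection boils
 * down to "pick a queue that has requests queued right now".
 */
static struct queue *select_queue(struct queue *qs, int n)
{
        for (int i = 0; i < n; i++)
                if (qs[i].has_requests)
                        return &qs[i];
        return NULL; /* nothing queued anywhere */
}

int main(void)
{
        struct queue qs[] = {
                { "fs_mark", false }, /* blocked on the journal commit */
                { "fio",     true  }, /* streaming job, always has I/O */
        };

        for (int round = 0; round < 3; round++) {
                struct queue *q = select_queue(qs, 2);
                printf("round %d: %s\n", round, q ? q->name : "(idle)");
        }
        return 0;
}

Until the journal commit completes and fs_mark gets to queue more I/O,
there is simply nothing else for the scheduler to pick.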