Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752064Ab3FWWOl (ORCPT ); Sun, 23 Jun 2013 18:14:41 -0400 Received: from mail-gh0-f180.google.com ([209.85.160.180]:38502 "EHLO mail-gh0-f180.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751606Ab3FWWOj (ORCPT ); Sun, 23 Jun 2013 18:14:39 -0400 Message-ID: <51C77344.2040907@gmail.com> Date: Sun, 23 Jun 2013 15:14:28 -0700 From: David Ahern User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.8; rv:17.0) Gecko/20130509 Thunderbird/17.0.6 MIME-Version: 1.0 To: Ingo Molnar CC: Matthew Wilcox , Jens Axboe , Al Viro , Ingo Molnar , linux-kernel@vger.kernel.org, linux-nvme@lists.infradead.org, linux-scsi@vger.kernel.org, Linus Torvalds , Andrew Morton , Peter Zijlstra , Thomas Gleixner Subject: Re: RFC: Allow block drivers to poll for I/O instead of sleeping References: <20130620201713.GV8211@linux.intel.com> <20130623100920.GA19021@gmail.com> In-Reply-To: <20130623100920.GA19021@gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1665 Lines: 36 On 6/23/13 3:09 AM, Ingo Molnar wrote: > If an IO driver is implemented properly then it will batch up requests for > the controller, and gets IRQ-notified on a (sub-)batch of buffers > completed. > > If there's any spinning done then it should be NAPI-alike polling: a > single "is stuff completed" polling pass per new block of work submitted, > to opportunistically interleave completion with submission work. > > I don't see where active spinning brings would improve performance > compared to a NAPI-alike technique. Your numbers obviously show a speedup > we'd like to have, I'm just wondering whether the same speedup (or even > more) could be implemented via: > > - smart batching that rate-limits completion IRQs in essence > + NAPI-alike polling > > ... which would almost never result in IRQ driven completion when we are > close to CPU-bound and while not yet saturating the IO controller's > capacity. > > The spinning approach you add has the disadvantage of actively wasting CPU > time, which could be used to run other tasks. In general it's much better > to make sure the completion IRQs are rate-limited and just schedule. This > (combined with a metric ton of fine details) is what the networking code > does in essence, and they have no trouble reaching very high throughput. Networking code has a similar proposal for low latency sockets using polling: https://lwn.net/Articles/540281/ David -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/