Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933816AbeAKUWl (ORCPT + 1 other); Thu, 11 Jan 2018 15:22:41 -0500 Received: from mail-it0-f67.google.com ([209.85.214.67]:37573 "EHLO mail-it0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S933043AbeAKUWk (ORCPT ); Thu, 11 Jan 2018 15:22:40 -0500 X-Google-Smtp-Source: ACJfBos+A3kBPPCkVzHsf1/lKC3lz+IgxIY0cf2mP/AxRVhbJ39nsQWgS7lXkG8h8VFqPL2Zg4tu7jyLL+DOVfQGXU8= MIME-Version: 1.0 In-Reply-To: References: <20180109133623.10711-1-dima@arista.com> <20180109133623.10711-2-dima@arista.com> <1515620880.3350.44.camel@arista.com> <20180111032232.GA11633@lerouge> <20180111044456.GC11633@lerouge> <1515681091.3039.21.camel@arista.com> <20180111163204.GE6176@hirez.programming.kicks-ass.net> From: Linus Torvalds Date: Thu, 11 Jan 2018 12:22:38 -0800 X-Google-Sender-Auth: KInoIf7mZ_igKLq6IV5SPjJnSw0 Message-ID: Subject: Re: [RFC 1/2] softirq: Defer net rx/tx processing to ksoftirqd context To: Eric Dumazet Cc: Peter Zijlstra , Dmitry Safonov , Frederic Weisbecker , LKML , Dmitry Safonov <0x7f454c46@gmail.com>, Andrew Morton , David Miller , Frederic Weisbecker , Hannes Frederic Sowa , Ingo Molnar , "Levin, Alexander (Sasha Levin)" , Paolo Abeni , "Paul E. McKenney" , Radu Rendec , Rik van Riel , Stanislaw Gruszka , Thomas Gleixner , Wanpeng Li Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Return-Path: On Thu, Jan 11, 2018 at 12:16 PM, Eric Dumazet wrote: > > Note that when I implemented TCP Small queues, I did experiments between > using a work queue or a tasklet, and workqueues added unacceptable P99 > latencies, when many user threads are competing with kernel threads. Yes. So I think one solution might be to have a hybrid system, where we do the softirq's synchronously normally (which is what you really want for good latency). But then fall down on a threaded model - but that fallback case should be per-softirq, not global. So if one softirq uses a lot of CPU time, that shouldn't affect the latency of other softirqs. So maybe we could get rid of the per-cpu ksoftirqd entirely, and replace it with with per-cpu and per-softirq workqueues? Would something like that sound sane? Just a SMOP/SMOT (small matter of programming/testing). Linus