Received: by 10.223.185.116 with SMTP id b49csp2217435wrg; Thu, 15 Feb 2018 08:15:27 -0800 (PST) X-Google-Smtp-Source: AH8x226NiTIIsdy8KmjvaNPPYl/CFKq0EFQ6H2iomXaCndQfFWDwpk1CMIzhdr9r7t1A1RQtRY3+ X-Received: by 2002:a17:902:76cc:: with SMTP id j12-v6mr2975303plt.120.1518711327706; Thu, 15 Feb 2018 08:15:27 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1518711327; cv=none; d=google.com; s=arc-20160816; b=Q8Hp7KF+AvyS41HC4nPMqOeOATENW7bnNnOEzMvYm17YrPByJ0KXEf2wfECAymR7oo 1Ygtbbm0L1rZKDT5688qNOiu/USWR2jsD+7NDbRgRN57mUdD14k684VbfgfM2kfrBMva iLJYjKbxgQZRkRy2a7Y8x9H+j8sZ+P4KL5jt+ShL1oguUsvmsH0fXKJrOJkjns1YS8ei 91oU26jAKAEtPr0oop3XPy2QbKb+0GWtQL8yepjK3mejGoMIQ7+RWJX1uA6hI4JoGvDU JQ63xdt4tNswxbnmt0LpguovMq2+llVQ9s+xgcIbm8I/FHbHMrbVGhgEZpZ2Agc55TrH Enfw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dmarc-filter :arc-authentication-results; bh=8ETT7m15xical4pdQg36zP0F8aCe3bwTBiXwKirE8Kg=; b=Jo+e32bldOxoDcdTPasuCHWQVBsNWgLZQ5Tso7MiIuDylrIpGVJ13BNmNiuoDB/tK+ 1ZJlc96xGnD4Mc0WTG+qqrd3jUqEU3at5YxbFOgh+UwVLX1qijzXdubSMMJUF7VpFpTN UpVkL/X/cNiG/qNgpjW3K74oUw9Fjc7KLmsg1ljlXLOzq+ug9XyYWT07SQa/c4CE3qv1 MZAeacaw1cDk45y10TaKyWY7SoXS1+eggOka8aPe1bS1EZHp23s4tFltMlm3275UEmJy 92IcilE4kzsJMcKhmeRWn+fvAETrQk4xcY9UgDZNHBfIs1wxxGcUQq2njJyXbZDMvHRw qURA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id h1-v6si1627108plk.706.2018.02.15.08.15.12; Thu, 15 Feb 2018 08:15:27 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1424813AbeBOQN7 (ORCPT + 99 others); Thu, 15 Feb 2018 11:13:59 -0500 Received: from mail.kernel.org ([198.145.29.99]:39418 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1424375AbeBOQN5 (ORCPT ); Thu, 15 Feb 2018 11:13:57 -0500 Received: from localhost (i16-les03-th2-31-37-47-191.sfr.lns.abo.bbox.fr [31.37.47.191]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 1A304217D8; Thu, 15 Feb 2018 16:13:54 +0000 (UTC) DMARC-Filter: OpenDMARC Filter v1.3.2 mail.kernel.org 1A304217D8 Authentication-Results: mail.kernel.org; dmarc=none (p=none dis=none) header.from=kernel.org Authentication-Results: mail.kernel.org; spf=none smtp.mailfrom=frederic@kernel.org Date: Thu, 15 Feb 2018 17:13:52 +0100 From: Frederic Weisbecker To: Sebastian Andrzej Siewior Cc: LKML , Levin Alexander , Peter Zijlstra , Mauro Carvalho Chehab , Linus Torvalds , Hannes Frederic Sowa , "Paul E . McKenney" , Wanpeng Li , Dmitry Safonov , Thomas Gleixner , Andrew Morton , Paolo Abeni , Radu Rendec , Ingo Molnar , Stanislaw Gruszka , Rik van Riel , Eric Dumazet , David Miller Subject: Re: [RFC PATCH 2/4] softirq: Per vector deferment to workqueue Message-ID: <20180215161349.GA6956@lerouge> References: <1516376774-24076-1-git-send-email-frederic@kernel.org> <1516376774-24076-3-git-send-email-frederic@kernel.org> <20180208174450.qjvjy752jf4ngt2g@breakpoint.cc> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20180208174450.qjvjy752jf4ngt2g@breakpoint.cc> User-Agent: Mutt/1.5.24 (2015-08-30) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 08, 2018 at 06:44:52PM +0100, Sebastian Andrzej Siewior wrote: > On 2018-01-19 16:46:12 [+0100], Frederic Weisbecker wrote: > > diff --git a/kernel/softirq.c b/kernel/softirq.c > > index c8c6841..becb1d9 100644 > > --- a/kernel/softirq.c > > +++ b/kernel/softirq.c > > @@ -62,6 +62,19 @@ const char * const softirq_to_name[NR_SOFTIRQS] = { > … > > +static void vector_work_func(struct work_struct *work) > > +{ > > + struct vector *vector = container_of(work, struct vector, work); > > + struct softirq *softirq = this_cpu_ptr(&softirq_cpu); > > + int vec_nr = vector->nr; > > + int vec_bit = BIT(vec_nr); > > + u32 pending; > > + > > + local_irq_disable(); > > + pending = local_softirq_pending(); > > + account_irq_enter_time(current); > > + __local_bh_disable_ip(_RET_IP_, SOFTIRQ_OFFSET); > > + lockdep_softirq_enter(); > > + set_softirq_pending(pending & ~vec_bit); > > + local_irq_enable(); > > + > > + if (pending & vec_bit) { > > + struct softirq_action *sa = &softirq_vec[vec_nr]; > > + > > + kstat_incr_softirqs_this_cpu(vec_nr); > > + softirq->work_running = 1; > > + trace_softirq_entry(vec_nr); > > + sa->action(sa); > > You invoke the softirq handler while BH is disabled (not wrong, I just > state the obvious). That means, the scheduler can't preempt/interrupt > the workqueue/BH-handler while it is invoked so it has to wait until it > completes its doing. > In do_softirq_workqueue() you schedule multiple workqueue items (one for > each softirq vector) which is unnecessary because they can't preempt one > another and should be invoked the order they were enqueued. So it would > be enough to enqueue one item because it is serialized after all. So one > work_struct per CPU with a cond_resched_rcu_qs() while switching from one > vector to another should accomplish that what you have now here (not > sure if that cond_resched after each vector is needed). But… Makes sense. > > > + trace_softirq_exit(vec_nr); > > + softirq->work_running = 0; > > + } > > + > > + local_irq_disable(); > > + > > + pending = local_softirq_pending(); > > + if (pending & vec_bit) > > + schedule_work_on(smp_processor_id(), &vector->work); > > … on a system that is using system_wq a lot, it might introduced a certain > latency until your softirq-worker gets its turn. The workqueue will > spawn new workers if the current worker schedules out but until that > happens you have to wait. I am not sure if this is intended or whether > this might be a problem. I think you could argue either way depending on > what you currently think is more important. Indeed :) > Further, schedule_work_on(x, ) does not guarentee that the work item is > invoked on CPU x. It tries that but if CPU x goes down due to > CPU-hotplug then the workitem will be moved to random CPU. For that > reason we have work_on_cpu_safe() but you don't want to use that / flush > that workqueue while in here. Yeah, someone also reported me that hotplug issue. I didn't think workqueue would break the affinity but here it does. So we would need a hotplug hook indeed. > > May I instead suggest to stick to ksoftirqd? So you run in softirq > context (after return from IRQ) and if takes too long, you offload the > vector to ksoftirqd instead. You may want to play with the metric on > which you decide when you want switch to ksoftirqd / account how long a > vector runs. Yeah that makes sense. These workqueues are too much headaches eventually. I'm going to try that ksoftirqd thing. Thanks.