Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp3005817ybt; Mon, 29 Jun 2020 12:42:24 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxOq99oRRuGZ1HAzCjfC3vwqQhDlisvCw+U349y48B3wn8BfbmAjeXrNeMRAOvPdVoSPaEy X-Received: by 2002:aa7:cc19:: with SMTP id q25mr19103047edt.26.1593459744181; Mon, 29 Jun 2020 12:42:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1593459744; cv=none; d=google.com; s=arc-20160816; b=dc2LVgAKvIs3oJFQb1Prbw8i00PYPe3kbv/tG5WUn9xkbUYAouHuZoaj6HsOmF8dt+ bX/EbN+8Ik8XaTymYXH8YhQV1viNI0fsqq5kCSLsJTjpS7KnLbaNo4gncPXeFYX0qOxx UxIcESBBCYHorU9GC5KZlTgneDO3ydwCE5J0mv9YqVUo5Xopfn096P4nuHzUREOCG77w JQdY8KoF76VoFnZ82cumjtBaTRupVC5V45turL9bsX3IOVQXfzPAQS8E1ucsI2EWtbjl dcK4jjx6rLKo7ByMcnXjcJBeJSr6PTxUUacmNKY99CJQ37ruLX8DW8nK8s4Rv3fNtJjm uD/Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:message-id :subject:cc:to:from:date:dkim-signature; bh=2wqyJPcqu04LExWl3rfQTb3R3Bga8GbchwhiAesEhM0=; b=TfK7I+jt1DHnYRg7DIBbMQEC8L3/c8gSu3de6XqJTXX3U37vywEyUvc0b6jO2dubET hQWzIq2SQlwct2aWzLK3/s+JKaPCkS1SQWp/9Fwz2YaiS2tv1bdhpeZ+x3JIcbzXm5JU 2nV2IyfHyesEA/tcfJEdzbmf337g+zsVjuM7AoS8u1v4wo6iPD7nFunvsj74dsGmmAvL oxlqSlk6DzMOXm5G1kbzLjKu1sio+LdfQ8zy0FSHhKce1GmGX7aN0L1mrXLuou66z2b0 r2Y2Iy4rO8cvjE3pYoyFXfLL34jjQQ8xmnhMtxJLAdXwnwymIEhed6EZ6wqs30btGB2/ PjjQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="X/uofipC"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id ca11si337988ejb.252.2020.06.29.12.42.00; Mon, 29 Jun 2020 12:42:24 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="X/uofipC"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387531AbgF2Tjr (ORCPT + 99 others); Mon, 29 Jun 2020 15:39:47 -0400 Received: from mail.kernel.org ([198.145.29.99]:40570 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1733004AbgF2Tad (ORCPT ); Mon, 29 Jun 2020 15:30:33 -0400 Received: from paulmck-ThinkPad-P72.home (50-39-105-78.bvtn.or.frontiernet.net [50.39.105.78]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 46A3E251EB; Mon, 29 Jun 2020 15:35:11 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1593444911; bh=zhDviILGrP7MEy5wU5vTBeWiRXOj8U5WPd55gq7vL/Y=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=X/uofipCG3PEl+Pm/dKaREUsIwGJ6KOUa15O99jy7/cqQQQYaxZR2vU0LuTLguRsN 14OFjlHLJKCyts5b6zAz/WLyUiZXqYkHXrEOht1SNbIr161vxXjYZjbgq8zjmKqugk jz4Oo5T2A7VpDrvlMT7mrSGuQygCGjQ7AP17BHL8= Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 33EA83522641; Mon, 29 Jun 2020 08:35:11 -0700 (PDT) Date: Mon, 29 Jun 2020 08:35:11 -0700 From: "Paul E. McKenney" To: Nicholas Piggin Cc: Steven Rostedt , Anton Blanchard , linux-kernel@vger.kernel.org Subject: Re: [PATCH] ring-buffer: speed up buffer resets by avoiding synchronize_rcu for each CPU Message-ID: <20200629153511.GX9247@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20200625053403.2386972-1-npiggin@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200625053403.2386972-1-npiggin@gmail.com> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jun 25, 2020 at 03:34:03PM +1000, Nicholas Piggin wrote: > On a 144 thread system, `perf ftrace` takes about 20 seconds to start > up, due to calling synchronize_rcu() for each CPU. > > cat /proc/108560/stack > 0xc0003e7eb336f470 > __switch_to+0x2e0/0x480 > __wait_rcu_gp+0x20c/0x220 > synchronize_rcu+0x9c/0xc0 > ring_buffer_reset_cpu+0x88/0x2e0 > tracing_reset_online_cpus+0x84/0xe0 > tracing_open+0x1d4/0x1f0 > > On a system with 10x more threads, it starts to become an annoyance. > > Batch these up so we disable all the per-cpu buffers first, then > synchronize_rcu() once, then reset each of the buffers. This brings > the time down to about 0.5s. > > Cc: Paul McKenney > Cc: Anton Blanchard > Cc: Steven Rostedt > Cc: linux-kernel@vger.kernel.org > Signed-off-by: Nicholas Piggin Looks plausible from an RCU viewpoint: Acked-by: Paul E. McKenney > --- > include/linux/ring_buffer.h | 1 + > kernel/trace/ring_buffer.c | 85 +++++++++++++++++++++++++++++++------ > kernel/trace/trace.c | 4 +- > 3 files changed, 73 insertions(+), 17 deletions(-) > > diff --git a/include/linux/ring_buffer.h b/include/linux/ring_buffer.h > index c76b2f3b3ac4..136ea0997e6d 100644 > --- a/include/linux/ring_buffer.h > +++ b/include/linux/ring_buffer.h > @@ -143,6 +143,7 @@ bool ring_buffer_iter_dropped(struct ring_buffer_iter *iter); > unsigned long ring_buffer_size(struct trace_buffer *buffer, int cpu); > > void ring_buffer_reset_cpu(struct trace_buffer *buffer, int cpu); > +void ring_buffer_reset_online_cpus(struct trace_buffer *buffer); > void ring_buffer_reset(struct trace_buffer *buffer); > > #ifdef CONFIG_RING_BUFFER_ALLOW_SWAP > diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c > index b8e1ca48be50..3f1fd02bd14a 100644 > --- a/kernel/trace/ring_buffer.c > +++ b/kernel/trace/ring_buffer.c > @@ -270,6 +270,9 @@ EXPORT_SYMBOL_GPL(ring_buffer_event_data); > #define for_each_buffer_cpu(buffer, cpu) \ > for_each_cpu(cpu, buffer->cpumask) > > +#define for_each_online_buffer_cpu(buffer, cpu) \ > + for_each_cpu_and(cpu, buffer->cpumask, cpu_online_mask) > + > #define TS_SHIFT 27 > #define TS_MASK ((1ULL << TS_SHIFT) - 1) > #define TS_DELTA_TEST (~TS_MASK) > @@ -4484,6 +4487,26 @@ rb_reset_cpu(struct ring_buffer_per_cpu *cpu_buffer) > rb_head_page_activate(cpu_buffer); > } > > +/* Must have disabled the cpu buffer then done a synchronize_rcu */ > +static void reset_disabled_cpu_buffer(struct ring_buffer_per_cpu *cpu_buffer) > +{ > + unsigned long flags; > + > + raw_spin_lock_irqsave(&cpu_buffer->reader_lock, flags); > + > + if (RB_WARN_ON(cpu_buffer, local_read(&cpu_buffer->committing))) > + goto out; > + > + arch_spin_lock(&cpu_buffer->lock); > + > + rb_reset_cpu(cpu_buffer); > + > + arch_spin_unlock(&cpu_buffer->lock); > + > + out: > + raw_spin_unlock_irqrestore(&cpu_buffer->reader_lock, flags); > +} > + > /** > * ring_buffer_reset_cpu - reset a ring buffer per CPU buffer > * @buffer: The ring buffer to reset a per cpu buffer of > @@ -4492,7 +4515,6 @@ rb_reset_cpu(struct ring_buffer_per_cpu *cpu_buffer) > void ring_buffer_reset_cpu(struct trace_buffer *buffer, int cpu) > { > struct ring_buffer_per_cpu *cpu_buffer = buffer->buffers[cpu]; > - unsigned long flags; > > if (!cpumask_test_cpu(cpu, buffer->cpumask)) > return; > @@ -4503,24 +4525,42 @@ void ring_buffer_reset_cpu(struct trace_buffer *buffer, int cpu) > /* Make sure all commits have finished */ > synchronize_rcu(); > > - raw_spin_lock_irqsave(&cpu_buffer->reader_lock, flags); > + reset_disabled_cpu_buffer(cpu_buffer); > > - if (RB_WARN_ON(cpu_buffer, local_read(&cpu_buffer->committing))) > - goto out; > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > +} > +EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > > - arch_spin_lock(&cpu_buffer->lock); > +/** > + * ring_buffer_reset_cpu - reset a ring buffer per CPU buffer > + * @buffer: The ring buffer to reset a per cpu buffer of > + * @cpu: The CPU buffer to be reset > + */ > +void ring_buffer_reset_online_cpus(struct trace_buffer *buffer) > +{ > + struct ring_buffer_per_cpu *cpu_buffer; > + int cpu; > > - rb_reset_cpu(cpu_buffer); > + for_each_online_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > > - arch_spin_unlock(&cpu_buffer->lock); > + atomic_inc(&cpu_buffer->resize_disabled); > + atomic_inc(&cpu_buffer->record_disabled); > + } > > - out: > - raw_spin_unlock_irqrestore(&cpu_buffer->reader_lock, flags); > + /* Make sure all commits have finished */ > + synchronize_rcu(); > > - atomic_dec(&cpu_buffer->record_disabled); > - atomic_dec(&cpu_buffer->resize_disabled); > + for_each_online_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + reset_disabled_cpu_buffer(cpu_buffer); > + > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > + } > } > -EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > > /** > * ring_buffer_reset - reset a ring buffer > @@ -4528,10 +4568,27 @@ EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > */ > void ring_buffer_reset(struct trace_buffer *buffer) > { > + struct ring_buffer_per_cpu *cpu_buffer; > int cpu; > > - for_each_buffer_cpu(buffer, cpu) > - ring_buffer_reset_cpu(buffer, cpu); > + for_each_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + atomic_inc(&cpu_buffer->resize_disabled); > + atomic_inc(&cpu_buffer->record_disabled); > + } > + > + /* Make sure all commits have finished */ > + synchronize_rcu(); > + > + for_each_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + reset_disabled_cpu_buffer(cpu_buffer); > + > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > + } > } > EXPORT_SYMBOL_GPL(ring_buffer_reset); > > diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c > index ec44b0e2a19c..9a26a1c875ae 100644 > --- a/kernel/trace/trace.c > +++ b/kernel/trace/trace.c > @@ -2003,7 +2003,6 @@ static void tracing_reset_cpu(struct array_buffer *buf, int cpu) > void tracing_reset_online_cpus(struct array_buffer *buf) > { > struct trace_buffer *buffer = buf->buffer; > - int cpu; > > if (!buffer) > return; > @@ -2015,8 +2014,7 @@ void tracing_reset_online_cpus(struct array_buffer *buf) > > buf->time_start = buffer_ftrace_now(buf, buf->cpu); > > - for_each_online_cpu(cpu) > - ring_buffer_reset_cpu(buffer, cpu); > + ring_buffer_reset_online_cpus(buffer); > > ring_buffer_record_enable(buffer); > } > -- > 2.23.0 >