Received: by 2002:a05:6902:102b:0:0:0:0 with SMTP id x11csp111235ybt; Thu, 25 Jun 2020 16:43:53 -0700 (PDT) X-Google-Smtp-Source: ABdhPJyvH8Y9yWuBHcQelKpnGZM6yXh7NyZz396fXuY3xnVLIs1lKBSbSh8s0d+RE++U8OICtAIA X-Received: by 2002:a17:906:7043:: with SMTP id r3mr213153ejj.270.1593128633547; Thu, 25 Jun 2020 16:43:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1593128633; cv=none; d=google.com; s=arc-20160816; b=vjsXMriPaoBfEcGyY8hDfD5kHgyLMff08HR+jXxhYCCQd2hqZMHdhdcfy/HlL3uIdq xKcUFkU7PMEQE1cZJxPCZxYVMYN96ESHE6LfN/EZpGVhjdhYeuEjZJgtD1LSXTW90US4 WyFCrBwH9tTCCWylTNYTFTeMXXjxavQYWN3RF2sN0Ar8tC+AQJMGfiWrNdKAIbXaQ1bw Ndddt+rJJaTkBnvwAPQEv+Q7k8LJNClVHAe68HS2PYkiCfIoDiQDWcu6qn+/nNCmnUL5 HGYBHjL0ph83ZpXhi87AAvSuItOG/Ov8Pl5FSM16loazFOIL/BDyITkxxB7mKtgNNLCn CfXA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=7iVDWRLvq9g04MYf+QdjKrd6CmNW3i/0tCRaA5tI1Pw=; b=qAr/5HnOUfQRKLWcA0C6iSknM7yOkFZFk9uP9l5/2Aqe9cl0PC2aGjLe/zF/NXMDLe o3BN052JG4PpVQ2vMwb9ctre3bQrOdjOatsvpCmtnIMDsmXbH/iMU53Cphrbx+kEMQnd AqoQmeF3hF8zKm2Z8n0wihDgUw7O0dgSCbrC97ZuJiTFhd+crPE/KUSGSUHv8fkBFRQr axInaAKXUlx62n3ROSvheUPObYFwjIrUChvCl+3Dz9HpoVRHEaHfjKTb/f2GPzSmR+Kf I/u6LKwDJM22DhBmV/hm12s3jFZM5iMt4ClLp1oQM3KuvlJ24ihgYwGQFrZHQCqzZ2e+ 00ig== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ozlabs.org header.s=201707 header.b=g34XUlIg; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ozlabs.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id u7si16013756ejn.279.2020.06.25.16.43.30; Thu, 25 Jun 2020 16:43:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@ozlabs.org header.s=201707 header.b=g34XUlIg; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ozlabs.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2407311AbgFYVmQ (ORCPT + 99 others); Thu, 25 Jun 2020 17:42:16 -0400 Received: from ozlabs.org ([203.11.71.1]:41119 "EHLO ozlabs.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2404413AbgFYVmQ (ORCPT ); Thu, 25 Jun 2020 17:42:16 -0400 Received: from authenticated.ozlabs.org (localhost [127.0.0.1]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange ECDHE (P-256) server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by mail.ozlabs.org (Postfix) with ESMTPSA id 49tD4h6CJcz9sRN; Fri, 26 Jun 2020 07:42:12 +1000 (AEST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ozlabs.org; s=201707; t=1593121333; bh=4UNLjre3Xn360HqOwEsVPDunhq9Y34/xjAnLIBnQKNE=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=g34XUlIgJfPNZychWKLR39cTICwznyGX/FJjzIot4VGiaHyL6hk6QxFapJ0jWpD5t EHX+qroK5i0b0rvNtJy4YPQI6p1RXx87uPDEFbBVERkQexbXiSDqFS6RjkJ8JkevFN BhvanVMhqm4f+Htyy6pxDdh/CdD1ap25tgsGr8LGJRgakjLPVEweaeb1da7qAQB00D TYZidOViDtKCMiuyFU9M6w/ydEeDYzaqwqcG7xdSW5P8G5bN5+TGWAlBVwVdTygk7n i+Nhv86QzWg2cOLykT6VVS3lqI6e8BLxqVCAGFwX93U3BzQ3blUXkc1cJxHsWUnY4v HWbsQQB45zKKQ== Date: Fri, 26 Jun 2020 07:42:10 +1000 From: Anton Blanchard To: Nicholas Piggin Cc: Steven Rostedt , Paul McKenney , linux-kernel@vger.kernel.org Subject: Re: [PATCH] ring-buffer: speed up buffer resets by avoiding synchronize_rcu for each CPU Message-ID: <20200626074210.5d842560@kryten.localdomain> In-Reply-To: <20200625053403.2386972-1-npiggin@gmail.com> References: <20200625053403.2386972-1-npiggin@gmail.com> X-Mailer: Mutt/1.8.0 (2017-02-23) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Nick, > On a 144 thread system, `perf ftrace` takes about 20 seconds to start > up, due to calling synchronize_rcu() for each CPU. > > cat /proc/108560/stack > 0xc0003e7eb336f470 > __switch_to+0x2e0/0x480 > __wait_rcu_gp+0x20c/0x220 > synchronize_rcu+0x9c/0xc0 > ring_buffer_reset_cpu+0x88/0x2e0 > tracing_reset_online_cpus+0x84/0xe0 > tracing_open+0x1d4/0x1f0 > > On a system with 10x more threads, it starts to become an annoyance. > > Batch these up so we disable all the per-cpu buffers first, then > synchronize_rcu() once, then reset each of the buffers. This brings > the time down to about 0.5s. It's gone from somewhere more than 10 minutes (I gave up waiting) to 3 seconds. Nice work! Tested-by: Anton Blanchard Thanks, Anton > Cc: Paul McKenney > Cc: Anton Blanchard > Cc: Steven Rostedt > Cc: linux-kernel@vger.kernel.org > Signed-off-by: Nicholas Piggin > --- > include/linux/ring_buffer.h | 1 + > kernel/trace/ring_buffer.c | 85 > +++++++++++++++++++++++++++++++------ kernel/trace/trace.c | > 4 +- 3 files changed, 73 insertions(+), 17 deletions(-) > > diff --git a/include/linux/ring_buffer.h b/include/linux/ring_buffer.h > index c76b2f3b3ac4..136ea0997e6d 100644 > --- a/include/linux/ring_buffer.h > +++ b/include/linux/ring_buffer.h > @@ -143,6 +143,7 @@ bool ring_buffer_iter_dropped(struct > ring_buffer_iter *iter); unsigned long ring_buffer_size(struct > trace_buffer *buffer, int cpu); > void ring_buffer_reset_cpu(struct trace_buffer *buffer, int cpu); > +void ring_buffer_reset_online_cpus(struct trace_buffer *buffer); > void ring_buffer_reset(struct trace_buffer *buffer); > > #ifdef CONFIG_RING_BUFFER_ALLOW_SWAP > diff --git a/kernel/trace/ring_buffer.c b/kernel/trace/ring_buffer.c > index b8e1ca48be50..3f1fd02bd14a 100644 > --- a/kernel/trace/ring_buffer.c > +++ b/kernel/trace/ring_buffer.c > @@ -270,6 +270,9 @@ EXPORT_SYMBOL_GPL(ring_buffer_event_data); > #define for_each_buffer_cpu(buffer, cpu) \ > for_each_cpu(cpu, buffer->cpumask) > > +#define for_each_online_buffer_cpu(buffer, cpu) \ > + for_each_cpu_and(cpu, buffer->cpumask, cpu_online_mask) > + > #define TS_SHIFT 27 > #define TS_MASK ((1ULL << TS_SHIFT) - 1) > #define TS_DELTA_TEST (~TS_MASK) > @@ -4484,6 +4487,26 @@ rb_reset_cpu(struct ring_buffer_per_cpu > *cpu_buffer) rb_head_page_activate(cpu_buffer); > } > > +/* Must have disabled the cpu buffer then done a synchronize_rcu */ > +static void reset_disabled_cpu_buffer(struct ring_buffer_per_cpu > *cpu_buffer) +{ > + unsigned long flags; > + > + raw_spin_lock_irqsave(&cpu_buffer->reader_lock, flags); > + > + if (RB_WARN_ON(cpu_buffer, > local_read(&cpu_buffer->committing))) > + goto out; > + > + arch_spin_lock(&cpu_buffer->lock); > + > + rb_reset_cpu(cpu_buffer); > + > + arch_spin_unlock(&cpu_buffer->lock); > + > + out: > + raw_spin_unlock_irqrestore(&cpu_buffer->reader_lock, flags); > +} > + > /** > * ring_buffer_reset_cpu - reset a ring buffer per CPU buffer > * @buffer: The ring buffer to reset a per cpu buffer of > @@ -4492,7 +4515,6 @@ rb_reset_cpu(struct ring_buffer_per_cpu > *cpu_buffer) void ring_buffer_reset_cpu(struct trace_buffer *buffer, > int cpu) { > struct ring_buffer_per_cpu *cpu_buffer = > buffer->buffers[cpu]; > - unsigned long flags; > > if (!cpumask_test_cpu(cpu, buffer->cpumask)) > return; > @@ -4503,24 +4525,42 @@ void ring_buffer_reset_cpu(struct > trace_buffer *buffer, int cpu) /* Make sure all commits have finished > */ synchronize_rcu(); > > - raw_spin_lock_irqsave(&cpu_buffer->reader_lock, flags); > + reset_disabled_cpu_buffer(cpu_buffer); > > - if (RB_WARN_ON(cpu_buffer, > local_read(&cpu_buffer->committing))) > - goto out; > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > +} > +EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > > - arch_spin_lock(&cpu_buffer->lock); > +/** > + * ring_buffer_reset_cpu - reset a ring buffer per CPU buffer > + * @buffer: The ring buffer to reset a per cpu buffer of > + * @cpu: The CPU buffer to be reset > + */ > +void ring_buffer_reset_online_cpus(struct trace_buffer *buffer) > +{ > + struct ring_buffer_per_cpu *cpu_buffer; > + int cpu; > > - rb_reset_cpu(cpu_buffer); > + for_each_online_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > > - arch_spin_unlock(&cpu_buffer->lock); > + atomic_inc(&cpu_buffer->resize_disabled); > + atomic_inc(&cpu_buffer->record_disabled); > + } > > - out: > - raw_spin_unlock_irqrestore(&cpu_buffer->reader_lock, flags); > + /* Make sure all commits have finished */ > + synchronize_rcu(); > > - atomic_dec(&cpu_buffer->record_disabled); > - atomic_dec(&cpu_buffer->resize_disabled); > + for_each_online_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + reset_disabled_cpu_buffer(cpu_buffer); > + > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > + } > } > -EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > > /** > * ring_buffer_reset - reset a ring buffer > @@ -4528,10 +4568,27 @@ EXPORT_SYMBOL_GPL(ring_buffer_reset_cpu); > */ > void ring_buffer_reset(struct trace_buffer *buffer) > { > + struct ring_buffer_per_cpu *cpu_buffer; > int cpu; > > - for_each_buffer_cpu(buffer, cpu) > - ring_buffer_reset_cpu(buffer, cpu); > + for_each_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + atomic_inc(&cpu_buffer->resize_disabled); > + atomic_inc(&cpu_buffer->record_disabled); > + } > + > + /* Make sure all commits have finished */ > + synchronize_rcu(); > + > + for_each_buffer_cpu(buffer, cpu) { > + cpu_buffer = buffer->buffers[cpu]; > + > + reset_disabled_cpu_buffer(cpu_buffer); > + > + atomic_dec(&cpu_buffer->record_disabled); > + atomic_dec(&cpu_buffer->resize_disabled); > + } > } > EXPORT_SYMBOL_GPL(ring_buffer_reset); > > diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c > index ec44b0e2a19c..9a26a1c875ae 100644 > --- a/kernel/trace/trace.c > +++ b/kernel/trace/trace.c > @@ -2003,7 +2003,6 @@ static void tracing_reset_cpu(struct > array_buffer *buf, int cpu) void tracing_reset_online_cpus(struct > array_buffer *buf) { > struct trace_buffer *buffer = buf->buffer; > - int cpu; > > if (!buffer) > return; > @@ -2015,8 +2014,7 @@ void tracing_reset_online_cpus(struct > array_buffer *buf) > buf->time_start = buffer_ftrace_now(buf, buf->cpu); > > - for_each_online_cpu(cpu) > - ring_buffer_reset_cpu(buffer, cpu); > + ring_buffer_reset_online_cpus(buffer); > > ring_buffer_record_enable(buffer); > }