Received: by 2002:a25:b323:0:0:0:0:0 with SMTP id l35csp2045465ybj; Sun, 22 Sep 2019 18:23:21 -0700 (PDT) X-Google-Smtp-Source: APXvYqzqQsT4OfG7/CbLN7peMahYeRKZZKNVvwSYgNMybC7cDBIDLciSrTD9WxoicxeamZzy56VR X-Received: by 2002:a50:aa96:: with SMTP id q22mr34141029edc.179.1569201801030; Sun, 22 Sep 2019 18:23:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1569201801; cv=none; d=google.com; s=arc-20160816; b=QSuif8gBXZQP34rdjT+qYK5Mt9uC3vqeHMzjrcPvKMlW5/EaKGWjjuxGeS6ldOBcWp yYxHm6MaOUbaAv42JMJUWmVyAh8Znu9NJ3HZ0cCzf/bzO7iKQnoZBQRIPpA6cNRIxKLP wkE+PyaBsdtjZHxsfA2woM0d3OCJdKOIawBLpVt+PRS8XBaaGOFuw8PA9AV/Rc7OkLsJ 5MLnGXCpyglZez4stffYuFRzz3cRIwNUpKLDpT7c7hp4Xqqc1cPVV5Lf2ys+2fYRMy5q WZgh0kSy4a0FbJTGsUyS5/s6jmySapFiHB7HBJRV6yQq/ZPRG1jqOR6xAT/oOH6hT1TS hDOw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:references:in-reply-to:message-id:date :subject:cc:to:from:dkim-signature; bh=bxChQ2KhuPQDXJyvF0uqDEGqjNvnWGadSs4vVg4OZC4=; b=XtQlESw7edfemG9jBSqubKInAICNwtU6q80pwsZb9M6Mz9DQMC4eDMQxtGzzegk/wp TR9MkU+YgoMJyvfJwFcGthjvFS+UVfcpYahukyDpY5uWi/Ew4ZYSFhrK1/R6Vjvoh96j ZlAxTkDwZR4WQ9KFehsQqL3sRD0hEWFFC9k+bfw15aMy9PFmPwChGSR/+y5XGL4+2MiL ATWeORmJZ6D1+o0AgY2BwAms0LpvXSYY0FXKH81AMulgVsRsmnPqnh4wOtWw4CgC18Mw y2rLFa8R+dhUiTxNa7CwCixqACOzLDFp+FD7m2DXV9ZXGQXv0s5S4bl3o4UzyEO24rdn GTxQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=GWCDCVa9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id si30si4332327ejb.92.2019.09.22.18.22.55; Sun, 22 Sep 2019 18:23:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=GWCDCVa9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392430AbfITPW2 (ORCPT + 99 others); Fri, 20 Sep 2019 11:22:28 -0400 Received: from mail-wr1-f68.google.com ([209.85.221.68]:35897 "EHLO mail-wr1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388486AbfITPW1 (ORCPT ); Fri, 20 Sep 2019 11:22:27 -0400 Received: by mail-wr1-f68.google.com with SMTP id y19so7197132wrd.3 for ; Fri, 20 Sep 2019 08:22:25 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references; bh=bxChQ2KhuPQDXJyvF0uqDEGqjNvnWGadSs4vVg4OZC4=; b=GWCDCVa9x+LcBo9RIkPxzZlkKsL8iwDQ6ed4sHkCJgp1p8wnBORtjJaIr+QsPnTmTy bpEmCBAn8yBA9tIwX5v0a0SVkSkD8lKq3TfhWldqe2cCLWOkdPTvBWffAgfv0ZtsUPzD jAgt+CqJzAG2rL2AWMGsGICTjoU32PPBBXZG6pNJxRHlnzZ1YciIp/n5zXCGnbviBDax SdwiSfRez14+Z+0lZZevT8p552ENzueu6oSr/Jw7y3Ils18cFMpzKHQS4P529wBexIyT e9DsF7a7jyBzeqdI13a8y4+naZ0z+1oe+62fQF4xTAlEb7vgyHBcYLqgRdoOfCTs+kDF rcrQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references; bh=bxChQ2KhuPQDXJyvF0uqDEGqjNvnWGadSs4vVg4OZC4=; b=kQWg7mJbcqPjz4zcvyTX8gUQKb6eDRNCZVtEI4m5YZ+dKXZbzRfwufihq4OXB6aJqB EXNV+G7C6iDg65OuJe/2Vgz4ZBAr73h+7o4/j+JnQ6cwY9ZWpwQzicQ1VUlIr2R7uKcz i2kqlQ+Q/a0RxJHOHiEfr6/X1z8yIzAwPUoGI6KOXD+LBUHDlBecWskW/OSCeqUECcrt DWktw2GxZG5DXfY99RWizCKH2p3B70Galt0IzAKx5YdBFPu34gyNIf9C9Ci3wWm41d5P paKfJLaQvx6RsicJBLCg2r3M4dQzA9Ea2y93SGZ4V5vYFbMHLbf/KkHJmq+nAwR4+/j/ VhcA== X-Gm-Message-State: APjAAAWqPgSEL0fkHaILQuNunrfZrR7Nj2dhcViuT3gD1dWieL4uw1vd wbpVGUlbN/Mv+DpTjFe5Gg== X-Received: by 2002:adf:c58b:: with SMTP id m11mr11831072wrg.252.1568992944803; Fri, 20 Sep 2019 08:22:24 -0700 (PDT) Received: from buster-jangle.bmw-carit.intra ([217.89.178.117]) by smtp.gmail.com with ESMTPSA id x2sm3152901wrn.81.2019.09.20.08.22.23 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 20 Sep 2019 08:22:24 -0700 (PDT) From: "Viktor Rosendahl (BMW)" To: Steven Rostedt , Ingo Molnar , linux-kernel@vger.kernel.org Cc: Joel Fernandes , "Viktor Rosendahl (BMW)" Subject: [PATCH v7 1/4] ftrace: Implement fs notification for tracing_max_latency Date: Fri, 20 Sep 2019 17:22:16 +0200 Message-Id: <20190920152219.12920-2-viktor.rosendahl@gmail.com> X-Mailer: git-send-email 2.17.1 In-Reply-To: <20190920152219.12920-1-viktor.rosendahl@gmail.com> References: <20190920152219.12920-1-viktor.rosendahl@gmail.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This patch implements the feature that the tracing_max_latency file, e.g. /sys/kernel/debug/tracing/tracing_max_latency will receive notifications through the fsnotify framework when a new latency is available. One particularly interesting use of this facility is when enabling threshold tracing, through /sys/kernel/debug/tracing/tracing_thresh, together with the preempt/irqsoff tracers. This makes it possible to implement a user space program that can, with equal probability, obtain traces of latencies that occur immediately after each other in spite of the fact that the preempt/irqsoff tracers operate in overwrite mode. This facility works with the hwlat, preempt/irqsoff, and wakeup tracers. The tracers may call the latency_fsnotify() from places such as __schedule() or do_idle(); this makes it impossible to call queue_work() directly without risking a deadlock. The same would happen with a softirq, kernel thread or tasklet. For this reason we use the irq_work mechanism to call queue_work(). This patch creates a new workqueue. The reason for doing this is that I wanted to use the WQ_UNBOUND and WQ_HIGHPRI flags. My thinking was that WQ_UNBOUND might help with the latency in some important cases. If we use: queue_work(system_highpri_wq, &tr->fsnotify_work); then the work will (almost) always execute on the same CPU but if we are unlucky that CPU could be too busy while there could be another CPU in the system that would be able to process the work soon enough. queue_work_on() could be used to queue the work on another CPU but it seems difficult to select the right CPU. Signed-off-by: Viktor Rosendahl (BMW) --- kernel/trace/trace.c | 75 +++++++++++++++++++++++++++++++++++++- kernel/trace/trace.h | 18 +++++++++ kernel/trace/trace_hwlat.c | 4 +- 3 files changed, 94 insertions(+), 3 deletions(-) diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c index 947ba433865f..2d2150bf0d42 100644 --- a/kernel/trace/trace.c +++ b/kernel/trace/trace.c @@ -44,6 +44,9 @@ #include #include #include +#include +#include +#include #include "trace.h" #include "trace_output.h" @@ -1480,6 +1483,74 @@ static ssize_t trace_seq_to_buffer(struct trace_seq *s, void *buf, size_t cnt) unsigned long __read_mostly tracing_thresh; +#if (defined(CONFIG_TRACER_MAX_TRACE) || defined(CONFIG_HWLAT_TRACER)) && \ + defined(CONFIG_FSNOTIFY) + +static const struct file_operations tracing_max_lat_fops; +static struct workqueue_struct *fsnotify_wq; + +static void latency_fsnotify_workfn(struct work_struct *work) +{ + struct trace_array *tr = container_of(work, struct trace_array, + fsnotify_work); + fsnotify(tr->d_max_latency->d_inode, FS_MODIFY, + tr->d_max_latency->d_inode, FSNOTIFY_EVENT_INODE, NULL, 0); +} + +static void latency_fsnotify_workfn_irq(struct irq_work *iwork) +{ + struct trace_array *tr = container_of(iwork, struct trace_array, + fsnotify_irqwork); + queue_work(fsnotify_wq, &tr->fsnotify_work); +} + +static void trace_create_maxlat_file(struct trace_array *tr, + struct dentry *d_tracer) +{ + INIT_WORK(&tr->fsnotify_work, latency_fsnotify_workfn); + init_irq_work(&tr->fsnotify_irqwork, latency_fsnotify_workfn_irq); + tr->d_max_latency = trace_create_file("tracing_max_latency", 0644, + d_tracer, &tr->max_latency, + &tracing_max_lat_fops); +} + +__init static int latency_fsnotify_init(void) +{ + fsnotify_wq = alloc_workqueue("tr_max_lat_wq", + WQ_UNBOUND | WQ_HIGHPRI, 0); + if (!fsnotify_wq) { + pr_err("Unable to allocate tr_max_lat_wq\n"); + return -ENOMEM; + } + return 0; +} + +late_initcall_sync(latency_fsnotify_init); + +void latency_fsnotify(struct trace_array *tr) +{ + if (!fsnotify_wq) + return; + /* + * We cannot call queue_work(&tr->fsnotify_work) from here because it's + * possible that we are called from __schedule() or do_idle(), which + * could cause a deadlock. + */ + irq_work_queue(&tr->fsnotify_irqwork); +} + +/* + * (defined(CONFIG_TRACER_MAX_TRACE) || defined(CONFIG_HWLAT_TRACER)) && \ + * defined(CONFIG_FSNOTIFY) + */ +#else + +#define trace_create_maxlat_file(tr, d_tracer) \ + trace_create_file("tracing_max_latency", 0644, d_tracer, \ + &tr->max_latency, &tracing_max_lat_fops) + +#endif + #ifdef CONFIG_TRACER_MAX_TRACE /* * Copy the new maximum trace into the separate maximum-trace @@ -1518,6 +1589,7 @@ __update_max_tr(struct trace_array *tr, struct task_struct *tsk, int cpu) /* record this tasks comm */ tracing_record_cmdline(tsk); + latency_fsnotify(tr); } /** @@ -8550,8 +8622,7 @@ init_tracer_tracefs(struct trace_array *tr, struct dentry *d_tracer) create_trace_options_dir(tr); #if defined(CONFIG_TRACER_MAX_TRACE) || defined(CONFIG_HWLAT_TRACER) - trace_create_file("tracing_max_latency", 0644, d_tracer, - &tr->max_latency, &tracing_max_lat_fops); + trace_create_maxlat_file(tr, d_tracer); #endif if (ftrace_create_function_files(tr, d_tracer)) diff --git a/kernel/trace/trace.h b/kernel/trace/trace.h index 005f08629b8b..4913ed1138aa 100644 --- a/kernel/trace/trace.h +++ b/kernel/trace/trace.h @@ -16,6 +16,8 @@ #include #include #include +#include +#include #ifdef CONFIG_FTRACE_SYSCALLS #include /* For NR_SYSCALLS */ @@ -264,6 +266,11 @@ struct trace_array { #endif #if defined(CONFIG_TRACER_MAX_TRACE) || defined(CONFIG_HWLAT_TRACER) unsigned long max_latency; +#ifdef CONFIG_FSNOTIFY + struct dentry *d_max_latency; + struct work_struct fsnotify_work; + struct irq_work fsnotify_irqwork; +#endif #endif struct trace_pid_list __rcu *filtered_pids; /* @@ -785,6 +792,17 @@ void update_max_tr_single(struct trace_array *tr, struct task_struct *tsk, int cpu); #endif /* CONFIG_TRACER_MAX_TRACE */ +#if (defined(CONFIG_TRACER_MAX_TRACE) || defined(CONFIG_HWLAT_TRACER)) && \ + defined(CONFIG_FSNOTIFY) + +void latency_fsnotify(struct trace_array *tr); + +#else + +#define latency_fsnotify(tr) do { } while (0) + +#endif + #ifdef CONFIG_STACKTRACE void __trace_stack(struct trace_array *tr, unsigned long flags, int skip, int pc); diff --git a/kernel/trace/trace_hwlat.c b/kernel/trace/trace_hwlat.c index fa95139445b2..9c379261ee89 100644 --- a/kernel/trace/trace_hwlat.c +++ b/kernel/trace/trace_hwlat.c @@ -254,8 +254,10 @@ static int get_sample(void) trace_hwlat_sample(&s); /* Keep a running maximum ever recorded hardware latency */ - if (sample > tr->max_latency) + if (sample > tr->max_latency) { tr->max_latency = sample; + latency_fsnotify(tr); + } } out: -- 2.17.1