Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp611701ybv; Fri, 7 Feb 2020 05:28:08 -0800 (PST) X-Google-Smtp-Source: APXvYqzckKkhkZOKEaTkQbvEqoGwblmhkfiwLqGKyoiefdsbLlNPMsfrLfAjiW/6DS/aLRGOP9XC X-Received: by 2002:a9d:64b:: with SMTP id 69mr2554041otn.237.1581082088647; Fri, 07 Feb 2020 05:28:08 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581082088; cv=none; d=google.com; s=arc-20160816; b=DPJ99uUV2OH+FRYbUDeWoiMhQbUJoIr675WZs/T4W/if81ClwdTZlrOzmn2nYSdYac 3cUVVlCWbNXyEAEuhNgV+soqTnfnklfWsIUmo6aoK5Xz/HoidCiF4B4QLw2/weUpKzSB RnBBOB5whrpzQcgaHy4KSsP8xaXYYV2dEdNkKWXWGsoB0jhxDS5IlkdTpCd64pYZLoHd zf3Rvv9MMf0fP7TCmtm+CRN9U9V3q6iQOwy2K8eGZCh3xWZgYqgJ1pQfhsSKe8UMQsRE HKkKBPAL86v6Or1WBKdbzYS76no02EEdxat4PdCfsK7KsVsBLLek8TA3Em/yPAOg4tz1 gudA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=6DLdx59m7mdSNjXwgrV5ORcqFDOMlsVb2GMSq54KHXw=; b=Xm1mN2M3inkfZDHAH+zZCdmVJuW1rXI8sa5IBBcdhqpnF41jbgci3i76bOS0pz4wOi ni8JmC8cRMz4uEhQzZFCGZXL388FtW/xg5sPWJhIPi5HD/6hLcUFeG6VZlquNDGtw4Xn oRkODRcutvIjZD+6w9tsW1i9vH4elD0AqWLTQPazugUoiU6WgC6g/uzaYDLIwTr/rYtm Hvw0CEPLv1fWIo39TaNWL4CKaaj0hLg0EPfu3xYoKBDiol8QOxSVNbrFI60JTJG/ZWsC wzNdz7dqbY8dXOnQukCtZeneRnxepDIQCcIXrD6+Nhswm49tYcYKghgfWiM4U/MdA4nd AHJA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=db7an+6Q; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d8si1642300oti.306.2020.02.07.05.27.56; Fri, 07 Feb 2020 05:28:08 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@joelfernandes.org header.s=google header.b=db7an+6Q; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727618AbgBGN0i (ORCPT + 99 others); Fri, 7 Feb 2020 08:26:38 -0500 Received: from mail-qv1-f67.google.com ([209.85.219.67]:35843 "EHLO mail-qv1-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727491AbgBGN0h (ORCPT ); Fri, 7 Feb 2020 08:26:37 -0500 Received: by mail-qv1-f67.google.com with SMTP id db9so989382qvb.3 for ; Fri, 07 Feb 2020 05:26:36 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=joelfernandes.org; s=google; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:in-reply-to:user-agent; bh=6DLdx59m7mdSNjXwgrV5ORcqFDOMlsVb2GMSq54KHXw=; b=db7an+6Q+QD8AZ5ZFhfpIkf54OZDuXo/vtbZyKe3PIOFoU02y1jTlbqAi0hbfFiExs aep0FdA8N0e2j3cFYR7Oq70dRRtnTbJdjNinDJpixDimYL3uDFNiDg9F9NloINqq6zjg S+l+omYHxziezVvaZ+GBkXSJwMxOKcNAOFCwM= X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:in-reply-to:user-agent; bh=6DLdx59m7mdSNjXwgrV5ORcqFDOMlsVb2GMSq54KHXw=; b=fx6+hSiyL/9s7/YB7co4bNfyCNoQ2P8a+aHFHujUuJRjNs+JkQfSag15GT5APW6uR9 y6NG8OARXOiHoMgKnYo+Xi4KLgwtktqXtjKCvp3xzQhVJ/TD1xUAUzm8z47bb36TO4Ou Il1KCvOlTKPvuAFU4BlrJJhS/Nu49zwruTFGF4TQXp72Jz+RIShFLdYvKfBfpYR/1NsR 56azd9W10Uv16Q+aRPjfuZmubunV/qgxSvpiK5XsbEQjk54ghLTuHLMZap91Ss+EXq28 fMFSAjtp5LEDuqyl7Nj732zWxC1Gptzy+B+KxKuP2tfyCr5U59q/omnaYhSCY7l1qWOD Q1gQ== X-Gm-Message-State: APjAAAVt+6h5J4paQL1qS0dgMpL+Vl8pvbOkHnB/l1Yj/8ALxQuEPHUN akoMWi7aqOm902OLz3JyjNme4Q== X-Received: by 2002:ad4:4b08:: with SMTP id r8mr6882597qvw.250.1581081996011; Fri, 07 Feb 2020 05:26:36 -0800 (PST) Received: from localhost ([2620:15c:6:12:9c46:e0da:efbf:69cc]) by smtp.gmail.com with ESMTPSA id 24sm1325047qka.32.2020.02.07.05.26.35 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 07 Feb 2020 05:26:35 -0800 (PST) Date: Fri, 7 Feb 2020 08:26:35 -0500 From: Joel Fernandes To: Frankie Chang Cc: Greg Kroah-Hartman , Todd Kjos , Martijn Coenen , Arve =?iso-8859-1?B?SGr4bm5lduVn?= , Christian Brauner , linux-kernel@vger.kernel.org, linux-mediatek@lists.infradead.org, wsd_upstream@mediatek.com, Jian-Min.Liu@mediatek.com Subject: Re: [PATCH v1 1/1] binder: transaction latency tracking for user build Message-ID: <20200207132635.GB121785@google.com> References: <1580885572-14272-1-git-send-email-Frankie.Chang@mediatek.com> <20200205093612.GA1167956@kroah.com> <20200205154943.GE142103@google.com> <1581045023.22229.46.camel@mtkswgap22> <20200207031759.GA121785@google.com> <1581056939.22229.55.camel@mtkswgap22> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1581056939.22229.55.camel@mtkswgap22> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Feb 07, 2020 at 02:28:59PM +0800, Frankie Chang wrote: > On Thu, 2020-02-06 at 22:17 -0500, Joel Fernandes wrote: > > On Fri, Feb 07, 2020 at 11:10:23AM +0800, Frankie Chang wrote: > > > On Wed, 2020-02-05 at 10:49 -0500, Joel Fernandes wrote: > > > > On Wed, Feb 05, 2020 at 09:36:12AM +0000, Greg Kroah-Hartman wrote: > > > > > On Wed, Feb 05, 2020 at 02:52:52PM +0800, Frankie Chang wrote: > > > > > > Record start/end timestamp to binder transaction. > > > > > > When transaction is completed or transaction is free, > > > > > > it would be checked if transaction latency over threshold (2 sec), > > > > > > if yes, printing related information for tracing. > > > > > > > > > > > > Signed-off-by: Frankie Chang > > > > > > --- > > > > > > drivers/android/Kconfig | 8 +++ > > > > > > drivers/android/binder.c | 107 +++++++++++++++++++++++++++++++++++++ > > > > > > drivers/android/binder_internal.h | 4 ++ > > > > > > 3 files changed, 119 insertions(+) > > > > > > > > > > > > diff --git a/drivers/android/Kconfig b/drivers/android/Kconfig > > > > > > index 6fdf2ab..7ba80eb 100644 > > > > > > --- a/drivers/android/Kconfig > > > > > > +++ b/drivers/android/Kconfig > > > > > > @@ -54,6 +54,14 @@ config ANDROID_BINDER_IPC_SELFTEST > > > > > > exhaustively with combinations of various buffer sizes and > > > > > > alignments. > > > > > > > > > > > > +config BINDER_USER_TRACKING > > > > > > + bool "Android Binder transaction tracking" > > > > > > + help > > > > > > + Used for track abnormal binder transaction which is over 2 seconds, > > > > > > + when the transaction is done or be free, this transaction would be > > > > > > + checked whether it executed overtime. > > > > > > + If yes, printing out the detail info about it. > > > > > > + > > > > > > endif # if ANDROID > > > > > > > > > > > > endmenu > > > > > > diff --git a/drivers/android/binder.c b/drivers/android/binder.c > > > > > > index e9bc9fc..5a352ee 100644 > > > > > > --- a/drivers/android/binder.c > > > > > > +++ b/drivers/android/binder.c > > > > > > @@ -76,6 +76,11 @@ > > > > > > #include "binder_internal.h" > > > > > > #include "binder_trace.h" > > > > > > > > > > > > +#ifdef CONFIG_BINDER_USER_TRACKING > > > > > > +#include > > > > > > +#include > > > > > > +#endif > > > > > > + > > > > > > static HLIST_HEAD(binder_deferred_list); > > > > > > static DEFINE_MUTEX(binder_deferred_lock); > > > > > > > > > > > > @@ -591,8 +596,104 @@ struct binder_transaction { > > > > > > * during thread teardown > > > > > > */ > > > > > > spinlock_t lock; > > > > > > +#ifdef CONFIG_BINDER_USER_TRACKING > > > > > > + struct timespec timestamp; > > > > > > + struct timeval tv; > > > > > > +#endif > > > > > > }; > > > > > > > > > > > > +#ifdef CONFIG_BINDER_USER_TRACKING > > > > > > + > > > > > > +/* > > > > > > + * binder_print_delay - Output info of a delay transaction > > > > > > + * @t: pointer to the over-time transaction > > > > > > + */ > > > > > > +static void binder_print_delay(struct binder_transaction *t) > > > > > > +{ > > > > > > + struct rtc_time tm; > > > > > > + struct timespec *startime; > > > > > > + struct timespec cur, sub_t; > > > > > > + > > > > > > + ktime_get_ts(&cur); > > > > > > + startime = &t->timestamp; > > > > > > + sub_t = timespec_sub(cur, *startime); > > > > > > + > > > > > > + /* if transaction time is over than 2 sec, > > > > > > + * show timeout warning log. > > > > > > + */ > > > > > > + if (sub_t.tv_sec < 2) > > > > > > + return; > > > > > > + > > > > > > + rtc_time_to_tm(t->tv.tv_sec, &tm); > > > > > > + > > > > > > + spin_lock(&t->lock); > > > > > > + pr_info_ratelimited("%d: from %d:%d to %d:%d", > > > > > > + t->debug_id, > > > > > > + t->from ? t->from->proc->pid : 0, > > > > > > + t->from ? t->from->pid : 0, > > > > > > + t->to_proc ? t->to_proc->pid : 0, > > > > > > + t->to_thread ? t->to_thread->pid : 0); > > > > > > + spin_unlock(&t->lock); > > > > > > + > > > > > > + pr_info_ratelimited(" total %u.%03ld s code %u start %lu.%03ld android %d-%02d-%02d %02d:%02d:%02d.%03lu\n", > > > > > > + (unsigned int)sub_t.tv_sec, > > > > > > + (sub_t.tv_nsec / NSEC_PER_MSEC), > > > > > > + t->code, > > > > > > + (unsigned long)startime->tv_sec, > > > > > > + (startime->tv_nsec / NSEC_PER_MSEC), > > > > > > + (tm.tm_year + 1900), (tm.tm_mon + 1), tm.tm_mday, > > > > > > + tm.tm_hour, tm.tm_min, tm.tm_sec, > > > > > > + (unsigned long)(t->tv.tv_usec / USEC_PER_MSEC)); > > > > > > +} > > > > > > > > > > Ick, why not use a tracepoint for this instead? > > > > > > > > > > And what is userspace supposed to do with this if they see it? > > > > > > > > Or another option is to implement this separately outside of binder.c using > > > > register_trace_* on the existing binder tracepoints, similar to what say the > > > > block tracer or preempt-off tracers do. Call it, say, "binder-latency tracer". > > > > > > > > That way all of this tracing code is in-kernel but outside of binder.c. > > > > > > > > thanks, > > > > > > > > - Joel > > > > > > > Time limitation of recording is the reason why we don't use tracepoint. > > > In some situations, the exception is caused by a series of transactions > > > interaction. > > > Some abnormal transactions may be pending for a long time ago, they > > > could not be recorded due to buffer limited. > > > > register_trace_* does not use the trace buffer so I am not sure what you > > mean. I am asking you to use tracepoints, not ftrace events. > > > > thanks, > > > > - Joel > > > The existing binder tracepoint may not be precise enough because there > is no proper tracepoint which can represent the real finished time of > transaction. May be you can add such TP then? > The reason why we don't put the code outside binder.c is that structure > of binder_transaction didn't put in header file. > If it could be moved to binder_internal.h, then we can add > "binder-latency tracer" as you mentioned earlier. That should be doable. The reason I don't like custom tracing logic in core files as your patch is doing, is that it is doubtful if a lot of people would use it to warrant adding into a core file. In this case, probably a separate tracer (outside of binder.c) is a better option so as to not pollute the core files. thanks, - Joel