Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753446AbbKYFGX (ORCPT ); Wed, 25 Nov 2015 00:06:23 -0500 Received: from mail-pa0-f52.google.com ([209.85.220.52]:35385 "EHLO mail-pa0-f52.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751021AbbKYFGV (ORCPT ); Wed, 25 Nov 2015 00:06:21 -0500 Subject: Re: [PATCH] perf record: Add snapshot mode support for perf's regular events To: "Wangnan (F)" , Arnaldo Carvalho de Melo References: <1448373632-8806-1-git-send-email-yunlong.song@huawei.com> <1448373632-8806-2-git-send-email-yunlong.song@huawei.com> <56547D01.8020606@gmail.com> <20151124152023.GE18140@kernel.org> <56553022.8000101@huawei.com> Cc: Yunlong Song , a.p.zijlstra@chello.nl, paulus@samba.org, mingo@redhat.com, linux-kernel@vger.kernel.org, namhyung@kernel.org, ast@kernel.org, masami.hiramatsu.pt@hitachi.com, kan.liang@intel.com, adrian.hunter@intel.com, jolsa@kernel.org, bp@alien8.de, jean.pihet@linaro.org, rric@kernel.org, xiakaixu@huawei.com, hekuang@huawei.com From: David Ahern Message-ID: <565541C5.1020300@gmail.com> Date: Tue, 24 Nov 2015 22:06:13 -0700 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.11; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 MIME-Version: 1.0 In-Reply-To: <56553022.8000101@huawei.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3098 Lines: 60 On 11/24/15 8:50 PM, Wangnan (F) wrote: > Actually we are discussing about this problem. > > For such tracking events (PERF_RECORD_FORK...), we have dummy event so > it is possible for us to receive tracking events from a separated > channel, therefore we don't have to parse every events to pick those > events out. Instead, we can process tracking events differently, then > more interesting things can be done. For example, squashing those tracking > events if it takes too much memory... If you look at my daemon code I process task events (FORK, MMAP, EXIT) to maintain task state including flushing threads when they terminate. This is a trade-off to having the knowledge to pretty-print addresses (address to symbol resolution) yet not grow without bounds -- be it a file or memory. > > Furthermore, there's another problem being discussed: if userspace > ringbuffer > is bytes based, parsing event is unavoidable. Without parsing event we are > unable to find the new 'head' pointer when overwriting. Instead, we are > thinking about a bucket-based ringbuffer that, let perf maintain a series > of bucket, each time 'poll' return, perf copies new events to the start of > a bucket. If all bucket is occupied, we drop the oldest bucket. > Bucket-based > ringbuffer watest some memory but can avoid event parsing. > > And there's many other problems in this patch. For example, when SIGUSR2 is > received, we need to do something to let all perf events start dumping. > Current implementation can't ensure we receive events just before the > SIGUSR2 if we not set 'no-buffer'. > > Also, output events are in one perf.data, which is not user friendly. > Our final goal is to make perf a daemonized moniter, which can run 7x24 > in user's environment. Each time a glitch is detected, a framework sends > a signal to perf to get a perf.data from it perf. The framework manage > those perf.data like logrotate, help developer analysis those glitch. Exactly. And that's why my daemon is written the way it is. It is intended to run 24x7x365. It retains the last N events which are dumped when some external trigger tells it to. Arnaldo: you asked about an event in the stream but that is not possible. My scheduling daemon targets CPU usage prior to a significant event (what was running, how long, where, etc). The significant event in the motivating case was STP timeouts -- if stp daemon is not able to send BPDUs why? What was running leading up to the timeout. The point is something external to the perf daemon says 'hey, save the last N-events for analysis'. This case sounds like a generalization of my problem with the desire to write a perf.data file instead of processing the events and dumping to a file. It is doable. For example, synthesize task events for all threads in memory and then write out the saved samples. -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/