Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752611AbbDXNtb (ORCPT ); Fri, 24 Apr 2015 09:49:31 -0400 Received: from szxga03-in.huawei.com ([119.145.14.66]:50043 "EHLO szxga03-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751207AbbDXNt3 (ORCPT ); Fri, 24 Apr 2015 09:49:29 -0400 Message-ID: <553A49CF.1020905@huawei.com> Date: Fri, 24 Apr 2015 21:49:03 +0800 From: Yunlong Song User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:31.0) Gecko/20100101 Thunderbird/31.3.0 MIME-Version: 1.0 To: , , , "Arnaldo Carvalho de Melo" CC: , Subject: Re: [Question] How does perf still record the stack of a specified pid even when that process is interrupted and CPU is scheduled to other process References: <553A45CA.8020808@huawei.com> In-Reply-To: <553A45CA.8020808@huawei.com> Content-Type: text/plain; charset="utf-8" Content-Transfer-Encoding: 7bit X-Originating-IP: [10.111.74.205] X-CFilter-Loop: Reflected X-Mirapoint-Virus-RAPID-Raw: score=unknown(0), refid=str=0001.0A020204.553A49DA.0314,ss=1,re=0.000,recu=0.000,reip=0.000,cl=1,cld=1,fgs=0, ip=0.0.0.0, so=2013-05-26 15:14:31, dmn=2013-03-21 17:37:32 X-Mirapoint-Loop-Id: 74c813c4f1ec380f59569ad0e845b2e4 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2253 Lines: 47 On 2015/4/24 21:31, Yunlong Song wrote: > [Profiling Background] > Now we are profiling the performance of ext4 and f2fs on an eMMC card with iozone, > we find a case that ext4 is better than f2fs in random write under the test of > "iozone -s 262144 -r 64 -i 0 -i 2". We want to analyze the I/O delay of the two > file systems. We have got a conclusion that 1% of sys_write takes up 60% time of > the overall sys_write (262144/64=4096). We want to find out the call stack during > this specific 1% sys_write. Our idea is to record the stack in a certain time period > and since the specific 1% case takes up 60% time, the total number of records of its > stack should also takes up 60% of the total records, then we can recognize those stacks > and figure out what the I/O stack of f2fs is doing in the 1% case. > > [Profiling Problem] > > Although perf can record the events (with call stack) of a specified pid, e.g. using > "perf record -g iozone -s 262144 -r 64 -i 0 -i 2". But we find iozone is interrupted > and the CPU is scheduled to other process. As a result, perf will not record any events > of iozone until iozone's context is recovered and the CPU is scheduled to continue > processing the sys_write of iozone. >This obeys our initial idea which is described in [Profiling Background], This "disobeys" our initial idea which is described in [Profiling Background], since we cannot recognize the call stacks of the specific 1% case > by using the ratio of the record number. > > [Alternative Solution without Perf] > We can use /proc/#pid/stack to record the stack in a certain period (e.g. 1ms) of iozone, > no matter whether iozone is interrupted or not. However, we have not taken a deep sight > into this, since we want to use perf to do this kind of thing. > > [Question about Perf] > So we have a question that "How does perf still record the stack of a specified pid even > when that process is interrupted and CPU is scheduled to other process?" > -- Thanks, Yunlong Song -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/