Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1754677AbaB0Cy2 (ORCPT ); Wed, 26 Feb 2014 21:54:28 -0500 Received: from szxga03-in.huawei.com ([119.145.14.66]:24339 "EHLO szxga03-in.huawei.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751700AbaB0Cy1 (ORCPT ); Wed, 26 Feb 2014 21:54:27 -0500 Message-ID: <530EA8B3.5080205@huawei.com> Date: Thu, 27 Feb 2014 10:53:39 +0800 From: xiakaixu User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:17.0) Gecko/20130801 Thunderbird/17.0.8 MIME-Version: 1.0 To: Namhyung Kim CC: , , , , , Huxinwei , Genghui , Subject: Re: A Bug Inquiry in linux/tools/perf/builtin-record.c References: <53040D7F.3000905@huawei.com> <530C4330.20706@huawei.com> <87lhwyuwpy.fsf@sejong.aot.lge.com> In-Reply-To: <87lhwyuwpy.fsf@sejong.aot.lge.com> Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 8bit X-Originating-IP: [10.111.100.238] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Namhyung, 于 2014/2/26 16:03, Namhyung Kim 写道: > Hi xiakaixu, > >> 于 2014/2/19 9:48, xiakaixu 写道: >>> Hi all, >>> >>> There is a bug found in my work when running "perf record". The basic information >>> is here. As we know, perf record is a parent process and the programme traced is >>> a child process when running "perf record". Sometimes the child process become >>> zombie state and disappear until the parent process is killed. The bug stays in linux/ >>> tools/perf/builtin-record.c. >>> ********************************************************************* >>> static int __cmd_record(struct perf_record *rec, int argc, const char **argv) >>> ...... >>> if (hits == rec->samples) { >>> if (done) >>> break; >>> err = poll(evsel_list->pollfd, evsel_list->nr_fds, -1); >>> waking++; >>> } >>> ...... >>> ********************************************************************* >>> The parent process still call the function >>> poll(evsel_list->pollfd, evsel_list->nr_fds, -1) when the child process has exited >>> already, which caused a zombie process. >>> >>> May I have your opinion ? >>> Waiting for your reply! > > Do you have a real bug report based on this? yeah, of course we have it, I'll be glad to provide it if necessary. > > AFAIK perf record installed a signal handler for SIGCHLD so it'll set > the 'done' variable when child exits and then break the loop. yes, you are right. Though the 'done' varible will be set when child exits, there is time gap between "if(done)" statement and "poll(...)" function. The 'done' variable won't be judge when child exits in this time gap. You know this time gap is instruction level, so this bug is small probability. My solution is adding a while(...) statement outside poll(...) function. > > Thanks, > Namhyung > . > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/