Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751347AbdIOPL5 convert rfc822-to-8bit (ORCPT ); Fri, 15 Sep 2017 11:11:57 -0400 Received: from mga03.intel.com ([134.134.136.65]:65116 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751152AbdIOPL4 (ORCPT ); Fri, 15 Sep 2017 11:11:56 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.42,397,1500966000"; d="scan'208";a="129083353" From: "Liang, Kan" To: Arnaldo Carvalho de Melo CC: "peterz@infradead.org" , "mingo@redhat.com" , "linux-kernel@vger.kernel.org" , "jolsa@kernel.org" , "namhyung@kernel.org" , "Hunter, Adrian" , "Odzioba, Lukasz" , "ak@linux.intel.com" Subject: RE: [PATCH RFC V2 00/10] perf top optimization Thread-Topic: [PATCH RFC V2 00/10] perf top optimization Thread-Index: AQHTKqUGKqa3XFGJ30eMjpDMlJ7smaKybY4AgACGmHD//30ZgIAB8bsAgAGsAaA= Date: Fri, 15 Sep 2017 15:11:51 +0000 Message-ID: <37D7C6CF3E00A74B8858931C1DB2F077537BF0A8@SHSMSX103.ccr.corp.intel.com> References: <1505096603-215017-1-git-send-email-kan.liang@intel.com> <20170913152506.GK5866@kernel.org> <37D7C6CF3E00A74B8858931C1DB2F077537AF31A@SHSMSX103.ccr.corp.intel.com> <20170913153819.GL5866@kernel.org> <20170914211946.GB10371@kernel.org> In-Reply-To: <20170914211946.GB10371@kernel.org> Accept-Language: zh-CN, en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-titus-metadata-40: eyJDYXRlZ29yeUxhYmVscyI6IiIsIk1ldGFkYXRhIjp7Im5zIjoiaHR0cDpcL1wvd3d3LnRpdHVzLmNvbVwvbnNcL0ludGVsMyIsImlkIjoiZTdjOTI0MzgtZTRmZi00OThjLWI1NjMtZDRmY2M4NTE3OGE3IiwicHJvcHMiOlt7Im4iOiJDVFBDbGFzc2lmaWNhdGlvbiIsInZhbHMiOlt7InZhbHVlIjoiQ1RQX0lDIn1dfV19LCJTdWJqZWN0TGFiZWxzIjpbXSwiVE1DVmVyc2lvbiI6IjE2LjUuOS4zIiwiVHJ1c3RlZExhYmVsSGFzaCI6Ijd1YjJueW9id1AwS3l6TmhNZVB2QjRMNXRYa1V5UDBVVG12RXRqeWNkYU09In0= x-ctpclassification: CTP_IC dlp-product: dlpe-windows dlp-version: 11.0.0.116 dlp-reaction: no-action x-originating-ip: [10.239.127.40] Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2715 Lines: 76 > Em Wed, Sep 13, 2017 at 12:38:19PM -0300, Arnaldo Carvalho de Melo > escreveu: > > Em Wed, Sep 13, 2017 at 03:29:44PM +0000, Liang, Kan escreveu: > > > > > > > > Em Sun, Sep 10, 2017 at 07:23:13PM -0700, kan.liang@intel.com > escreveu: > > > > > > > > So I got the first two patches already merged, and made some > > > > comments about the other patches, please check those, > > > > > > > > > > Thanks for the review Arnaldo. > > > > > > I will take a close look for the comments. > > > For the next version, I only need to include patch 3-10, correct? > > > > Right, and go from my perf/core branch. The hashtable patch is still > > not there as I am running tests before pushing out, but it should be > > there later today. > > So, its at my repo, branch tmp.perf/threads_hashtable > > But 'perf trace' is broken, please take a look below: > > [root@jouet ~]# gdb -c core > GNU gdb (GDB) Fedora 8.0-20.fc26 > > Core was generated by `perf trace -e block:block_bio_queue'. > Program terminated with signal SIGSEGV, Segmentation fault. > #0 0x000000000051089a in ?? () > (gdb) file perf > Reading symbols from perf...done. > (gdb) bt > #0 0x000000000051089a in ____machine__findnew_thread > (machine=0x3dfcab0, threads=0x3dfca78, pid=-1, tid=-1, create=false) at > util/machine.c:429 I think the root cause is tid==-1. So the index of hashtable will be -1. The patch as below should fix it. diff --git a/tools/perf/util/machine.h b/tools/perf/util/machine.h index e6d5381..3c564b8 100644 --- a/tools/perf/util/machine.h +++ b/tools/perf/util/machine.h @@ -57,7 +57,7 @@ struct machine { static inline struct threads *machine__threads(struct machine *machine, pid_t tid) { - return &machine->threads[tid % THREADS__TABLE_SIZE]; + return &machine->threads[(unsigned int)tid % THREADS__TABLE_SIZE]; } static inline There should be another issue which was introduced by 33013b9a5607 ("perf machine: Optimize a bit the machine__findnew_thread() methods") It should use tid not pid to get the threads. diff --git a/tools/perf/util/machine.c b/tools/perf/util/machine.c index 90ae9c7..ddeea05 100644 --- a/tools/perf/util/machine.c +++ b/tools/perf/util/machine.c @@ -473,7 +473,7 @@ static struct thread *____machine__findnew_thread(struct machine *machine, struct thread *__machine__findnew_thread(struct machine *machine, pid_t pid, pid_t tid) { - return ____machine__findnew_thread(machine, machine__threads(machine, pid), pid, tid, true); + return ____machine__findnew_thread(machine, machine__threads(machine, tid), pid, tid, true); } They are small fixes. I think it's better to merge them with the old patches. Should I include the modified hashtable patches in V3? Thanks, Kan