Received: by 2002:ac0:aa62:0:0:0:0:0 with SMTP id w31-v6csp2416151ima; Mon, 22 Oct 2018 09:21:46 -0700 (PDT) X-Google-Smtp-Source: ACcGV61QWj9SaJ5gOGIk1be97H3u21I1Y9Oli2poyPGwZo72CS1FwffA/3CQ6rdawbZauiQ7csUu X-Received: by 2002:a17:902:6a:: with SMTP id 97-v6mr43266767pla.276.1540225306390; Mon, 22 Oct 2018 09:21:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1540225306; cv=none; d=google.com; s=arc-20160816; b=heeVwchhWFKlLddVk3RsGPHT/CZ5ud3HEzsHiRO1yWz694JqpB0S5b9D738DLWG8AE hlTxuix4hPzZitneYUViun7UtlqqKlggGnpTLJTO7H0UaHO/bANqHt6XXKl1RT9IDtyG foBBhvoJIGHq/rEKmqlk89eIigoXg7UV1wE57nTyzQxle2SMNTRLr8U3WfQIpxFb02rn ZYlpfnwfu0X92Y7z7B3EEaMdMneDNzcUaEEsqSqZ05PEig2VSXYvajLE3boykc5T0T2V zIf913U+/UfNdxeMpID9L+PEeX+f0EpsXhC+D3JNSc4GAV6KMQR7CsTo4/2Y8bwwCZku NaGw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=0BeZzUwDB7FIj3TstfXPmt3784JAFJ86LSWd+LGEMqM=; b=0UFv11gNSOOgrJUc72MqVj1mNX6zDKskXYmjzFtw2ey0mtN1NA1XrVU8Yz4Oj0rHud puUWaHLKPVo4+klDCmksQY0VahyqueSqaBfQ+7kky6KyPT5/jb3cETb0WBN0p3CY4GeD DCMY0q3zu+0ii+SyrYROcAXZTJjjFyOhdEbRg+L2xPQwlYu7TiL3kOor/4XACF5lBTH5 P4FlwNBTZk+synVEFc77SsVYmbg4PM3YwxjZmvn6Or0+g3Tz7JOmMUwKF2bcolnGKKmC QL8bYo3B81wgsZaQtvqVH0e2oXQ+llMKx/1qE3vKp4xtJPhiapvF7g7FEyqafrufRoqI 4PDA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u14-v6si33973557plq.268.2018.10.22.09.21.31; Mon, 22 Oct 2018 09:21:46 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728489AbeJWAfY (ORCPT + 99 others); Mon, 22 Oct 2018 20:35:24 -0400 Received: from mx1.redhat.com ([209.132.183.28]:39793 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727218AbeJWAfY (ORCPT ); Mon, 22 Oct 2018 20:35:24 -0400 Received: from smtp.corp.redhat.com (int-mx02.intmail.prod.int.phx2.redhat.com [10.5.11.12]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 3FD2B3082A26; Mon, 22 Oct 2018 16:16:15 +0000 (UTC) Received: from krava (unknown [10.43.17.150]) by smtp.corp.redhat.com (Postfix) with SMTP id 2F99D70111; Mon, 22 Oct 2018 16:16:14 +0000 (UTC) Date: Mon, 22 Oct 2018 18:16:13 +0200 From: Jiri Olsa To: Don Zickus Cc: David Miller , acme@kernel.org, linux-kernel@vger.kernel.org Subject: Re: perf overlapping maps... Message-ID: <20181022161613.GF2945@krava> References: <20181019.210549.1285253275146712779.davem@davemloft.net> <20181019.214401.2045294780943844999.davem@davemloft.net> <20181022140738.jvutwmstgm2f65et@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181022140738.jvutwmstgm2f65et@redhat.com> User-Agent: Mutt/1.10.1 (2018-07-13) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.12 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.45]); Mon, 22 Oct 2018 16:16:15 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, Oct 22, 2018 at 10:07:38AM -0400, Don Zickus wrote: > (adding Jiri) > > On Fri, Oct 19, 2018 at 09:44:01PM -0700, David Miller wrote: > > From: David Miller > > Date: Fri, 19 Oct 2018 21:05:49 -0700 (PDT) > > > > > One solution I've come up with is: > > > > > > 1) When synthesizing a fork event, set PERF_RECORD_MISC_COMM_EXEC in > > > header->misc. > > > > > > 2) Use this to elide the map groups clone in > > > thread__clone_map_groups(). > > > > Looking into code history, I notice: > > > > commit 363b785f3805a2632eb09a8b430842461c21a640 > > Author: Don Zickus > > Date: Fri Mar 14 10:43:44 2014 -0400 > > > > perf tools: Speed up thread map generation > > > > and the subsequent: > > > > commit 4aa5f4f7bb8bc41cba15bcd0d80c4fb085027d6b > > Author: Arnaldo Carvalho de Melo > > Date: Fri Feb 27 19:52:10 2015 -0300 > > > > perf tools: Fix FORK after COMM when synthesizing records for pre-existing threads > > > > If Don wanted to have the map cloning to happen for processes without > > CLONE_VM, I'm not sure that's right. > > > > For real threads, we just take a reference to the map group from > > the parent. > > > > Don, a quick summary. If we synthesize a fork event, let's say for an > > emacs process. perf will clone the map groups of the parent bash > > shell which invoked emacs. Via: > > > > thread__fork(thread, parent, timestamp) > > { > > ... > > thread__clone_map_groups(thread, parent) > > { > > ... > > map_groups__clone(thread, parent->mg) > > > > Which is completely bogus. It brings all of the bash process maps > > into the emacs thread map group. Then we process the emacs mmap2 > > events, which overlap the bash process maps already cloned into the > > emacs map group. And this make all kinds of erroneous things happen. > > > > I'm suggesting to elide the map groups clone in this situation where > > we are synthesizing the fork. right, this seems correct.. we should only clone parent maps for kernel fork event, not when we synthesize.. I'll check the solution you proposed and try to come with a patch > > Hi David, > > Honestly, I remember very little of this change other than we ran specjbb > which created thousands of threads and we wanted a better way to handle that > situation (waiting 15 minutes seemed wrong). > > Jiri Olsa is probably more knowledgable about this then I am these days and > can work with Joe to re-do the test to verify any suggested changes does not > break our intended use case. > > Thinking about it more, I am wondering if we did this because we ran the > test and it takes about 20 minutes to 'warm up' then we attached perf to the > test. This implies we had to handle the situation where 10K threads already > existed hence our optimization. But I can be wrong. > > Your suggestion is probably right and I am sure we can reproduce the > scenario to verify things didn't regress. I think the fix might actualy speed things up, but yes, there could be other report regressions thanks, jirka