Subject: Re: [LKP] [x86, retpolines] ce02ef06fc: will-it-scale.per_thread_ops 3.1% improvement
To: kernel test robot
Cc: Thomas Gleixner, Jesper Dangaard Brouer, Björn Töpel, Linus Torvalds, "David S. Miller", Magnus Karlsson, Alexei Starovoitov, Peter Zijlstra, David Woodhouse, Andy Lutomirski, Borislav Petkov, LKML, lkp@01.org, greg@kroah.com
References: <20190313052715.GB8429@shao2-debian>
From: Daniel Borkmann
Date: Thu, 14 Mar 2019 09:39:35 +0100
In-Reply-To: <20190313052715.GB8429@shao2-debian>
X-Mailing-List: linux-kernel@vger.kernel.org

On 03/13/2019 06:27 AM, kernel test robot wrote:
> Greeting,
>
> FYI, we noticed a 3.1% improvement of will-it-scale.per_thread_ops due to commit:
>
>
> commit: ce02ef06fcf7a399a6276adb83f37373d10cbbe1 ("x86, retpolines: Raise limit for generating indirect calls from switch-case")
> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
>
> in testcase: will-it-scale
> on test machine: 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory
> with following parameters:
>
> 	nr_task: 100%
> 	mode: thread
> 	test: futex3
> 	cpufreq_governor: performance
> 	ucode: 0xb00002e
>
> test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
> test-url: https://github.com/antonblanchard/will-it-scale
>
> In addition to that, the commit also has significant impact on the following tests:

Any thoughts on whether the above one-liner gcc work-around should be backported to stable as well given these gains?

Thanks,
Daniel

> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_process_ops 4.3% improvement             |
> | test machine     | 112 threads Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz with 128G memory |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=process                                                              |
> |                  | nr_task=50%                                                               |
> |                  | test=futex3                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_process_ops 2.5% improvement             |
> | test machine     | 112 threads Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz with 128G memory |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=process                                                              |
> |                  | nr_task=50%                                                               |
> |                  | test=futex1                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_process_ops 5.8% improvement             |
> | test machine     | 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory      |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=process                                                              |
> |                  | nr_task=50%                                                               |
> |                  | test=futex3                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_process_ops 2.6% improvement             |
> | test machine     | 160 threads Intel(R) Xeon(R) CPU E7-8890 v4 @ 2.20GHz with 256G memory    |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | test=futex1                                                               |
> |                  | ucode=0xb00002e                                                           |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_process_ops 2.5% improvement             |
> | test machine     | 112 threads Intel(R) Xeon(R) Platinum 8170 CPU @ 2.10GHz with 128G memory |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=process                                                              |
> |                  | nr_task=50%                                                               |
> |                  | test=futex2                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_thread_ops 3.1% improvement              |
> | test machine     | 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory      |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=thread                                                               |
> |                  | nr_task=50%                                                               |
> |                  | test=futex4                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_thread_ops 5.4% improvement              |
> | test machine     | 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory      |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=thread                                                               |
> |                  | nr_task=16                                                                |
> |                  | test=futex3                                                               |
> +------------------+---------------------------------------------------------------------------+
> | testcase: change | will-it-scale: will-it-scale.per_thread_ops 3.0% improvement              |
> | test machine     | 88 threads Intel(R) Xeon(R) CPU E5-2699 v4 @ 2.20GHz with 64G memory      |
> | test parameters  | cpufreq_governor=performance                                              |
> |                  | mode=thread                                                               |
> |                  | nr_task=16                                                                |
> |                  | test=futex4                                                               |
> +------------------+---------------------------------------------------------------------------+