Received: by 2002:ac0:98c7:0:0:0:0:0 with SMTP id g7-v6csp1816910imd; Sat, 27 Oct 2018 18:52:58 -0700 (PDT) X-Google-Smtp-Source: AJdET5e861Xrr0kollKun16eKyhX7R2QPEta+fq38inIHIbLZZH7cq96Ra0XYt1ZGBMz4XgYcAUX X-Received: by 2002:a63:dd58:: with SMTP id g24-v6mr8655294pgj.86.1540691578063; Sat, 27 Oct 2018 18:52:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1540691578; cv=none; d=google.com; s=arc-20160816; b=pIkL2ABG4EAi66bA2fEf63JjDbExLFoCRarvpz7JRaWPsIyRXyvzrlFVjPRe9u6UOf 4VbrfTDFnNuwPRL4bsDW3sWiE+kxEQkOxUcDnI4+V3f5GvENso7AeVsdHM9JwHkNirm7 H5UKxT2Ucr5/7iKz81/VvbFsQLiHFtke8XJNY1Y69zaKALIcsQ8DaJFQ0UUMa6hWGZ5s kWtuVl9nP1394r54nXnQlFE8ZicrMPpkQ3lSpkrkqJqIJ3Ix9uMcWlOLhQLS3IqgqK/N cw74XUx8JrbqaH3wqtMJVWB1rsDNyet6dggVYHv0v5LTcJ6H1x9YF9g5svFVlEZBD92S fcVg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:content-transfer-encoding :dlp-reaction:dlp-version:dlp-product:content-language :accept-language:in-reply-to:references:message-id:date:thread-index :thread-topic:subject:cc:to:from; bh=aFqeTGCM1cGf3+FMZMqdMWD2Vc7gJnn+kE7mth1aPc8=; b=h+7+++wYXjk6B7WQwBrUOw2WfPHN9vbRfBkjxJTn/ZUVKQ0wQJKcNjhoMnlEXB4l4j p9HH2e5jAcKkXEfx7UupwyCxmcGoVz5q1zdjG33mNY+USH3QEdaShC+VZfJclK7/70ct WvdrME/mPuF8uqbG/7OjNXu8eCiieGeAUBb+gOXVBuNzODeMk44iwx4Ci8gwG0sm17vn SkTuXtz+S12GrzcKq2IDiu/Z36+ZvYYw9DLh9bVjeeSSba3Wg0yWh4+NdybDW8pP3rz6 ePMY8UYp98gDFO/vR4RzYfxBDCMFIGXRa787bz2xS7kvEhUbRgzYvCbp02J8eag5ZyEe FDQQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u27-v6si16976224pfk.82.2018.10.27.18.52.16; Sat, 27 Oct 2018 18:52:57 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729093AbeJ1K0O convert rfc822-to-8bit (ORCPT + 99 others); Sun, 28 Oct 2018 06:26:14 -0400 Received: from mga06.intel.com ([134.134.136.31]:41493 "EHLO mga06.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729073AbeJ1K0N (ORCPT ); Sun, 28 Oct 2018 06:26:13 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from orsmga006.jf.intel.com ([10.7.209.51]) by orsmga104.jf.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 27 Oct 2018 18:43:14 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.54,434,1534834800"; d="scan'208";a="86122482" Received: from fmsmsx105.amr.corp.intel.com ([10.18.124.203]) by orsmga006.jf.intel.com with ESMTP; 27 Oct 2018 18:43:13 -0700 Received: from fmsmsx113.amr.corp.intel.com (10.18.116.7) by FMSMSX105.amr.corp.intel.com (10.18.124.203) with Microsoft SMTP Server (TLS) id 14.3.319.2; Sat, 27 Oct 2018 18:43:13 -0700 Received: from shsmsx102.ccr.corp.intel.com (10.239.4.154) by FMSMSX113.amr.corp.intel.com (10.18.116.7) with Microsoft SMTP Server (TLS) id 14.3.319.2; Sat, 27 Oct 2018 18:43:12 -0700 Received: from shsmsx101.ccr.corp.intel.com ([169.254.1.102]) by shsmsx102.ccr.corp.intel.com ([169.254.2.84]) with mapi id 14.03.0415.000; Sun, 28 Oct 2018 09:43:04 +0800 From: "Wang, Kemi" To: Eric Dumazet , "Chen, Rong A" , Yuchung Cheng CC: Soheil Hassas Yeganeh , "netdev@vger.kernel.org" , LKML , Eric Dumazet , "lkp@01.org" , Wei Wang , Neal Cardwell , "David S. Miller" Subject: RE: [LKP] [tcp] a337531b94: netperf.Throughput_Mbps -6.1% regression Thread-Topic: [LKP] [tcp] a337531b94: netperf.Throughput_Mbps -6.1% regression Thread-Index: AQHUa5MQe+4SDBcIYEaP1lHzVUjOs6Ut3UYAgAYIYeA= Date: Sun, 28 Oct 2018 01:43:02 +0000 Message-ID: <25017BF213203E48912DB000DE5F5E1E7632A249@SHSMSX101.ccr.corp.intel.com> References: <20181024121358.GM24195@shao2-debian> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: x-titus-metadata-40: eyJDYXRlZ29yeUxhYmVscyI6IiIsIk1ldGFkYXRhIjp7Im5zIjoiaHR0cDpcL1wvd3d3LnRpdHVzLmNvbVwvbnNcL0ludGVsMyIsImlkIjoiNmE4YjY1OWUtZWMzNi00ZDcyLTlkY2YtNzhmMzg0YmFjMWFjIiwicHJvcHMiOlt7Im4iOiJDVFBDbGFzc2lmaWNhdGlvbiIsInZhbHMiOlt7InZhbHVlIjoiQ1RQX05UIn1dfV19LCJTdWJqZWN0TGFiZWxzIjpbXSwiVE1DVmVyc2lvbiI6IjE3LjEwLjE4MDQuNDkiLCJUcnVzdGVkTGFiZWxIYXNoIjoiYkNXbDBkbXIwZnBKK0xjK3BVb01pYUE0b3R1WjEySTRFdURYUHordmNNQlpXMHZIcktSYkVUNTRxYkgxYVNjVyJ9 x-ctpclassification: CTP_NT dlp-product: dlpe-windows dlp-version: 11.0.0.116 dlp-reaction: no-action x-originating-ip: [10.239.127.40] Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 8BIT MIME-Version: 1.0 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, Eric Thanks for the info. We rerun the test and verified that this issue has been fixed with commit 041a14d2671573611ffd6412bc16e2f64469f7fb. Only about 0.1% performance difference was observed. -----Original Message----- From: LKP [mailto:lkp-bounces@lists.01.org] On Behalf Of Eric Dumazet Sent: Wednesday, October 24, 2018 9:27 PM To: Chen, Rong A ; Yuchung Cheng Cc: Soheil Hassas Yeganeh ; netdev@vger.kernel.org; LKML ; Eric Dumazet ; lkp@01.org; Wei Wang ; Neal Cardwell ; David S. Miller Subject: Re: [LKP] [tcp] a337531b94: netperf.Throughput_Mbps -6.1% regression Hi Rong This has been reported already, and we believe this has been fixed with : commit 041a14d2671573611ffd6412bc16e2f64469f7fb Author: Yuchung Cheng Date: Mon Oct 1 15:42:32 2018 -0700 tcp: start receiver buffer autotuning sooner Previously receiver buffer auto-tuning starts after receiving one advertised window amount of data. After the initial receiver buffer was raised by patch a337531b942b ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB"), the reciver buffer may take too long to start raising. To address this issue, this patch lowers the initial bytes expected to receive roughly the expected sender's initial window. Fixes: a337531b942b ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") Signed-off-by: Yuchung Cheng Signed-off-by: Wei Wang Signed-off-by: Neal Cardwell Signed-off-by: Eric Dumazet Reviewed-by: Soheil Hassas Yeganeh Signed-off-by: David S. Miller Thanks On 10/24/2018 05:13 AM, kernel test robot wrote: > Greeting, > > FYI, we noticed a -6.1% regression of netperf.Throughput_Mbps due to commit: > > > commit: a337531b942bd8a03e7052444d7e36972aac2d92 ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") > https://git.kernel.org/cgit/linux/kernel/git/davem/net-next.git master > > in testcase: netperf > on test machine: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory > with following parameters: > > ip: ipv4 > runtime: 900s > nr_threads: 200% > cluster: cs-localhost > test: TCP_STREAM > ucode: 0x7000013 > cpufreq_governor: performance > > test-description: Netperf is a benchmark that can be use to measure various aspect of networking performance. > test-url: http://www.netperf.org/netperf/ > > In addition to that, the commit also has significant impact on the following tests: > > +------------------+-------------------------------------------------------------------+ > | testcase: change | netperf: netperf.Throughput_Mbps -1.0% regression | > | test machine | 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory | > | test parameters | cluster=cs-localhost | > | | cpufreq_governor=performance | > | | ip=ipv4 | > | | nr_threads=200% | > | | runtime=300s | > | | send_size=5K | > | | test=TCP_SENDFILE | > | | ucode=0x7000013 | > +------------------+-------------------------------------------------------------------+ > | testcase: change | netperf: netperf.Throughput_Mbps -5.9% regression | > | test machine | 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory | > | test parameters | cluster=cs-localhost | > | | cpufreq_governor=performance | > | | ip=ipv4 | > | | nr_threads=200% | > | | runtime=900s | > | | test=TCP_MAERTS | > | | ucode=0x7000013 | > +------------------+-------------------------------------------------------------------+ > | testcase: change | netperf: netperf.Throughput_Mbps -3.2% regression | > | test machine | 4 threads Intel(R) Core(TM) i5-3317U CPU @ 1.70GHz with 4G memory | > | test parameters | cluster=cs-localhost | > | | cpufreq_governor=performance | > | | ip=ipv4 | > | | nr_threads=200% | > | | runtime=900s | > | | test=TCP_MAERTS | > | | ucode=0x20 | > +------------------+-------------------------------------------------------------------+ > > > Details are as below: > --------------------------------------------------------------------------------------------------> > > > To reproduce: > > git clone https://github.com/intel/lkp-tests.git > cd lkp-tests > bin/lkp install job.yaml # job file is attached in this email > bin/lkp run job.yaml > > ========================================================================================= > cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/tbox_group/test/testcase/ucode: > cs-localhost/gcc-7/performance/ipv4/x86_64-rhel-7.2/200%/debian-x86_64-2018-04-03.cgz/900s/lkp-bdw-de1/TCP_STREAM/netperf/0x7000013 > > commit: > 3ff6cde846 ("hns3: Another build fix.") > a337531b94 ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") > > 3ff6cde846857d45 a337531b942bd8a03e7052444d > ---------------- -------------------------- > fail:runs %reproduction fail:runs > | | | > :4 50% 2:4 dmesg.WARNING:at#for_ip_interrupt_entry/0x > %stddev %change %stddev > \ | \ > 2497 -6.1% 2345 netperf.Throughput_Mbps > 79924 -6.1% 75061 netperf.Throughput_total_Mbps > 186513 +11.3% 207590 netperf.time.involuntary_context_switches > 5.488e+08 -6.1% 5.154e+08 netperf.workload > 1172 ? 34% -37.6% 731.75 ? 5% cpuidle.C1E.usage > 1137 ? 34% -40.0% 682.25 ? 8% turbostat.C1E > 2775 ? 11% +17.5% 3261 ? 9% sched_debug.cpu.nr_switches.stddev > 0.01 ? 17% +28.2% 0.01 ? 10% sched_debug.rt_rq:/.rt_time.avg > 0.14 ? 17% +28.2% 0.18 ? 10% sched_debug.rt_rq:/.rt_time.max > 0.03 ? 17% +28.2% 0.04 ? 10% sched_debug.rt_rq:/.rt_time.stddev > 66336 +0.9% 66948 proc-vmstat.nr_anon_pages > 2.755e+08 -6.1% 2.588e+08 proc-vmstat.numa_hit > 2.755e+08 -6.1% 2.588e+08 proc-vmstat.numa_local > 2.197e+09 -6.1% 2.064e+09 proc-vmstat.pgalloc_normal > 2.197e+09 -6.1% 2.064e+09 proc-vmstat.pgfree > 5.903e+11 -7.9% 5.438e+11 perf-stat.branch-instructions > 2.68 -0.0 2.64 perf-stat.branch-miss-rate% > 1.582e+10 -9.2% 1.436e+10 perf-stat.branch-misses > 6.26e+11 -4.7% 5.964e+11 perf-stat.cache-misses > 6.26e+11 -4.7% 5.964e+11 perf-stat.cache-references > 11.69 +8.6% 12.69 perf-stat.cpi > 123723 +2.1% 126291 perf-stat.cpu-migrations > 0.09 ? 2% +0.0 0.09 perf-stat.dTLB-load-miss-rate% > 1.475e+12 -7.1% 1.37e+12 perf-stat.dTLB-loads > 1.094e+12 -6.9% 1.018e+12 perf-stat.dTLB-stores > 2.912e+08 ? 5% -13.0% 2.533e+08 perf-stat.iTLB-loads > 3.019e+12 -7.9% 2.781e+12 perf-stat.instructions > 0.09 -7.9% 0.08 perf-stat.ipc > 5500 -1.9% 5394 perf-stat.path-length > 0.53 ? 2% -0.2 0.38 ? 57% perf-profile.calltrace.cycles-pp.ip_output.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames > 0.63 ? 2% -0.1 0.58 ? 4% perf-profile.calltrace.cycles-pp.syscall_return_via_sysret > 0.73 ? 3% +0.1 0.78 ? 2% perf-profile.calltrace.cycles-pp.tcp_clean_rtx_queue.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv > 0.96 +0.1 1.03 perf-profile.calltrace.cycles-pp.tcp_ack.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_local_deliver_finish > 98.02 +0.1 98.13 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe > 97.88 +0.1 98.00 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe > 0.70 ? 3% -0.1 0.64 ? 4% perf-profile.children.cycles-pp.syscall_return_via_sysret > 0.26 ? 5% -0.0 0.21 ? 6% perf-profile.children.cycles-pp._raw_spin_lock_bh > 0.28 ? 5% -0.0 0.24 ? 6% perf-profile.children.cycles-pp.lock_sock_nested > 0.46 ? 4% -0.0 0.43 ? 2% perf-profile.children.cycles-pp.nf_hook_slow > 0.21 ? 8% -0.0 0.18 ? 5% perf-profile.children.cycles-pp.tcp_rcv_space_adjust > 0.08 ? 5% -0.0 0.06 perf-profile.children.cycles-pp.entry_SYSCALL_64_stage2 > 0.08 ? 6% -0.0 0.06 ? 6% perf-profile.children.cycles-pp.ip_finish_output > 0.17 ? 6% +0.0 0.20 ? 5% perf-profile.children.cycles-pp.tcp_event_new_data_sent > 0.24 ? 4% +0.0 0.27 ? 2% perf-profile.children.cycles-pp.mod_timer > 0.15 ? 2% +0.0 0.18 ? 2% perf-profile.children.cycles-pp.__might_sleep > 0.80 ? 3% +0.0 0.84 ? 2% perf-profile.children.cycles-pp.tcp_clean_rtx_queue > 0.30 ? 3% +0.1 0.36 ? 4% perf-profile.children.cycles-pp.__might_fault > 1.61 ? 4% +0.1 1.69 perf-profile.children.cycles-pp.__release_sock > 1.06 ? 2% +0.1 1.14 perf-profile.children.cycles-pp.tcp_ack > 98.24 +0.1 98.36 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe > 98.09 +0.1 98.23 perf-profile.children.cycles-pp.do_syscall_64 > 70.28 +0.6 70.86 perf-profile.children.cycles-pp.copy_user_enhanced_fast_string > 1.56 -0.1 1.48 ? 3% perf-profile.self.cycles-pp.copy_page_to_iter > 0.70 ? 3% -0.1 0.64 ? 4% perf-profile.self.cycles-pp.syscall_return_via_sysret > 1.37 ? 2% -0.1 1.32 ? 2% perf-profile.self.cycles-pp.__free_pages_ok > 0.55 ? 3% -0.0 0.50 ? 3% perf-profile.self.cycles-pp.__alloc_skb > 0.44 ? 3% -0.0 0.40 ? 5% perf-profile.self.cycles-pp.tcp_recvmsg > 0.16 ? 9% -0.0 0.14 ? 5% perf-profile.self.cycles-pp.sock_has_perm > 0.08 ? 6% -0.0 0.06 perf-profile.self.cycles-pp.entry_SYSCALL_64_stage2 > 0.10 ? 4% +0.0 0.12 ? 6% perf-profile.self.cycles-pp.tcp_clean_rtx_queue > 0.14 ? 6% +0.0 0.17 ? 4% perf-profile.self.cycles-pp.__might_sleep > 69.25 +0.5 69.77 perf-profile.self.cycles-pp.copy_user_enhanced_fast_string > > > > netperf.Throughput_Mbps > > 3000 +-+------------------------------------------------------------------+ > | | > 2500 +-+..+.+..+.+..+.+..+.+..+.+..+.+..+.+.+..+.+..+.+..+.+..+.+..+.+..+.| > O O O O O O O O O O O O O O O O O O O O O O O O O | > | : | > 2000 +-+ | > |: | > 1500 +-+ | > |: | > 1000 +-+ | > |: | > |: | > 500 +-+ | > | | > 0 +-+------------------------------------------------------------------+ > > > netperf.Throughput_total_Mbps > > 90000 +-+-----------------------------------------------------------------+ > | | > 80000 O-O..O.O..O.O..O.O.O..O.O..O.O..O.O.O..O.O..O.O..O.O.O..O.O..+.+..+.| > 70000 +-+ | > | : | > 60000 +-+ | > 50000 +-+ | > |: | > 40000 +-+ | > 30000 +-+ | > |: | > 20000 +-+ | > 10000 +-+ | > | | > 0 +-+-----------------------------------------------------------------+ > > > netperf.workload > > 6e+08 +-+-----------------------------------------------------------------+ > | +..+.+..+.+..+.+.+..+.+..+.+..+.+.+..+.+..+.+..+.+.+..+.+..+.+..+.| > 5e+08 O-O O O O O O O O O O O O O O O O O O O O O O O O | > | : | > | : | > 4e+08 +-+ | > |: | > 3e+08 +-+ | > |: | > 2e+08 +-+ | > |: | > | | > 1e+08 +-+ | > | | > 0 +-+-----------------------------------------------------------------+ > > > [*] bisect-good sample > [O] bisect-bad sample > > *************************************************************************************************** > lkp-bdw-de1: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory > ========================================================================================= > cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/send_size/tbox_group/test/testcase/ucode: > cs-localhost/gcc-7/performance/ipv4/x86_64-rhel-7.2/200%/debian-x86_64-2018-04-03.cgz/300s/5K/lkp-bdw-de1/TCP_SENDFILE/netperf/0x7000013 > > commit: > 3ff6cde846 ("hns3: Another build fix.") > a337531b94 ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") > > 3ff6cde846857d45 a337531b942bd8a03e7052444d > ---------------- -------------------------- > fail:runs %reproduction fail:runs > | | | > 1:4 -25% :4 dmesg.WARNING:at#for_ip_interrupt_entry/0x > %stddev %change %stddev > \ | \ > 5211 -1.0% 5160 netperf.Throughput_Mbps > 166777 -1.0% 165138 netperf.Throughput_total_Mbps > 1268 -1.6% 1247 netperf.time.percent_of_cpu_this_job_got > 3539 -1.6% 3481 netperf.time.system_time > 282.77 -1.5% 278.54 netperf.time.user_time > 1435875 -1.0% 1421780 netperf.time.voluntary_context_switches > 1.222e+09 -1.0% 1.21e+09 netperf.workload > 22728 -1.3% 22437 vmstat.system.cs > 1218263 ? 3% -5.6% 1150027 ? 4% proc-vmstat.pgalloc_normal > 1197588 ? 4% -6.0% 1125684 ? 4% proc-vmstat.pgfree > 3424 ? 17% -28.2% 2456 ? 21% sched_debug.cpu.nr_load_updates.stddev > 9.00 ? 11% -19.9% 7.21 ? 11% sched_debug.cpu.nr_uninterruptible.max > 35344728 ? 33% -94.5% 1954598 ?144% cpuidle.C3.time > 79217 ? 32% -95.5% 3571 ?115% cpuidle.C3.usage > 13342584 ? 19% +253.4% 47153200 ? 34% cpuidle.C6.time > 17886 ? 21% +185.8% 51115 ? 34% cpuidle.C6.usage > 4295 ? 24% +108.0% 8934 ? 53% cpuidle.POLL.time > 79180 ? 32% -95.6% 3487 ?118% turbostat.C3 > 0.73 ? 32% -0.7 0.04 ?144% turbostat.C3% > 17693 ? 21% +187.9% 50931 ? 34% turbostat.C6 > 0.27 ? 19% +0.7 0.97 ? 34% turbostat.C6% > 0.35 ? 30% -89.9% 0.04 ?173% turbostat.CPU%c3 > 0.08 ? 6% +693.3% 0.59 ? 38% turbostat.CPU%c6 > 2.95 +3.1% 3.04 turbostat.RAMWatt > 1.711e+12 -1.3% 1.689e+12 perf-stat.branch-instructions > 5.345e+10 -1.2% 5.283e+10 perf-stat.branch-misses > 9.417e+10 +16.7% 1.099e+11 perf-stat.cache-misses > 9.417e+10 +16.7% 1.099e+11 perf-stat.cache-references > 6927335 -1.1% 6849494 perf-stat.context-switches > 2.936e+12 -1.3% 2.899e+12 perf-stat.dTLB-loads > 1.796e+12 -1.3% 1.773e+12 perf-stat.dTLB-stores > 80.43 +3.5 83.95 perf-stat.iTLB-load-miss-rate% > 3.809e+09 ? 4% -4.7% 3.629e+09 ? 2% perf-stat.iTLB-load-misses > 9.248e+08 ? 3% -25.0% 6.934e+08 perf-stat.iTLB-loads > 8.835e+12 -1.3% 8.719e+12 perf-stat.instructions > 69.17 -1.1 68.08 perf-profile.calltrace.cycles-pp.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe > 65.80 -1.0 64.79 perf-profile.calltrace.cycles-pp.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe > 55.88 -0.8 55.04 perf-profile.calltrace.cycles-pp.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe > 52.32 -0.8 51.56 perf-profile.calltrace.cycles-pp.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64.do_syscall_64 > 35.71 -0.6 35.11 perf-profile.calltrace.cycles-pp.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile.__x64_sys_sendfile64 > 34.84 -0.6 34.26 perf-profile.calltrace.cycles-pp.splice_from_pipe.direct_splice_actor.splice_direct_to_actor.do_splice_direct.do_sendfile > 33.94 -0.5 33.41 perf-profile.calltrace.cycles-pp.__splice_from_pipe.splice_from_pipe.direct_splice_actor.splice_direct_to_actor.do_splice_direct > 26.16 -0.5 25.70 perf-profile.calltrace.cycles-pp.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage > 30.02 -0.5 29.55 perf-profile.calltrace.cycles-pp.pipe_to_sendpage.__splice_from_pipe.splice_from_pipe.direct_splice_actor.splice_direct_to_actor > 28.77 -0.4 28.34 perf-profile.calltrace.cycles-pp.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.splice_from_pipe.direct_splice_actor > 27.68 -0.4 27.27 perf-profile.calltrace.cycles-pp.inet_sendpage.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe > 27.98 -0.4 27.58 perf-profile.calltrace.cycles-pp.kernel_sendpage.sock_sendpage.pipe_to_sendpage.__splice_from_pipe.splice_from_pipe > 20.30 -0.3 19.95 perf-profile.calltrace.cycles-pp.tcp_sendpage_locked.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage > 19.49 -0.3 19.16 perf-profile.calltrace.cycles-pp.do_tcp_sendpages.tcp_sendpage_locked.tcp_sendpage.inet_sendpage.kernel_sendpage > 9.78 -0.2 9.53 perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage_locked.tcp_sendpage > 9.94 -0.2 9.70 perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage_locked.tcp_sendpage.inet_sendpage > 6.32 -0.2 6.09 perf-profile.calltrace.cycles-pp.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.do_tcp_sendpages.tcp_sendpage_locked > 5.59 -0.2 5.42 perf-profile.calltrace.cycles-pp.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames.do_tcp_sendpages > 5.19 -0.2 5.02 perf-profile.calltrace.cycles-pp.ip_output.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit.__tcp_push_pending_frames > 4.79 -0.2 4.62 perf-profile.calltrace.cycles-pp.ip_rcv.__netif_receive_skb_one_core.process_backlog.net_rx_action.__softirqentry_text_start > 5.51 -0.2 5.35 perf-profile.calltrace.cycles-pp.__softirqentry_text_start.do_softirq_own_stack.do_softirq.__local_bh_enable_ip.ip_finish_output2 > 5.00 -0.2 4.84 perf-profile.calltrace.cycles-pp.__netif_receive_skb_one_core.process_backlog.net_rx_action.__softirqentry_text_start.do_softirq_own_stack > 5.52 -0.2 5.36 perf-profile.calltrace.cycles-pp.do_softirq_own_stack.do_softirq.__local_bh_enable_ip.ip_finish_output2.ip_output > 5.37 -0.2 5.21 perf-profile.calltrace.cycles-pp.net_rx_action.__softirqentry_text_start.do_softirq_own_stack.do_softirq.__local_bh_enable_ip > 4.68 -0.2 4.53 perf-profile.calltrace.cycles-pp.security_file_permission.do_sendfile.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe > 5.61 -0.2 5.46 perf-profile.calltrace.cycles-pp.do_softirq.__local_bh_enable_ip.ip_finish_output2.ip_output.__ip_queue_xmit > 5.21 -0.2 5.06 perf-profile.calltrace.cycles-pp.process_backlog.net_rx_action.__softirqentry_text_start.do_softirq_own_stack.do_softirq > 4.58 -0.2 4.42 perf-profile.calltrace.cycles-pp.ip_finish_output2.ip_output.__ip_queue_xmit.__tcp_transmit_skb.tcp_write_xmit > 5.66 -0.2 5.50 perf-profile.calltrace.cycles-pp.__local_bh_enable_ip.ip_finish_output2.ip_output.__ip_queue_xmit.__tcp_transmit_skb > 4.39 -0.2 4.24 perf-profile.calltrace.cycles-pp.__entry_SYSCALL_64_trampoline > 2.87 ? 2% -0.1 2.76 perf-profile.calltrace.cycles-pp.selinux_file_permission.security_file_permission.do_sendfile.__x64_sys_sendfile64.do_syscall_64 > 1.25 ? 3% -0.1 1.15 perf-profile.calltrace.cycles-pp.__inode_security_revalidate.selinux_file_permission.security_file_permission.do_sendfile.__x64_sys_sendfile64 > 4.30 -0.1 4.20 perf-profile.calltrace.cycles-pp.ip_local_deliver_finish.ip_local_deliver.ip_rcv.__netif_receive_skb_one_core.process_backlog > 1.86 -0.1 1.77 ? 3% perf-profile.calltrace.cycles-pp.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage.sock_sendpage > 1.14 -0.1 1.08 ? 2% perf-profile.calltrace.cycles-pp.file_has_perm.security_file_permission.do_splice_direct.do_sendfile.__x64_sys_sendfile64 > 0.69 -0.1 0.63 perf-profile.calltrace.cycles-pp.tcp_release_cb.release_sock.tcp_sendpage.inet_sendpage.kernel_sendpage > 0.61 ? 2% -0.1 0.56 ? 2% perf-profile.calltrace.cycles-pp.__might_fault.__x64_sys_sendfile64.do_syscall_64.entry_SYSCALL_64_after_hwframe > 0.61 ? 2% -0.0 0.57 ? 4% perf-profile.calltrace.cycles-pp.avc_has_perm.file_has_perm.security_file_permission.do_splice_direct.do_sendfile > 0.57 ? 2% +0.0 0.61 ? 2% perf-profile.calltrace.cycles-pp.___might_sleep.__might_fault.copy_page_to_iter.skb_copy_datagram_iter.tcp_recvmsg > 90.63 +0.2 90.83 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe > 91.39 +0.2 91.62 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe > 20.12 +1.3 21.46 perf-profile.calltrace.cycles-pp.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe > 20.10 +1.3 21.44 perf-profile.calltrace.cycles-pp.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe > 19.84 +1.4 21.24 perf-profile.calltrace.cycles-pp.tcp_recvmsg.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64 > 19.89 +1.4 21.30 perf-profile.calltrace.cycles-pp.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom.do_syscall_64.entry_SYSCALL_64_after_hwframe > 15.07 +1.6 16.65 perf-profile.calltrace.cycles-pp.skb_copy_datagram_iter.tcp_recvmsg.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom > 14.25 +1.6 15.82 perf-profile.calltrace.cycles-pp.copy_page_to_iter.skb_copy_datagram_iter.tcp_recvmsg.inet_recvmsg.__sys_recvfrom > 11.15 +1.6 12.74 perf-profile.calltrace.cycles-pp.copyout.copy_page_to_iter.skb_copy_datagram_iter.tcp_recvmsg.inet_recvmsg > 10.84 +1.6 12.45 perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyout.copy_page_to_iter.skb_copy_datagram_iter.tcp_recvmsg > 69.33 -1.1 68.23 perf-profile.children.cycles-pp.__x64_sys_sendfile64 > 65.94 -1.0 64.92 perf-profile.children.cycles-pp.do_sendfile > 55.98 -0.8 55.14 perf-profile.children.cycles-pp.do_splice_direct > 52.38 -0.8 51.60 perf-profile.children.cycles-pp.splice_direct_to_actor > 35.77 -0.6 35.16 perf-profile.children.cycles-pp.direct_splice_actor > 34.91 -0.6 34.33 perf-profile.children.cycles-pp.splice_from_pipe > 34.07 -0.5 33.53 perf-profile.children.cycles-pp.__splice_from_pipe > 30.09 -0.5 29.62 perf-profile.children.cycles-pp.pipe_to_sendpage > 26.31 -0.5 25.86 perf-profile.children.cycles-pp.tcp_sendpage > 28.85 -0.4 28.42 perf-profile.children.cycles-pp.sock_sendpage > 27.75 -0.4 27.33 perf-profile.children.cycles-pp.inet_sendpage > 28.05 -0.4 27.65 perf-profile.children.cycles-pp.kernel_sendpage > 20.38 -0.3 20.03 perf-profile.children.cycles-pp.tcp_sendpage_locked > 19.62 -0.3 19.29 perf-profile.children.cycles-pp.do_tcp_sendpages > 9.69 -0.3 9.42 perf-profile.children.cycles-pp.security_file_permission > 8.60 -0.2 8.38 perf-profile.children.cycles-pp.__tcp_transmit_skb > 10.66 -0.2 10.43 perf-profile.children.cycles-pp.tcp_write_xmit > 10.79 -0.2 10.56 perf-profile.children.cycles-pp.__tcp_push_pending_frames > 7.82 -0.2 7.64 perf-profile.children.cycles-pp.__ip_queue_xmit > 7.38 -0.2 7.20 perf-profile.children.cycles-pp.ip_output > 6.36 -0.2 6.19 perf-profile.children.cycles-pp.__local_bh_enable_ip > 5.95 -0.2 5.78 perf-profile.children.cycles-pp.__entry_SYSCALL_64_trampoline > 4.86 -0.2 4.69 perf-profile.children.cycles-pp.ip_rcv > 5.07 -0.2 4.91 perf-profile.children.cycles-pp.__netif_receive_skb_one_core > 5.44 -0.2 5.29 perf-profile.children.cycles-pp.net_rx_action > 5.58 -0.2 5.42 perf-profile.children.cycles-pp.do_softirq_own_stack > 5.28 -0.2 5.13 perf-profile.children.cycles-pp.process_backlog > 6.70 -0.2 6.55 perf-profile.children.cycles-pp.ip_finish_output2 > 5.67 -0.1 5.52 perf-profile.children.cycles-pp.do_softirq > 2.76 ? 3% -0.1 2.62 perf-profile.children.cycles-pp.__inode_security_revalidate > 1.39 ? 4% -0.1 1.27 ? 2% perf-profile.children.cycles-pp._cond_resched > 4.45 -0.1 4.34 perf-profile.children.cycles-pp.ip_local_deliver > 0.73 ? 5% -0.1 0.64 ? 3% perf-profile.children.cycles-pp.rcu_all_qs > 0.72 -0.1 0.65 perf-profile.children.cycles-pp.tcp_release_cb > 0.30 ? 5% -0.1 0.24 ? 3% perf-profile.children.cycles-pp.tcp_rcv_space_adjust > 0.43 ? 4% -0.0 0.39 ? 5% perf-profile.children.cycles-pp.copy_user_generic_unrolled > 0.17 ? 7% -0.0 0.12 ? 6% perf-profile.children.cycles-pp.ip_rcv_finish_core > 0.19 ? 7% -0.0 0.15 ? 6% perf-profile.children.cycles-pp.ip_rcv_finish > 0.14 ? 5% -0.0 0.11 ? 8% perf-profile.children.cycles-pp.tcp_rearm_rto > 0.10 ? 11% -0.0 0.06 ? 6% perf-profile.children.cycles-pp.sockfd_lookup_light > 0.07 ? 5% +0.0 0.09 ? 5% perf-profile.children.cycles-pp.skb_entail > 0.11 ? 3% +0.0 0.13 ? 6% perf-profile.children.cycles-pp.scheduler_tick > 0.51 ? 3% +0.0 0.55 ? 3% perf-profile.children.cycles-pp.tcp_established_options > 90.70 +0.2 90.90 perf-profile.children.cycles-pp.do_syscall_64 > 91.47 +0.2 91.70 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe > 20.13 +1.3 21.47 perf-profile.children.cycles-pp.__x64_sys_recvfrom > 20.10 +1.3 21.44 perf-profile.children.cycles-pp.__sys_recvfrom > 19.89 +1.4 21.30 perf-profile.children.cycles-pp.inet_recvmsg > 19.84 +1.4 21.26 perf-profile.children.cycles-pp.tcp_recvmsg > 16.63 +1.6 18.19 perf-profile.children.cycles-pp.copy_page_to_iter > 15.08 +1.6 16.66 perf-profile.children.cycles-pp.skb_copy_datagram_iter > 11.24 +1.6 12.82 perf-profile.children.cycles-pp.copyout > 11.24 +1.6 12.82 perf-profile.children.cycles-pp.copy_user_enhanced_fast_string > 5.68 -0.2 5.51 perf-profile.self.cycles-pp.__entry_SYSCALL_64_trampoline > 0.67 -0.1 0.60 ? 2% perf-profile.self.cycles-pp.tcp_release_cb > 0.93 ? 2% -0.1 0.86 ? 2% perf-profile.self.cycles-pp.__inode_security_revalidate > 1.09 ? 2% -0.0 1.05 ? 2% perf-profile.self.cycles-pp.do_syscall_64 > 0.16 ? 9% -0.0 0.12 ? 7% perf-profile.self.cycles-pp.ip_rcv_finish_core > 0.09 ? 11% -0.0 0.05 ? 62% perf-profile.self.cycles-pp.__tcp_ack_snd_check > 0.40 ? 3% -0.0 0.36 ? 7% perf-profile.self.cycles-pp.copy_user_generic_unrolled > 0.80 -0.0 0.77 ? 2% perf-profile.self.cycles-pp.current_time > 0.28 ? 2% -0.0 0.25 ? 3% perf-profile.self.cycles-pp.tcp_recvmsg > 0.27 ? 6% -0.0 0.24 ? 5% perf-profile.self.cycles-pp.__alloc_skb > 0.18 ? 6% -0.0 0.15 ? 7% perf-profile.self.cycles-pp.tcp_mstamp_refresh > 0.10 ? 5% -0.0 0.08 ? 5% perf-profile.self.cycles-pp.__tcp_select_window > 0.22 ? 3% +0.0 0.24 ? 2% perf-profile.self.cycles-pp._raw_spin_lock_irqsave > 0.46 ? 5% +0.0 0.51 ? 4% perf-profile.self.cycles-pp.tcp_established_options > 11.14 +1.5 12.68 perf-profile.self.cycles-pp.copy_user_enhanced_fast_string > > > > *************************************************************************************************** > lkp-bdw-de1: 16 threads Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 8G memory > ========================================================================================= > cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/tbox_group/test/testcase/ucode: > cs-localhost/gcc-7/performance/ipv4/x86_64-rhel-7.2/200%/debian-x86_64-2018-04-03.cgz/900s/lkp-bdw-de1/TCP_MAERTS/netperf/0x7000013 > > commit: > 3ff6cde846 ("hns3: Another build fix.") > a337531b94 ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") > > 3ff6cde846857d45 a337531b942bd8a03e7052444d > ---------------- -------------------------- > fail:runs %reproduction fail:runs > | | | > 1:4 2% 1:4 perf-profile.children.cycles-pp.schedule_timeout > %stddev %change %stddev > \ | \ > 2497 -5.9% 2349 netperf.Throughput_Mbps > 79914 -5.9% 75172 netperf.Throughput_total_Mbps > 2472 +4.7% 2588 netperf.time.maximum_resident_set_size > 8998 +8.0% 9715 netperf.time.minor_page_faults > 88.91 -13.7% 76.77 netperf.time.user_time > 5.487e+08 -5.9% 5.162e+08 netperf.workload > 50507215 ? 49% -63.0% 18671277 ? 27% cpuidle.C3.time > 111760 ? 6% +12.4% 125584 ? 3% meminfo.DirectMap4k > 0.35 ? 49% -0.2 0.13 ? 29% turbostat.C3% > 42.19 -1.2% 41.70 turbostat.PkgWatt > 1988 +9.6% 2180 ? 2% sched_debug.cfs_rq:/.util_est_enqueued.max > 401.62 ? 3% +11.2% 446.64 ? 4% sched_debug.cfs_rq:/.util_est_enqueued.stddev > 3.91 ? 12% -18.4% 3.19 ? 14% sched_debug.cpu.nr_uninterruptible.stddev > 697.25 ? 4% +48.3% 1034 ? 19% slabinfo.dmaengine-unmap-16.active_objs > 697.25 ? 4% +48.3% 1034 ? 19% slabinfo.dmaengine-unmap-16.num_objs > 1464 ? 11% -20.9% 1157 ? 9% slabinfo.skbuff_head_cache.active_objs > 1464 ? 11% -20.9% 1157 ? 9% slabinfo.skbuff_head_cache.num_objs > 70462 +1.3% 71390 proc-vmstat.nr_active_anon > 66190 +1.5% 67154 proc-vmstat.nr_anon_pages > 70462 +1.3% 71390 proc-vmstat.nr_zone_active_anon > 2.756e+08 -6.0% 2.592e+08 proc-vmstat.numa_hit > 2.756e+08 -6.0% 2.592e+08 proc-vmstat.numa_local > 2.197e+09 -6.0% 2.067e+09 proc-vmstat.pgalloc_normal > 2.197e+09 -6.0% 2.066e+09 proc-vmstat.pgfree > 5.831e+11 -7.8% 5.377e+11 perf-stat.branch-instructions > 1.567e+10 -8.9% 1.428e+10 perf-stat.branch-misses > 6.246e+11 -4.4% 5.974e+11 perf-stat.cache-misses > 6.246e+11 -4.4% 5.974e+11 perf-stat.cache-references > 11.79 +8.4% 12.78 perf-stat.cpi > 122574 +2.4% 125502 perf-stat.cpu-migrations > 1.473e+12 -7.0% 1.369e+12 perf-stat.dTLB-loads > 0.07 ? 13% +0.0 0.09 ? 6% perf-stat.dTLB-store-miss-rate% > 7.83e+08 ? 13% +15.6% 9.049e+08 ? 6% perf-stat.dTLB-store-misses > 1.092e+12 -6.8% 1.017e+12 perf-stat.dTLB-stores > 1.153e+09 -10.1% 1.037e+09 perf-stat.iTLB-load-misses > 2.66e+08 ? 4% -7.0% 2.474e+08 perf-stat.iTLB-loads > 2.994e+12 -7.8% 2.761e+12 perf-stat.instructions > 0.08 -7.8% 0.08 perf-stat.ipc > 5456 -2.0% 5348 perf-stat.path-length > 2.62 -0.1 2.49 perf-profile.calltrace.cycles-pp.tcp_write_xmit.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv > 2.64 -0.1 2.51 perf-profile.calltrace.cycles-pp.__tcp_push_pending_frames.tcp_rcv_established.tcp_v4_do_rcv.tcp_v4_rcv.ip_local_deliver_finish > 2.83 -0.1 2.73 perf-profile.calltrace.cycles-pp.__free_pages_ok.skb_release_data.__kfree_skb.tcp_recvmsg.inet_recvmsg > 3.64 -0.1 3.54 perf-profile.calltrace.cycles-pp.__kfree_skb.tcp_recvmsg.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom > 3.27 -0.1 3.18 perf-profile.calltrace.cycles-pp.skb_release_data.__kfree_skb.tcp_recvmsg.inet_recvmsg.__sys_recvfrom > 98.03 +0.1 98.11 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe > 97.89 +0.1 97.96 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe > 0.44 ? 58% +0.3 0.71 ? 5% perf-profile.calltrace.cycles-pp.smp_apic_timer_interrupt.apic_timer_interrupt.copy_user_enhanced_fast_string.copyout.copy_page_to_iter > 2.92 ? 6% +0.4 3.29 ? 4% perf-profile.calltrace.cycles-pp.apic_timer_interrupt.copy_user_enhanced_fast_string.copyout.copy_page_to_iter.skb_copy_datagram_iter > 0.00 +0.5 0.55 ? 6% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.smp_apic_timer_interrupt.apic_timer_interrupt.copy_user_enhanced_fast_string.copyout > 3.64 -0.1 3.52 perf-profile.children.cycles-pp.tcp_write_xmit > 3.60 -0.1 3.48 perf-profile.children.cycles-pp.__tcp_push_pending_frames > 2.84 -0.1 2.74 perf-profile.children.cycles-pp.__free_pages_ok > 4.08 -0.1 4.00 perf-profile.children.cycles-pp.__kfree_skb > 0.80 ? 2% -0.1 0.74 ? 3% perf-profile.children.cycles-pp.__entry_SYSCALL_64_trampoline > 0.23 ? 4% -0.0 0.20 ? 5% perf-profile.children.cycles-pp.__sk_mem_schedule > 0.22 ? 4% -0.0 0.19 ? 5% perf-profile.children.cycles-pp.__sk_mem_raise_allocated > 0.06 -0.0 0.04 ? 57% perf-profile.children.cycles-pp.tcp_release_cb > 0.08 ? 6% -0.0 0.06 ? 15% perf-profile.children.cycles-pp.__tcp_select_window > 0.23 +0.0 0.24 ? 2% perf-profile.children.cycles-pp.__tcp_send_ack > 0.06 ? 11% +0.0 0.08 ? 5% perf-profile.children.cycles-pp.___perf_sw_event > 0.06 ? 14% +0.0 0.09 ? 13% perf-profile.children.cycles-pp.tcp_write_timer_handler > 0.12 ? 7% +0.0 0.15 ? 5% perf-profile.children.cycles-pp.update_curr > 0.06 ? 11% +0.0 0.09 ? 17% perf-profile.children.cycles-pp.call_timer_fn > 0.17 ? 4% +0.0 0.20 ? 3% perf-profile.children.cycles-pp.___slab_alloc > 0.18 ? 4% +0.0 0.21 ? 3% perf-profile.children.cycles-pp.__slab_alloc > 0.05 ? 58% +0.0 0.08 ? 15% perf-profile.children.cycles-pp.tcp_write_timer > 0.04 ? 58% +0.0 0.08 ? 16% perf-profile.children.cycles-pp.tcp_send_loss_probe > 0.32 ? 3% +0.0 0.35 perf-profile.children.cycles-pp.kmem_cache_alloc_node > 0.14 ? 7% +0.0 0.19 ? 16% perf-profile.children.cycles-pp.preempt_schedule_common > 0.21 ? 12% +0.1 0.27 ? 6% perf-profile.children.cycles-pp.task_tick_fair > 0.00 +0.1 0.06 ? 11% perf-profile.children.cycles-pp.__tcp_retransmit_skb > 0.51 ? 3% +0.1 0.57 ? 6% perf-profile.children.cycles-pp.__sched_text_start > 1.61 +0.1 1.68 ? 2% perf-profile.children.cycles-pp.__release_sock > 1.06 ? 3% +0.1 1.14 ? 2% perf-profile.children.cycles-pp.tcp_ack > 0.28 ? 9% +0.1 0.36 ? 4% perf-profile.children.cycles-pp.scheduler_tick > 98.09 +0.1 98.18 perf-profile.children.cycles-pp.do_syscall_64 > 98.23 +0.1 98.32 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe > 0.49 ? 8% +0.1 0.58 ? 5% perf-profile.children.cycles-pp.update_process_times > 0.50 ? 8% +0.1 0.61 ? 6% perf-profile.children.cycles-pp.tick_sched_handle > 0.54 ? 9% +0.1 0.67 ? 5% perf-profile.children.cycles-pp.tick_sched_timer > 0.79 ? 8% +0.1 0.93 ? 3% perf-profile.children.cycles-pp.__hrtimer_run_queues > 0.93 ? 9% +0.2 1.09 ? 2% perf-profile.children.cycles-pp.hrtimer_interrupt > 1.13 ? 10% +0.2 1.37 ? 4% perf-profile.children.cycles-pp.smp_apic_timer_interrupt > 2.51 ? 6% +0.4 2.87 ? 3% perf-profile.children.cycles-pp.apic_timer_interrupt > 70.21 +0.4 70.63 perf-profile.children.cycles-pp.copy_user_enhanced_fast_string > 1.61 -0.1 1.49 ? 2% perf-profile.self.cycles-pp.copy_page_to_iter > 0.78 ? 2% -0.1 0.72 ? 3% perf-profile.self.cycles-pp.__entry_SYSCALL_64_trampoline > 1.37 -0.1 1.32 perf-profile.self.cycles-pp.__free_pages_ok > 0.21 ? 5% -0.0 0.18 ? 4% perf-profile.self.cycles-pp.__sk_mem_raise_allocated > 0.65 ? 2% -0.0 0.62 perf-profile.self.cycles-pp.free_one_page > 0.41 ? 2% -0.0 0.39 ? 4% perf-profile.self.cycles-pp.skb_copy_datagram_iter > 0.08 ? 6% -0.0 0.06 ? 15% perf-profile.self.cycles-pp.__tcp_select_window > 0.10 ? 5% -0.0 0.08 ? 8% perf-profile.self.cycles-pp.import_single_range > 0.14 ? 5% +0.0 0.16 ? 5% perf-profile.self.cycles-pp.___slab_alloc > 0.19 ? 3% +0.0 0.21 ? 3% perf-profile.self.cycles-pp.kmem_cache_alloc_node > 0.15 ? 4% +0.0 0.17 ? 4% perf-profile.self.cycles-pp.__might_sleep > 0.03 ?100% +0.0 0.07 ? 13% perf-profile.self.cycles-pp.___perf_sw_event > > > > *************************************************************************************************** > lkp-u410: 4 threads Intel(R) Core(TM) i5-3317U CPU @ 1.70GHz with 4G memory > ========================================================================================= > cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/tbox_group/test/testcase/ucode: > cs-localhost/gcc-7/performance/ipv4/x86_64-rhel-7.2/200%/debian-x86_64-2018-04-03.cgz/900s/lkp-u410/TCP_MAERTS/netperf/0x20 > > commit: > 3ff6cde846 ("hns3: Another build fix.") > a337531b94 ("tcp: up initial rmem to 128KB and SYN rwin to around 64KB") > > 3ff6cde846857d45 a337531b942bd8a03e7052444d > ---------------- -------------------------- > fail:runs %reproduction fail:runs > | | | > 4:4 -100% :4 dmesg.RIP:intel_modeset_init[i915] > 4:4 -100% :4 dmesg.WARNING:at_drivers/gpu/drm/i915/intel_display.c:#intel_modeset_init[i915] > 2:4 -3% 2:4 perf-profile.children.cycles-pp.schedule_timeout > %stddev %change %stddev > \ | \ > 3879 -3.2% 3753 netperf.Throughput_Mbps > 31036 -3.2% 30030 netperf.Throughput_total_Mbps > 2463 +3.6% 2552 netperf.time.maximum_resident_set_size > 2499 +7.5% 2685 netperf.time.minor_page_faults > 24.96 -14.8% 21.28 ? 8% netperf.time.user_time > 543040 ? 13% -15.9% 456816 ? 2% netperf.time.voluntary_context_switches > 2.131e+08 -3.2% 2.062e+08 netperf.workload > 21274 +3.3% 21986 interrupts.CAL:Function_call_interrupts > 826.00 ? 6% -27.1% 602.00 ? 23% slabinfo.skbuff_head_cache.active_objs > 3904 ? 2% -4.5% 3728 vmstat.system.cs > 56.50 ? 2% +8.8% 61.50 ? 5% turbostat.CoreTmp > 56.75 ? 2% +8.4% 61.50 ? 5% turbostat.PkgTmp > 4224 ?173% +294.2% 16653 ? 52% sched_debug.cfs_rq:/.spread0.avg > 110.92 ? 8% -22.2% 86.34 ? 10% sched_debug.cfs_rq:/.util_avg.stddev > 896147 ? 3% -11.3% 795033 ? 4% sched_debug.cpu.avg_idle.max > 162406 ? 9% -26.1% 119960 ? 21% sched_debug.cpu.avg_idle.stddev > 59886 ? 3% -3.8% 57590 proc-vmstat.nr_dirty_background_threshold > 119920 ? 3% -3.8% 115322 proc-vmstat.nr_dirty_threshold > 628429 ? 3% -3.7% 605425 proc-vmstat.nr_free_pages > 1.071e+08 -3.2% 1.036e+08 proc-vmstat.numa_hit > 1.071e+08 -3.2% 1.036e+08 proc-vmstat.numa_local > 8.503e+08 -3.2% 8.229e+08 proc-vmstat.pgfree > 2.265e+11 -5.7% 2.135e+11 perf-stat.branch-instructions > 3.01 -0.1 2.94 perf-stat.branch-miss-rate% > 6.809e+09 -7.8% 6.279e+09 ? 3% perf-stat.branch-misses > 30.13 +2.0 32.13 perf-stat.cache-miss-rate% > 5.149e+10 +3.2% 5.314e+10 perf-stat.cache-misses > 1.709e+11 -3.2% 1.654e+11 perf-stat.cache-references > 3532029 ? 2% -4.5% 3373137 perf-stat.context-switches > 7.31 +6.2% 7.76 perf-stat.cpi > 5.633e+09 ? 2% -5.8% 5.308e+09 perf-stat.dTLB-load-misses > 7.264e+11 -4.1% 6.964e+11 perf-stat.dTLB-loads > 6.35e+11 -4.0% 6.097e+11 perf-stat.dTLB-stores > 4.029e+08 -7.1% 3.743e+08 ? 2% perf-stat.iTLB-load-misses > 1.157e+12 -5.7% 1.091e+12 perf-stat.instructions > 0.14 -5.8% 0.13 perf-stat.ipc > 5426 -2.5% 5289 perf-stat.path-length > 1.16 ? 6% -0.2 0.99 ? 3% perf-profile.calltrace.cycles-pp.__entry_SYSCALL_64_trampoline > 0.99 ? 6% -0.1 0.88 ? 10% perf-profile.calltrace.cycles-pp.tcp_v4_do_rcv.__release_sock.release_sock.tcp_recvmsg.inet_recvmsg > 96.58 +0.3 96.87 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe > 26.12 ? 2% +1.3 27.40 perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyin._copy_from_iter_full.tcp_sendmsg_locked.tcp_sendmsg > 26.39 ? 2% +1.3 27.69 perf-profile.calltrace.cycles-pp.copyin._copy_from_iter_full.tcp_sendmsg_locked.tcp_sendmsg.sock_sendmsg > 27.12 ? 3% +1.4 28.48 perf-profile.calltrace.cycles-pp._copy_from_iter_full.tcp_sendmsg_locked.tcp_sendmsg.sock_sendmsg.__sys_sendto > 41.73 ? 2% +1.7 43.40 ? 2% perf-profile.calltrace.cycles-pp.tcp_sendmsg_locked.tcp_sendmsg.sock_sendmsg.__sys_sendto.__x64_sys_sendto > 43.17 ? 2% +1.7 44.87 ? 2% perf-profile.calltrace.cycles-pp.tcp_sendmsg.sock_sendmsg.__sys_sendto.__x64_sys_sendto.do_syscall_64 > 43.75 ? 2% +1.8 45.51 perf-profile.calltrace.cycles-pp.sock_sendmsg.__sys_sendto.__x64_sys_sendto.do_syscall_64.entry_SYSCALL_64_after_hwframe > 44.88 ? 2% +1.8 46.63 perf-profile.calltrace.cycles-pp.__x64_sys_sendto.do_syscall_64.entry_SYSCALL_64_after_hwframe > 44.73 ? 2% +1.8 46.53 perf-profile.calltrace.cycles-pp.__sys_sendto.__x64_sys_sendto.do_syscall_64.entry_SYSCALL_64_after_hwframe > 1.38 ? 6% -0.2 1.20 ? 3% perf-profile.children.cycles-pp.__entry_SYSCALL_64_trampoline > 0.42 ? 9% -0.1 0.31 ? 9% perf-profile.children.cycles-pp.tcp_queue_rcv > 0.79 ? 6% -0.1 0.68 ? 5% perf-profile.children.cycles-pp.ktime_get_with_offset > 0.32 ? 12% -0.1 0.21 ? 33% perf-profile.children.cycles-pp.scheduler_tick > 0.35 ? 12% -0.1 0.26 ? 11% perf-profile.children.cycles-pp.tcp_try_coalesce > 0.29 ? 10% -0.1 0.20 ? 17% perf-profile.children.cycles-pp.skb_try_coalesce > 0.88 ? 2% -0.1 0.79 ? 4% perf-profile.children.cycles-pp.tcp_mstamp_refresh > 0.32 ? 9% -0.1 0.26 ? 18% perf-profile.children.cycles-pp.ip_local_out > 0.41 ? 3% +0.0 0.45 ? 4% perf-profile.children.cycles-pp.selinux_ip_postroute > 0.03 ?102% +0.1 0.09 ? 24% perf-profile.children.cycles-pp.lock_timer_base > 0.00 +0.1 0.08 ? 29% perf-profile.children.cycles-pp.raw_local_deliver > 0.57 ? 4% +0.1 0.66 ? 7% perf-profile.children.cycles-pp.tcp_event_new_data_sent > 0.20 ? 28% +0.1 0.29 ? 21% perf-profile.children.cycles-pp._cond_resched > 64.27 +0.5 64.78 perf-profile.children.cycles-pp.copy_user_enhanced_fast_string > 26.41 ? 2% +1.3 27.70 perf-profile.children.cycles-pp.copyin > 27.16 ? 3% +1.3 28.50 perf-profile.children.cycles-pp._copy_from_iter_full > 41.76 ? 2% +1.7 43.44 ? 2% perf-profile.children.cycles-pp.tcp_sendmsg_locked > 43.19 ? 2% +1.7 44.88 ? 2% perf-profile.children.cycles-pp.tcp_sendmsg > 44.88 ? 2% +1.8 46.65 perf-profile.children.cycles-pp.__x64_sys_sendto > 43.75 ? 2% +1.8 45.51 perf-profile.children.cycles-pp.sock_sendmsg > 44.74 ? 2% +1.8 46.54 perf-profile.children.cycles-pp.__sys_sendto > 1.21 ? 8% -0.2 0.99 ? 5% perf-profile.self.cycles-pp.copy_page_to_iter > 1.32 ? 6% -0.2 1.15 ? 3% perf-profile.self.cycles-pp.__entry_SYSCALL_64_trampoline > 0.29 ? 9% -0.1 0.20 ? 18% perf-profile.self.cycles-pp.skb_try_coalesce > 0.50 ? 9% -0.1 0.42 ? 10% perf-profile.self.cycles-pp.ktime_get_with_offset > 0.19 ? 14% -0.1 0.12 ? 10% perf-profile.self.cycles-pp.__local_bh_enable_ip > 0.08 ? 10% -0.0 0.03 ?102% perf-profile.self.cycles-pp.selinux_sock_rcv_skb_compat > 0.13 ? 3% -0.0 0.08 ? 57% perf-profile.self.cycles-pp.__x64_sys_sendto > 0.07 ? 12% -0.0 0.03 ?100% perf-profile.self.cycles-pp._raw_spin_unlock_irqrestore > 0.11 ? 11% -0.0 0.08 ? 22% perf-profile.self.cycles-pp.__sys_recvfrom > 0.05 ? 61% +0.0 0.09 ? 11% perf-profile.self.cycles-pp.selinux_ip_postroute > 0.09 ? 20% +0.1 0.15 ? 31% perf-profile.self.cycles-pp.rcu_all_qs > 0.00 +0.1 0.07 ? 28% perf-profile.self.cycles-pp.raw_local_deliver > > > > > > Disclaimer: > Results have been estimated based on internal Intel analysis and are provided > for informational purposes only. Any difference in system hardware or software > design or configuration may affect actual performance. > > > Thanks, > Rong Chen > _______________________________________________ LKP mailing list LKP@lists.01.org https://lists.01.org/mailman/listinfo/lkp