Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp59330ybi; Thu, 30 May 2019 20:29:14 -0700 (PDT) X-Google-Smtp-Source: APXvYqyFy2wP9vz5hDgHQQpsTdwRLrwB6OGjbqU3mkAcnB2OcWUIyjaSiIrK72DP9f9S/fJUfIXm X-Received: by 2002:a63:4714:: with SMTP id u20mr6807939pga.347.1559273354210; Thu, 30 May 2019 20:29:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1559273354; cv=none; d=google.com; s=arc-20160816; b=Ke8ruWildaTE06NKJagBt2/TU/+F1AKoMjqljFh9bZhRcykJRfkVmckC/0b1JESNX8 cJv6DgOffbWPXyDTzd69P6cIkv3qy2hkXydjPO9heq1hFGzsYAV9kVlanhtONBwkfyE2 01/+inynpoNGDnXcwZJf9hZoV/+YCDN3CMcNRq5FKI7qGmsVMmnaKMjgKTPycAn5VRMA tfcbh9AvOHzhJFb/Rg6RTXgBNHdRYDhci9cRTY7zbQgu8obyoL8GtNQK1HylqfJMVHQv h7/DH/DRdmzHi3vM7/Zpj/nR4zJVNTb1FCDD53Swn0ad6e0uF6XKJ+nBVMyapCbruFoI kyxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=5ksrR59xlkGgK/S72Zci/IFy8xRiIuooOUZwh5B5OJM=; b=hSJ68sXbDMFjrlrSPLWg5b+If3g0oIySiP5lfQHpGCScl5zQr8BQdA//0SGnLYVtHH IlGzSpAXeKdFpORJlfU2kIduQb0omcW6BdhMwp2hJgBaaB5V8xnxBDHGixdWILEqZZqP W8gnIy1lwk0GB1ttH+y+ajQP7rIO/JIPf6NfmyAiRuJQbrb3h0Qpa26QgnVMQF9+w9OF NXm10euIZv3EI0ddl83yDUAcvQ2a0B6zaCJF8nucGhVE49bxRiBc/BQ7WewiDJhhKhiU pFg2QtdiztojyIgTUs5p/xm/rPAy7oJLpqvuGSQ4dGfomglgBi1jvXMqrxNlB9WcNBWt fjUQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e189si5272766pfe.54.2019.05.30.20.28.46; Thu, 30 May 2019 20:29:14 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726610AbfEaD1P (ORCPT + 99 others); Thu, 30 May 2019 23:27:15 -0400 Received: from mga14.intel.com ([192.55.52.115]:35846 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726418AbfEaD1P (ORCPT ); Thu, 30 May 2019 23:27:15 -0400 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga008.fm.intel.com ([10.253.24.58]) by fmsmga103.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 30 May 2019 20:27:15 -0700 X-ExtLoop1: 1 Received: from xingzhen-mobl1.ccr.corp.intel.com (HELO [10.255.30.215]) ([10.255.30.215]) by fmsmga008.fm.intel.com with ESMTP; 30 May 2019 20:27:13 -0700 Subject: Re: [LKP] [SUNRPC] 0472e47660: fsmark.app_overhead 16.0% regression To: Trond Myklebust , "rong.a.chen@intel.com" Cc: "torvalds@linux-foundation.org" , "lkp@01.org" , "linux-kernel@vger.kernel.org" References: <20190520055434.GZ31424@shao2-debian> <9a07c589f955e5af5acc0fa09a16a3256089e764.camel@hammerspace.com> <9753a9a4a82943f6aacc2bfb0f93efc5f96bcaa5.camel@hammerspace.com> From: Xing Zhengjun Message-ID: <2bbe636a-14f1-4592-d1f9-a9f765a02939@linux.intel.com> Date: Fri, 31 May 2019 11:27:13 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.7.0 MIME-Version: 1.0 In-Reply-To: <9753a9a4a82943f6aacc2bfb0f93efc5f96bcaa5.camel@hammerspace.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 5/31/2019 3:10 AM, Trond Myklebust wrote: > On Thu, 2019-05-30 at 15:20 +0800, Xing Zhengjun wrote: >> >> On 5/30/2019 10:00 AM, Trond Myklebust wrote: >>> Hi Xing, >>> >>> On Thu, 2019-05-30 at 09:35 +0800, Xing Zhengjun wrote: >>>> Hi Trond, >>>> >>>> On 5/20/2019 1:54 PM, kernel test robot wrote: >>>>> Greeting, >>>>> >>>>> FYI, we noticed a 16.0% improvement of fsmark.app_overhead due >>>>> to >>>>> commit: >>>>> >>>>> >>>>> commit: 0472e476604998c127f3c80d291113e77c5676ac ("SUNRPC: >>>>> Convert >>>>> socket page send code to use iov_iter()") >>>>> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git >>>>> master >>>>> >>>>> in testcase: fsmark >>>>> on test machine: 40 threads Intel(R) Xeon(R) CPU E5-2690 v2 @ >>>>> 3.00GHz with 384G memory >>>>> with following parameters: >>>>> >>>>> iterations: 1x >>>>> nr_threads: 64t >>>>> disk: 1BRD_48G >>>>> fs: xfs >>>>> fs2: nfsv4 >>>>> filesize: 4M >>>>> test_size: 40G >>>>> sync_method: fsyncBeforeClose >>>>> cpufreq_governor: performance >>>>> >>>>> test-description: The fsmark is a file system benchmark to test >>>>> synchronous write workloads, for example, mail servers >>>>> workload. >>>>> test-url: https://sourceforge.net/projects/fsmark/ >>>>> >>>>> >>>>> >>>>> Details are as below: >>>>> ------------------------------------------------------------- >>>>> ---- >>>>> ---------------------------------> >>>>> >>>>> >>>>> To reproduce: >>>>> >>>>> git clone https://github.com/intel/lkp-tests.git >>>>> cd lkp-tests >>>>> bin/lkp install job.yaml # job file is attached in >>>>> this >>>>> email >>>>> bin/lkp run job.yaml >>>>> >>>>> =============================================================== >>>>> ==== >>>>> ====================== >>>>> compiler/cpufreq_governor/disk/filesize/fs2/fs/iterations/kconf >>>>> ig/n >>>>> r_threads/rootfs/sync_method/tbox_group/test_size/testcase: >>>>> gcc-7/performance/1BRD_48G/4M/nfsv4/xfs/1x/x86_64-rhel- >>>>> 7.6/64t/debian-x86_64-2018-04-03.cgz/fsyncBeforeClose/lkp-ivb- >>>>> ep01/40G/fsmark >>>>> >>>>> commit: >>>>> e791f8e938 ("SUNRPC: Convert xs_send_kvec() to use >>>>> iov_iter_kvec()") >>>>> 0472e47660 ("SUNRPC: Convert socket page send code to use >>>>> iov_iter()") >>>>> >>>>> e791f8e9380d945e 0472e476604998c127f3c80d291 >>>>> ---------------- --------------------------- >>>>> fail:runs %reproduction fail:runs >>>>> | | | >>>>> :4 50% 2:4 dmesg.WARNING:a >>>>> t#for >>>>> _ip_interrupt_entry/0x >>>>> %stddev %change %stddev >>>>> \ | \ >>>>> 15118573 >>>>> ± 2% +16.0% 17538083 fsmark.app_overhead >>>>> 510.93 - >>>>> 22.7% 395.12 fsmark.files_per_sec >>>>> 24.90 +22.8% 30.57 fsmark.time.ela >>>>> psed_ >>>>> time >>>>> 24.90 +22.8% 30.57 fsmark.time.ela >>>>> psed_ >>>>> time.max >>>>> 288.00 ± 2% - >>>>> 27.8% 208.00 fsmark.time.percent_of_cpu_this_job_got >>>>> 70.03 ± 2% - >>>>> 11.3% 62.14 fsmark.time.system_time >>>>> >>>> >>>> Do you have time to take a look at this regression? >>> >>> From your stats, it looks to me as if the problem is increased >>> NUMA >>> overhead. Pretty much everything else appears to be the same or >>> actually performing better than previously. Am I interpreting that >>> correctly? >> The real regression is the throughput(fsmark.files_per_sec) is >> decreased >> by 22.7%. > > Understood, but I'm trying to make sense of why. I'm not able to > reproduce this, so I have to rely on your performance stats to > understand where the 22.7% regression is coming from. As far as I can > see, the only numbers in the stats you published that are showing a > performance regression (other than the fsmark number itself), are the > NUMA numbers. Is that a correct interpretation? > We re-test the case yesterday, the test result almost is the same. we will do more test and also check the test case itself, if you need more information, please let me know, thanks. >>> If my interpretation above is correct, then I'm not seeing where >>> this >>> patch would be introducing new NUMA regressions. It is just >>> converting >>> from using one method of doing socket I/O to another. Could it >>> perhaps >>> be a memory artefact due to your running the NFS client and server >>> on >>> the same machine? >>> >>> Apologies for pushing back a little, but I just don't have the >>> hardware available to test NUMA configurations, so I'm relying on >>> external testing for the above kind of scenario. >>> >> Thanks for looking at this. If you need more information, please let >> me >> know. >>> Thanks >>> Trond >>> -- Zhengjun Xing