Received: by 2002:a05:6358:11c7:b0:104:8066:f915 with SMTP id i7csp6946775rwl; Wed, 22 Mar 2023 19:02:35 -0700 (PDT) X-Google-Smtp-Source: AK7set93vRGmq+eYVDoqRVDPOWDVAlGwrWnEkeyIOsrRc1Ye2BQfYcnn5/hklc9cSXxGYDYN7jcf X-Received: by 2002:a17:90b:1a8e:b0:234:b964:5703 with SMTP id ng14-20020a17090b1a8e00b00234b9645703mr5729430pjb.18.1679536955644; Wed, 22 Mar 2023 19:02:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1679536955; cv=none; d=google.com; s=arc-20160816; b=KMIBtrHTpjhvH/TJO4xR71cqF/mxhXe2XoKSzuMOqp25CH9OQ4tK+ODj6YSfNdtPhL 0sWw6nPGiQGTk4ZBF0U095ln+meBXAGpiNI/RtfCUNgJCs3Npgjm3A1KYQnVMrDUTfaK Q+xuWqBUcNmNtNayU3x1FYN0yJQ1gRBLEJGMv5s+0ThIiA4GybY3nAD31/x89ZgI1q70 3PQXaJawtbRQ+v/X1jzuftZHpCkcrhpdLuDuHK3wxqN/BCQyl6NrT78X1EviDDoAF0fB qoO3f0Pns3Mah8QP193iQFWCMM9xataWx6GtH3S4U8klFhsOCltrbEeLJIFBY1ysotCw yFzQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:message-id:in-reply-to:date:references:subject:cc:to :from:dkim-signature; bh=67ZmEesRIp68q4h1/fO7I9QkviUhR17FEi/A7TaygJY=; b=aGhPNAMGHqXobzDznMjc28PIILneOegRdEPqK56WLto0gobCFLj4LjsCsWGPCNupZj eRxsrYqXClkGW0DUtUwgwDfZnvwKEa68uGlHhgOYwLwlJbURSkwcqtrrxrnNxFcfHpL4 Y27SJhgo00E58rTovUe1IXcyMAz5P5k7yVjy3+CyB5bH5nQYEvvqnH7cX9vH2LgD3GH3 rZgUNoaVZluepRi6/n92dTU1T+BZ47TA2JMYudu6XKODX7OpNP2h4k4xFHfVOAF3SrHm srWIw+wqeHbRVKRhk+WTZzWkSaDEPdfaIgPKMIkBUuiHgG0I7BbKHQb16qaLUsEC0+TX LHPQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=HfzeWcDG; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id i21-20020a17090ad35500b0023369af18e2si464514pjx.72.2023.03.22.19.02.24; Wed, 22 Mar 2023 19:02:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel.com header.s=Intel header.b=HfzeWcDG; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229496AbjCWByd (ORCPT + 99 others); Wed, 22 Mar 2023 21:54:33 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35258 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230039AbjCWByc (ORCPT ); Wed, 22 Mar 2023 21:54:32 -0400 Received: from mga14.intel.com (mga14.intel.com [192.55.52.115]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id E1AE6868B for ; Wed, 22 Mar 2023 18:54:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=intel.com; i=@intel.com; q=dns/txt; s=Intel; t=1679536470; x=1711072470; h=from:to:cc:subject:references:date:in-reply-to: message-id:mime-version:content-transfer-encoding; bh=CwS67PYYqH0PatMp8p449KP+I7kSWUyHQOcV0z+jSpk=; b=HfzeWcDGlnh4+oUG6ZhmCSQTn+6Y8d/f9V7ZHCC7bE7sP/6ti/AwtuAl LuIC8o7z0HJ3/sE2P5/L9PyObU2E599rAle6Zl7z5eHFYeCYIXNURmcg6 vx7KDX7JxqkdR2gGnZZrl0EajHa9yCFdyBkEWvC6g9J4Km4zLLG52GEHe z4xyHyVtSxfffCYoxMkgkoMumfF9UsMCSCMFeoylcsYNhzMyjcyFQQIfP PuD5a6dcwFeTdMpZE1N43YjKcmfdO4a3WMQGjVv7h/lc4/0yqnJ/Kgtjj BoQyMy9A8zHx2ouyyVLWt0XC2nCUBfzBLHd1QVM9P0VvWhwSJJjuHj0UG g==; X-IronPort-AV: E=McAfee;i="6600,9927,10657"; a="339410413" X-IronPort-AV: E=Sophos;i="5.98,283,1673942400"; d="scan'208";a="339410413" Received: from orsmga001.jf.intel.com ([10.7.209.18]) by fmsmga103.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 22 Mar 2023 18:54:30 -0700 X-ExtLoop1: 1 X-IronPort-AV: E=McAfee;i="6600,9927,10657"; a="714618278" X-IronPort-AV: E=Sophos;i="5.98,283,1673942400"; d="scan'208";a="714618278" Received: from yhuang6-desk2.sh.intel.com (HELO yhuang6-desk2.ccr.corp.intel.com) ([10.238.208.55]) by orsmga001-auth.jf.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 22 Mar 2023 18:54:25 -0700 From: "Huang, Ying" To: "Liu, Yujie" Cc: lkp , "bharata@amd.com" , "Yin, Fengwei" , "willy@infradead.org" , "mike.kravetz@oracle.com" , "shy828301@gmail.com" , "akpm@linux-foundation.org" , "linux-kernel@vger.kernel.org" , "xhao@linux.alibaba.com" , "Tang, Feng" , "linux-mm@kvack.org" , "oe-lkp@lists.linux.dev" , "ziy@nvidia.com" , "zhengjun.xing@linux.intel.com" , "osalvador@suse.de" , "baolin.wang@linux.alibaba.com" , "minchan@kernel.org" , "42.hyeyoo@gmail.com" <42.hyeyoo@gmail.com>, "apopple@nvidia.com" Subject: Re: [linus:master] [migrate_pages] 7e12beb8ca: vm-scalability.throughput -3.4% regression References: <202303192325.ecbaf968-yujie.liu@intel.com> <87o7onua4t.fsf@yhuang6-desk2.ccr.corp.intel.com> <87h6ueu0ae.fsf@yhuang6-desk2.ccr.corp.intel.com> Date: Thu, 23 Mar 2023 09:53:23 +0800 In-Reply-To: (Yujie Liu's message of "Wed, 22 Mar 2023 13:17:54 +0800") Message-ID: <87zg84s06k.fsf@yhuang6-desk2.ccr.corp.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Spam-Status: No, score=-2.5 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_NONE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org "Liu, Yujie" writes: > On Tue, 2023-03-21 at 13:43 +0800, Huang, Ying wrote: >> "Liu, Yujie" writes: >> >> > Hi Ying, >> > >> > On Mon, 2023-03-20 at 15:58 +0800, Huang, Ying wrote: >> > > Hi, Yujie, >> > > >> > > kernel test robot writes: >> > > >> > > > Hello, >> > > > >> > > > FYI, we noticed a -3.4% regression of vm-scalability.throughput du= e to commit: >> > > > >> > > > commit: 7e12beb8ca2ac98b2ec42e0ea4b76cdc93b58654 ("migrate_pages: = batch flushing TLB") >> > > > https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git ma= ster >> > > > >> > > > in testcase: vm-scalability >> > > > on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Platinum 82= 60L CPU @ 2.40GHz (Cascade Lake) with 128G memory >> > > > with following parameters: >> > > > >> > > > runtime: 300s >> > > > size: 512G >> > > > test: anon-cow-rand-mt >> > > > cpufreq_governor: performance >> > > > >> > > > test-description: The motivation behind this suite is to exercise = functions and regions of the mm/ of the Linux kernel which are of interest = to us. >> > > > test-url: https://git.kernel.org/cgit/linux/kernel/git/wfg/vm-scal= ability.git/ >> > > > >> > > > >> > > > If you fix the issue, kindly add following tag >> > > > > Reported-by: kernel test robot >> > > > > Link: https://lore.kernel.org/oe-lkp/202303192325.ecbaf968-yujie= .liu@intel.com >> > > > >> > > >> > > Thanks a lot for report! Can you try whether the debug patch as >> > > below can restore the regression? >> > >> > We've tested the patch and found the throughput score was partially >> > restored from -3.6% to -1.4%, still with a slight performance drop. >> > Please check the detailed data as follows: >> >> Good! Thanks for your detailed data! >> >> > 0.09 =C2=B1 17% +1.2 1.32 =C2=B1 7% +0.4 = 0.45 =C2=B1 21% perf-profile.children.cycles-pp.flush_tlb_func >> >> It appears that we can reduce the unnecessary TLB flushing effectively >> with the previous debug patch. But the batched flush (full flush) is >> still slower than the non-batched flush (flush one page). >> >> Can you try the debug patch as below to check whether it can restore the >> regression completely? The new debug patch can be applied on top of the >> previous debug patch. > > The second debug patch got a -0.7% performance change. The data have > some fluctuations from test to test, and the standard deviation is even > a bit larger than 0.7%, which make the performance score not very > convincing. Please check other metrics to see if the regression is > fully restored. Thanks. Thanks for testing! > 0.09 =C2=B1 17% +0.4 0.45 =C2=B1 21% +0.0 0= .09 =C2=B1 12% perf-profile.children.cycles-pp.flush_tlb_func From the profiling data, the TLB flushing overhead has been restored. So I think the remaining 0.7% regression should be at noise level. I will prepare the fixing patch based on the test results. Best Regards, Huang, Ying