Received: by 2002:ac0:a679:0:0:0:0:0 with SMTP id p54csp112950imp; Wed, 20 Feb 2019 20:47:11 -0800 (PST) X-Google-Smtp-Source: AHgI3Iaw0pVbh0O+ACd92qqTBBi+05Ax/7I9yFLs24WZgbptZ8eqMYP4iHcVmB+NhidFafmcSWkY X-Received: by 2002:a63:f40e:: with SMTP id g14mr32518083pgi.326.1550724431243; Wed, 20 Feb 2019 20:47:11 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550724431; cv=none; d=google.com; s=arc-20160816; b=gadNECVX8uRBbFU62arwT0cWFyggk4Mg6o5zhe5hWdwcK8ctBvmgeXpesCgIXefcZA EqnlZbOwAHWd+OXCseme0tD3msQK1ncK/1n9FrxccPmnima8rkNAB6fRiJyHjr8O32KS WHionExiZGZBeOXvC1boM1uMPCW57vH2XGh2tWxY7F4o+53e+yaoCZdavyFgZk70e77m wIC1Ju13uHocfWhasHKE9v2So7zEKS5IVmws6yfLaFeu9BeesUToo8q0u5+G+4s2BdHL OaHvez3k2Ljrl54rU/aA3Iq7c1P+3NJ3Q8SpSsqhK3hYu6jKx7Yf5/2bgsBvtLqQ3Tk8 9omQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:message-id :in-reply-to:date:references:subject:cc:to:from; bh=kwzx5Zx7AYbKweYKjsaxeL1RZtrnMrA6WxUxWXViT8M=; b=yzrRmkfIIrDMB6eqqgLTSNRSzfYGn/JvjVG0OA9FFFMUWKvMtThfZGJLC31Zp2nqnm oYTm3mBPP9ZrrFqsqm4Z4oIu3edXE3DX2/iBr8pUAupLyVgtW+3T33Jx0ebDPYawfiRh BySw6hKP0Pg6AOqHnVnEfq9cGGDmqUCH2oQaNngqB+0I9SnYa/Szuz86h6AT4hFLoWO4 RL8jKgZ0IO2oPtBENC/hR3LxOaPtmv/ypVsluUXBynPR2vTNxC90EIHBKiZgh0NOT7w1 0jCbaMNcCyH32+oU7gexxJdleNZhOuq2Gd3Pr8fnjD0R6MmbGsiibqDMZzzasQQ5ojqL wcJg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g8si7281600pgb.128.2019.02.20.20.46.55; Wed, 20 Feb 2019 20:47:11 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727219AbfBUEqV (ORCPT + 99 others); Wed, 20 Feb 2019 23:46:21 -0500 Received: from mga14.intel.com ([192.55.52.115]:56141 "EHLO mga14.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725880AbfBUEqU (ORCPT ); Wed, 20 Feb 2019 23:46:20 -0500 X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga006.fm.intel.com ([10.253.24.20]) by fmsmga103.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 20 Feb 2019 20:46:20 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.58,393,1544515200"; d="scan'208";a="320786135" Received: from yhuang-dev.sh.intel.com (HELO yhuang-dev) ([10.239.159.151]) by fmsmga006.fm.intel.com with ESMTP; 20 Feb 2019 20:46:19 -0800 From: "Huang\, Ying" To: Wei Yang Cc: kernel test robot , Greg Kroah-Hartman , Stephen Rothwell , "Rafael J. Wysocki" , , LKML Subject: Re: [LKP] [driver core] 570d020012: will-it-scale.per_thread_ops -12.2% regression References: <20190218075442.GI29177@shao2-debian> <20190219005945.GA16734@richard> <20190219121904.GA24103@kroah.com> <20190221031049.GE28258@shao2-debian> <20190221034612.GA15147@richard> Date: Thu, 21 Feb 2019 12:46:18 +0800 In-Reply-To: <20190221034612.GA15147@richard> (Wei Yang's message of "Thu, 21 Feb 2019 11:46:12 +0800") Message-ID: <87h8cx21gl.fsf@yhuang-dev.intel.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/26.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=ascii Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Wei Yang writes: > On Thu, Feb 21, 2019 at 11:10:49AM +0800, kernel test robot wrote: >>On Tue, Feb 19, 2019 at 01:19:04PM +0100, Greg Kroah-Hartman wrote: >>> On Tue, Feb 19, 2019 at 08:59:45AM +0800, Wei Yang wrote: >>> > On Mon, Feb 18, 2019 at 03:54:42PM +0800, kernel test robot wrote: >>> > >Greeting, >>> > > >>> > >FYI, we noticed a -12.2% regression of will-it-scale.per_thread_ops due to commit: >>> > > >>> > > >>> > >commit: 570d0200123fb4f809aa2f6226e93a458d664d70 ("driver core: move device->knode_class to device_private") >>> > >https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master >>> > > >>> > >>> > This is interesting. >>> > >>> > I didn't expect the move of this field will impact the performance. >>> > >>> > The reason is struct device is a hotter memory than device->device_private? >>> > >>> > >in testcase: will-it-scale >>> > >on test machine: 288 threads Knights Mill with 80G memory >>> > >with following parameters: >>> > > >>> > > nr_task: 100% >>> > > mode: thread >>> > > test: unlink2 >>> > > cpufreq_governor: performance >>> > > >>> > >test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two. >>> > >test-url: https://github.com/antonblanchard/will-it-scale >>> > > >>> > >In addition to that, the commit also has significant impact on the following tests: >>> > > >>> > >+------------------+---------------------------------------------------------------+ >>> > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -29.9% regression | >>> > >| test machine | 288 threads Knights Mill with 80G memory | >>> > >| test parameters | cpufreq_governor=performance | >>> > >| | mode=thread | >>> > >| | nr_task=100% | >>> > >| | test=signal1 | >>> >>> Ok, I'm going to blame your testing system, or something here, and not >>> the above patch. >>> >>> All this test does is call raise(3). That does not touch the driver >>> core at all. >>> >>> > >+------------------+---------------------------------------------------------------+ >>> > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -16.5% regression | >>> > >| test machine | 288 threads Knights Mill with 80G memory | >>> > >| test parameters | cpufreq_governor=performance | >>> > >| | mode=thread | >>> > >| | nr_task=100% | >>> > >| | test=open1 | >>> > >+------------------+---------------------------------------------------------------+ >>> >>> Same here, open1 just calls open/close a lot. No driver core >>> interaction at all there either. >>> >>> So are you _sure_ this is the offending patch? >> >>Hi Greg, >> >>We did an experiment, recovered the layout of struct device. and we >>found the regression is gone. I guess the regession is not from the >>patch but related to the struct layout. >> >> >>tests: 1 >>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-unlink2/lkp-knm01 >> >>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>---------------- -------------------------- >> %stddev change %stddev >> \ | \ >> 237096 14% 270789 will-it-scale.workload >> 823 14% 939 will-it-scale.per_thread_ops >> > > Do you have the comparison between a36dc70b810afe9183de2ea18f and the one > before 570d020012? > >> >>tests: 1 >>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-signal1/lkp-knm01 >> >>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>---------------- -------------------------- >> %stddev change %stddev >> \ | \ >> 93.51 3% 48% 138.53 3% will-it-scale.time.user_time >> 186 40% 261 will-it-scale.per_thread_ops >> 53909 40% 75507 will-it-scale.workload >> >> >>tests: 1 >>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-open1/lkp-knm01 >> >>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>---------------- -------------------------- >> %stddev change %stddev >> \ | \ >> 447722 22% 546258 10% will-it-scale.time.involuntary_context_switches >> 226995 19% 269751 will-it-scale.workload >> 787 19% 936 will-it-scale.per_thread_ops >> >> >> >>commit a36dc70b810afe9183de2ea18faa4c0939c139ac >>Author: 0day robot >>Date: Wed Feb 20 14:21:19 2019 +0800 >> >> backfile klist_node in struct device for debugging >> >> Signed-off-by: 0day robot >> >>diff --git a/include/linux/device.h b/include/linux/device.h >>index d0e452fd0bff2..31666cb72b3ba 100644 >>--- a/include/linux/device.h >>+++ b/include/linux/device.h >>@@ -1035,6 +1035,7 @@ struct device { >> spinlock_t devres_lock; >> struct list_head devres_head; >> >>+ struct klist_node knode_class_test_by_rongc; >> struct class *class; >> const struct attribute_group **groups; /* optional groups */ > > Hmm... because this is not properly aligned? > > struct klist_node { > void *n_klist; /* never access directly */ > struct list_head n_node; > struct kref n_ref; > }; > > Except struct kref has one "int" type, others are pointers. > > But... I am still confused. I guess because the size of struct device is changed, it influences some alignment changes in the system. Thus influence the benchmark score. Best Regards, Huang, Ying >> >>Best Regards, >>Rong Chen