Received: by 2002:ac0:a679:0:0:0:0:0 with SMTP id p54csp161306imp; Wed, 20 Feb 2019 22:03:25 -0800 (PST) X-Google-Smtp-Source: AHgI3IaNmhtbp0Nsu40VBVURDviWeUrMzA6Z6FfY4jw8TGw1DNjEWjzxdaLvG5C9TwdEZ7cEgsgm X-Received: by 2002:a63:e950:: with SMTP id q16mr33120966pgj.138.1550729004989; Wed, 20 Feb 2019 22:03:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550729004; cv=none; d=google.com; s=arc-20160816; b=uZ7SUjo1bAtfkQ+qFG5TWC6vOS6Jfa+BwKrhYQnEUmMh8aTWV2O3rXAxEGsi5aZP2a Vk0k3Zq/8deNlJMEwAEliA8h5IR29y6woDOKfapzNqpGF/V2YYDIZ9zMGwqYbbLMqaRq N81o0+rVM/yrlS67HetaP9el+MphTuSRV+u0PSNXKJ/0Ij51x1AZ9nyY7cdYqOJSU+v4 iX+ANs1No7T9WNHcH2jy9famigap9oJIrTjXlbE0zy4fcjIK5kPuFy8QfeI1Mr7REOHV GRNEB0E/kHgnyNPh3DEd/JLePsd+fuUt0MHSoMpSsEz1jqG818PmiFJ3JtgOxLAAe2R/ IXZw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:message-id :subject:cc:to:from:date; bh=SwK73XN8fvm9sigKXRtRnCcXHl8YMkGh3SgkVxrG20g=; b=sCAkBLjwRlK/vmQH/S9p4vFjMEctVC5ZmVxmYrXTYAxNJQYgVDUOyxTMwkC5hQhhli XmQhfsVBXcQ9+fWUDK57w9SIY+NdBk42x+avl4GwwV+3fbuGqcDo4VUWNpH+KpNFKGms b0ycJNh/YeVSoi85tHDfRWa3ZRzhpsITMcYKthUFXaBJcVG/mYh0GoPyC6VMcuYebELU TXuZZTizh9htGypbq5DJUb6DPG89o2Z+2IaUdGEoG94M/j1E6j6I9WeqLKmDQTloCBwd tvivQbb/pcCcQC4xXjj1eSBuXtiDqBvhHGMoAs9LIukNC/RFrBDuICXD6CuXKtLIKzYK YkPQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id k13si15097177pgh.501.2019.02.20.22.03.07; Wed, 20 Feb 2019 22:03:24 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725943AbfBUGCr (ORCPT + 99 others); Thu, 21 Feb 2019 01:02:47 -0500 Received: from mga12.intel.com ([192.55.52.136]:49677 "EHLO mga12.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725648AbfBUGCq (ORCPT ); Thu, 21 Feb 2019 01:02:46 -0500 X-Amp-Result: UNKNOWN X-Amp-Original-Verdict: FILE UNKNOWN X-Amp-File-Uploaded: False Received: from orsmga005.jf.intel.com ([10.7.209.41]) by fmsmga106.fm.intel.com with ESMTP/TLS/DHE-RSA-AES256-GCM-SHA384; 20 Feb 2019 22:02:44 -0800 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.58,393,1544515200"; d="scan'208";a="301383769" Received: from richard.sh.intel.com (HELO localhost) ([10.239.159.54]) by orsmga005.jf.intel.com with ESMTP; 20 Feb 2019 22:02:42 -0800 Date: Thu, 21 Feb 2019 14:02:18 +0800 From: Wei Yang To: "Huang, Ying" Cc: Wei Yang , kernel test robot , Greg Kroah-Hartman , Stephen Rothwell , "Rafael J. Wysocki" , lkp@01.org, LKML Subject: Re: [LKP] [driver core] 570d020012: will-it-scale.per_thread_ops -12.2% regression Message-ID: <20190221060218.GA19466@richard> Reply-To: Wei Yang References: <20190218075442.GI29177@shao2-debian> <20190219005945.GA16734@richard> <20190219121904.GA24103@kroah.com> <20190221031049.GE28258@shao2-debian> <20190221034612.GA15147@richard> <87h8cx21gl.fsf@yhuang-dev.intel.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <87h8cx21gl.fsf@yhuang-dev.intel.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 21, 2019 at 12:46:18PM +0800, Huang, Ying wrote: >Wei Yang writes: > >> On Thu, Feb 21, 2019 at 11:10:49AM +0800, kernel test robot wrote: >>>On Tue, Feb 19, 2019 at 01:19:04PM +0100, Greg Kroah-Hartman wrote: >>>> On Tue, Feb 19, 2019 at 08:59:45AM +0800, Wei Yang wrote: >>>> > On Mon, Feb 18, 2019 at 03:54:42PM +0800, kernel test robot wrote: >>>> > >Greeting, >>>> > > >>>> > >FYI, we noticed a -12.2% regression of will-it-scale.per_thread_ops due to commit: >>>> > > >>>> > > >>>> > >commit: 570d0200123fb4f809aa2f6226e93a458d664d70 ("driver core: move device->knode_class to device_private") >>>> > >https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master >>>> > > >>>> > >>>> > This is interesting. >>>> > >>>> > I didn't expect the move of this field will impact the performance. >>>> > >>>> > The reason is struct device is a hotter memory than device->device_private? >>>> > >>>> > >in testcase: will-it-scale >>>> > >on test machine: 288 threads Knights Mill with 80G memory >>>> > >with following parameters: >>>> > > >>>> > > nr_task: 100% >>>> > > mode: thread >>>> > > test: unlink2 >>>> > > cpufreq_governor: performance >>>> > > >>>> > >test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two. >>>> > >test-url: https://github.com/antonblanchard/will-it-scale >>>> > > >>>> > >In addition to that, the commit also has significant impact on the following tests: >>>> > > >>>> > >+------------------+---------------------------------------------------------------+ >>>> > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -29.9% regression | >>>> > >| test machine | 288 threads Knights Mill with 80G memory | >>>> > >| test parameters | cpufreq_governor=performance | >>>> > >| | mode=thread | >>>> > >| | nr_task=100% | >>>> > >| | test=signal1 | >>>> >>>> Ok, I'm going to blame your testing system, or something here, and not >>>> the above patch. >>>> >>>> All this test does is call raise(3). That does not touch the driver >>>> core at all. >>>> >>>> > >+------------------+---------------------------------------------------------------+ >>>> > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -16.5% regression | >>>> > >| test machine | 288 threads Knights Mill with 80G memory | >>>> > >| test parameters | cpufreq_governor=performance | >>>> > >| | mode=thread | >>>> > >| | nr_task=100% | >>>> > >| | test=open1 | >>>> > >+------------------+---------------------------------------------------------------+ >>>> >>>> Same here, open1 just calls open/close a lot. No driver core >>>> interaction at all there either. >>>> >>>> So are you _sure_ this is the offending patch? >>> >>>Hi Greg, >>> >>>We did an experiment, recovered the layout of struct device. and we >>>found the regression is gone. I guess the regession is not from the >>>patch but related to the struct layout. >>> >>> >>>tests: 1 >>>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-unlink2/lkp-knm01 >>> >>>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>>---------------- -------------------------- >>> %stddev change %stddev >>> \ | \ >>> 237096 14% 270789 will-it-scale.workload >>> 823 14% 939 will-it-scale.per_thread_ops >>> >> >> Do you have the comparison between a36dc70b810afe9183de2ea18f and the one >> before 570d020012? >> >>> >>>tests: 1 >>>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-signal1/lkp-knm01 >>> >>>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>>---------------- -------------------------- >>> %stddev change %stddev >>> \ | \ >>> 93.51 3% 48% 138.53 3% will-it-scale.time.user_time >>> 186 40% 261 will-it-scale.per_thread_ops >>> 53909 40% 75507 will-it-scale.workload >>> >>> >>>tests: 1 >>>testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-open1/lkp-knm01 >>> >>>570d0200123fb4f8 a36dc70b810afe9183de2ea18f >>>---------------- -------------------------- >>> %stddev change %stddev >>> \ | \ >>> 447722 22% 546258 10% will-it-scale.time.involuntary_context_switches >>> 226995 19% 269751 will-it-scale.workload >>> 787 19% 936 will-it-scale.per_thread_ops >>> >>> >>> >>>commit a36dc70b810afe9183de2ea18faa4c0939c139ac >>>Author: 0day robot >>>Date: Wed Feb 20 14:21:19 2019 +0800 >>> >>> backfile klist_node in struct device for debugging >>> >>> Signed-off-by: 0day robot >>> >>>diff --git a/include/linux/device.h b/include/linux/device.h >>>index d0e452fd0bff2..31666cb72b3ba 100644 >>>--- a/include/linux/device.h >>>+++ b/include/linux/device.h >>>@@ -1035,6 +1035,7 @@ struct device { >>> spinlock_t devres_lock; >>> struct list_head devres_head; >>> >>>+ struct klist_node knode_class_test_by_rongc; >>> struct class *class; >>> const struct attribute_group **groups; /* optional groups */ >> >> Hmm... because this is not properly aligned? >> >> struct klist_node { >> void *n_klist; /* never access directly */ >> struct list_head n_node; >> struct kref n_ref; >> }; >> >> Except struct kref has one "int" type, others are pointers. >> >> But... I am still confused. > >I guess because the size of struct device is changed, it influences some >alignment changes in the system. Thus influence the benchmark score. > That's interesting. I wrote a module to see the exact size of these two structure on my x86_64. sizeof(struct device) = 736 = 8 * 92 sizeof(struct device_private) = 160 = 8 * 20 sizeof(struct klist_node) = 32 = 8 * 4 Even klist_node has one 4 byte field, c complier would pack the structure to make it aligned. Which system alignment it would affect? After the patch, size would change like this: struct device 736 -> 704 struce device_private 160 -> 192 Would this size change affect system? >Best Regards, >Huang, Ying > >>> >>>Best Regards, >>>Rong Chen -- Wei Yang Help you, Help me