Received: by 2002:ac0:a679:0:0:0:0:0 with SMTP id p54csp204467imp; Wed, 20 Feb 2019 23:11:06 -0800 (PST) X-Google-Smtp-Source: AHgI3IYwePtvvGWm41a2398NgBEbPh3xStkV9GVRD9FmgoRZ8l60V4Y0uDUakmcvd4ED5ohLkqAZ X-Received: by 2002:a17:902:a506:: with SMTP id s6mr8036520plq.24.1550733066133; Wed, 20 Feb 2019 23:11:06 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550733066; cv=none; d=google.com; s=arc-20160816; b=Bi4ryPY/y2IZuYO4SS1LWf5s7wpHb8XFpmQRIPJOFf0KdI5orXYuhYYk/oH85wKpiX EXc63Sqpb65659WIyDxZU6bUiqtYKQ7gmSr60qzPUL8l7JraH2N4fLEBg4u5eQ/GQcn+ QYF2MFF6reoOpRvpv4b8TsvVqyrErerbu/QtSCUO9tyDRTRbr3Elrp/tQupZwnZsq+SX 1zw7FEZRFM3tcx5B6NW7DLBcJEREI7QDX7IlvClEcJTkVNRGQ3wDWTfb7d1Na+LstIFn TXWuIwy+Fc9OV7837KzWTKeeKCoHwtr+rL8icZ+/EGy6KYaCSDy4ZCVRfnOuk8osu2Sv 1NCg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-transfer-encoding:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=hwz6XWemvwLMzQKIexisl8FiDkCvUmrj4QtjRl3xOwc=; b=R1eT4b5FGGvtL3g1SZTJeXr+jS8+nRC/ltuNb+vS72bkn+ylKykRPqva94v6b/3N0g drTU/o9S+xfD0v4Vh8DQu+pIFKX0Q4KmRaAKETkTLey6w4yG7aPwK3mdopu0QhG6JgRC TTh/53l5c98xA96sxe+62zCm07tFchbdUmVr+bM2H+lrf5Vbo21DVs8lFyVILNA0+318 fGhetQQPxAdLWvw4WD+In3EH7HWndX1QOnaBIHx24qIXRSjE5fVvT86Ddrd9MKhOvTLf jUqYmk9chCn03iqRVhyNCiuwWCtTpYB3oBA9mVVtsLyqWbgyEX8rTGTpPoj4klZl6okN teKg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=zOL9zPtv; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x8si21173247plv.137.2019.02.20.23.10.50; Wed, 20 Feb 2019 23:11:06 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=zOL9zPtv; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726671AbfBUHK2 (ORCPT + 99 others); Thu, 21 Feb 2019 02:10:28 -0500 Received: from mail.kernel.org ([198.145.29.99]:49676 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726292AbfBUHK2 (ORCPT ); Thu, 21 Feb 2019 02:10:28 -0500 Received: from localhost (5356596B.cm-6-7b.dynamic.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id E3D602084F; Thu, 21 Feb 2019 07:10:26 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1550733027; bh=hTMUluKnkma7kplHCAoHMG8i2cHt9VXt+2W145Dij4M=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=zOL9zPtveJ4gmBOmrusB/Vypwx7Yj/vbod/fW5t6tUQY1/vFDLJA/XCn4by+XfiWE MPuOg+GGykvE/gINnh4xTb+3mImcYTv63AW2kYrhEUh1L1/6dNNgi/0j2eSx4K591E JI2Ssg0IsWMFQmmxS1it7fIrz0zy0rx31QSNWK3U= Date: Thu, 21 Feb 2019 08:10:23 +0100 From: Greg Kroah-Hartman To: kernel test robot Cc: Wei Yang , Stephen Rothwell , "Rafael J. Wysocki" , lkp@01.org, "Huang, Ying" , LKML Subject: Re: [LKP] [driver core] 570d020012: will-it-scale.per_thread_ops -12.2% regression Message-ID: <20190221071023.GA28637@kroah.com> References: <20190218075442.GI29177@shao2-debian> <20190219005945.GA16734@richard> <20190219121904.GA24103@kroah.com> <20190221031049.GE28258@shao2-debian> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20190221031049.GE28258@shao2-debian> User-Agent: Mutt/1.11.3 (2019-02-01) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 21, 2019 at 11:10:49AM +0800, kernel test robot wrote: > On Tue, Feb 19, 2019 at 01:19:04PM +0100, Greg Kroah-Hartman wrote: > > On Tue, Feb 19, 2019 at 08:59:45AM +0800, Wei Yang wrote: > > > On Mon, Feb 18, 2019 at 03:54:42PM +0800, kernel test robot wrote: > > > >Greeting, > > > > > > > >FYI, we noticed a -12.2% regression of will-it-scale.per_thread_ops due to commit: > > > > > > > > > > > >commit: 570d0200123fb4f809aa2f6226e93a458d664d70 ("driver core: move device->knode_class to device_private") > > > >https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master > > > > > > > > > > This is interesting. > > > > > > I didn't expect the move of this field will impact the performance. > > > > > > The reason is struct device is a hotter memory than device->device_private? > > > > > > >in testcase: will-it-scale > > > >on test machine: 288 threads Knights Mill with 80G memory > > > >with following parameters: > > > > > > > > nr_task: 100% > > > > mode: thread > > > > test: unlink2 > > > > cpufreq_governor: performance > > > > > > > >test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two. > > > >test-url: https://github.com/antonblanchard/will-it-scale > > > > > > > >In addition to that, the commit also has significant impact on the following tests: > > > > > > > >+------------------+---------------------------------------------------------------+ > > > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -29.9% regression | > > > >| test machine | 288 threads Knights Mill with 80G memory | > > > >| test parameters | cpufreq_governor=performance | > > > >| | mode=thread | > > > >| | nr_task=100% | > > > >| | test=signal1 | > > > > Ok, I'm going to blame your testing system, or something here, and not > > the above patch. > > > > All this test does is call raise(3). That does not touch the driver > > core at all. > > > > > >+------------------+---------------------------------------------------------------+ > > > >| testcase: change | will-it-scale: will-it-scale.per_thread_ops -16.5% regression | > > > >| test machine | 288 threads Knights Mill with 80G memory | > > > >| test parameters | cpufreq_governor=performance | > > > >| | mode=thread | > > > >| | nr_task=100% | > > > >| | test=open1 | > > > >+------------------+---------------------------------------------------------------+ > > > > Same here, open1 just calls open/close a lot. No driver core > > interaction at all there either. > > > > So are you _sure_ this is the offending patch? > > Hi Greg, > > We did an experiment, recovered the layout of struct device. and we > found the regression is gone. I guess the regession is not from the > patch but related to the struct layout. > > > tests: 1 > testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-unlink2/lkp-knm01 > > 570d0200123fb4f8 a36dc70b810afe9183de2ea18f > ---------------- -------------------------- > %stddev change %stddev > \ | \ > 237096 14% 270789 will-it-scale.workload > 823 14% 939 will-it-scale.per_thread_ops > > > tests: 1 > testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-signal1/lkp-knm01 > > 570d0200123fb4f8 a36dc70b810afe9183de2ea18f > ---------------- -------------------------- > %stddev change %stddev > \ | \ > 93.51 ? 3% 48% 138.53 ? 3% will-it-scale.time.user_time > 186 40% 261 will-it-scale.per_thread_ops > 53909 40% 75507 will-it-scale.workload > > > tests: 1 > testcase/path_params/tbox_group/run: will-it-scale/performance-thread-100%-open1/lkp-knm01 > > 570d0200123fb4f8 a36dc70b810afe9183de2ea18f > ---------------- -------------------------- > %stddev change %stddev > \ | \ > 447722 22% 546258 ? 10% will-it-scale.time.involuntary_context_switches > 226995 19% 269751 will-it-scale.workload > 787 19% 936 will-it-scale.per_thread_ops > > > > commit a36dc70b810afe9183de2ea18faa4c0939c139ac > Author: 0day robot > Date: Wed Feb 20 14:21:19 2019 +0800 > > backfile klist_node in struct device for debugging > > Signed-off-by: 0day robot > > diff --git a/include/linux/device.h b/include/linux/device.h > index d0e452fd0bff2..31666cb72b3ba 100644 > --- a/include/linux/device.h > +++ b/include/linux/device.h > @@ -1035,6 +1035,7 @@ struct device { > spinlock_t devres_lock; > struct list_head devres_head; > > + struct klist_node knode_class_test_by_rongc; > struct class *class; > const struct attribute_group **groups; /* optional groups */ While this is fun to worry about alignment and structure size of 'struct device' I find it odd given that the syscalls and userspace load of those test programs have nothing to do with 'struct device' at all. So I can work on fixing up the alignment of struct device, as that's a nice thing to do for systems with 30k of these in memory, but that shouldn't affect a workload of a constant string of signal calls. thanks, greg k-h