Received: by 2002:a25:4158:0:0:0:0:0 with SMTP id o85csp466576yba; Fri, 26 Apr 2019 03:21:10 -0700 (PDT) X-Google-Smtp-Source: APXvYqw5ZjnwDutpjfOKsqzMAZ8aAiySA/9b2bQmKqzz2AUZ51Ad44ghOtJqW3Brpl1N4OaDuqhZ X-Received: by 2002:a62:6807:: with SMTP id d7mr44962147pfc.75.1556274070623; Fri, 26 Apr 2019 03:21:10 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1556274070; cv=none; d=google.com; s=arc-20160816; b=s6uE4dmUh6LnCH3XD+jw966cNBQNV7VQgG/HIcYA7PHaSmcTlI6cpyZGloxKPhDqwO U7HYgTdUH0wodj6CfsCvHt7ppruEHdGLKHbCTh3xPjFXuSnayc0x07VlL35NBOsi7fLO jjz+KjiwFO7QhgiRUqxlAMfS5iyIBR9Uz8zdC3vSk0lx2cPfCvZuV9StSnOujcUc7RTu u21UYBm/URmL5BN3+HEI1//30MA/TO7AS2JClG6XPpUsg9mOdFi3tkIJTyyTRgSDqkdz LB2yIL/I6SDHsFldFEzsm8b5qcVPUSQ7mylPpDdgvSQ/a1efF7LUavteXA469scPS4rf YeYw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=Dw/hsV7n9QLZKb1RWeE1+2l31ueVFlh3EnQLHeKXZFM=; b=syP1upcr4en/HSuu8bAl6SFysLGH1cg0W5OUXHK/GhlvNn22z4oAqQOp3cjjKkOA8D MkdyPC0+l6+btp2AE2w4w5L6ApShj3eBJxb9+OLg8k7HYfdKQJdgFuL1F2z7Ap/udHaQ DG1OnbLPsU5ot/aPd9pQeMkNPR5FUTeQnmFXhw4FzntY588ZAwC1H1+AR/qXDz5/dZG5 c/BF3Nrbxdz3wI5jiVS7hIjO+X+CJS6K1SV00U2etjRRAxa9M78zclZjLiOiPS8s/yZt kjnQ9gERWSUJl5g//J5pdOrhjjsPVI8KRjrXVLMFmfHhlBmTRg8/IH6HiJRQg56OamYK uX1A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id s20si23820799pgs.509.2019.04.26.03.20.55; Fri, 26 Apr 2019 03:21:10 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726267AbfDZKTv (ORCPT + 99 others); Fri, 26 Apr 2019 06:19:51 -0400 Received: from outbound-smtp24.blacknight.com ([81.17.249.192]:53301 "EHLO outbound-smtp24.blacknight.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725942AbfDZKTv (ORCPT ); Fri, 26 Apr 2019 06:19:51 -0400 Received: from mail.blacknight.com (pemlinmail01.blacknight.ie [81.17.254.10]) by outbound-smtp24.blacknight.com (Postfix) with ESMTPS id EE94FB8918 for ; Fri, 26 Apr 2019 11:19:48 +0100 (IST) Received: (qmail 10966 invoked from network); 26 Apr 2019 10:19:48 -0000 Received: from unknown (HELO techsingularity.net) (mgorman@techsingularity.net@[37.228.225.79]) by 81.17.254.9 with ESMTPSA (AES256-SHA encrypted, authenticated); 26 Apr 2019 10:19:48 -0000 Date: Fri, 26 Apr 2019 11:19:47 +0100 From: Mel Gorman To: Ingo Molnar Cc: Aubrey Li , Julien Desfossez , Vineeth Remanan Pillai , Nishanth Aravamudan , Peter Zijlstra , Tim Chen , Thomas Gleixner , Paul Turner , Linus Torvalds , Linux List Kernel Mailing , Subhra Mazumdar , Fr?d?ric Weisbecker , Kees Cook , Greg Kerr , Phil Auld , Aaron Lu , Valentin Schneider , Pawan Gupta , Paolo Bonzini , Jiri Kosina Subject: Re: [RFC PATCH v2 00/17] Core scheduling v2 Message-ID: <20190426101947.GZ18914@techsingularity.net> References: <20190424140013.GA14594@sinkpad> <20190425095508.GA8387@gmail.com> <20190425144619.GX18914@techsingularity.net> <20190425185343.GA122353@gmail.com> <20190425213145.GY18914@techsingularity.net> <20190426094545.GD126896@gmail.com> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline In-Reply-To: <20190426094545.GD126896@gmail.com> User-Agent: Mutt/1.10.1 (2018-07-13) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Apr 26, 2019 at 11:45:45AM +0200, Ingo Molnar wrote: > > * Mel Gorman wrote: > > > > > I can show a comparison with equal levels of parallelisation but with > > > > HT off, it is a completely broken configuration and I do not think a > > > > comparison like that makes any sense. > > > > > > I would still be interested in that comparison, because I'd like > > > to learn whether there's any true *inherent* performance advantage to > > > HyperThreading for that particular workload, for exactly tuned > > > parallelism. > > > > > > > It really isn't a fair comparison. MPI seems to behave very differently > > when a machine is saturated. It's documented as changing its behaviour > > as it tries to avoid the worst consequences of saturation. > > > > Curiously, the results on the 2-socket machine were not as bad as I > > feared when the HT configuration is running with twice the number of > > threads as there are CPUs > > > > Amean bt 771.15 ( 0.00%) 1086.74 * -40.93%* > > Amean cg 445.92 ( 0.00%) 543.41 * -21.86%* > > Amean ep 70.01 ( 0.00%) 96.29 * -37.53%* > > Amean is 16.75 ( 0.00%) 21.19 * -26.51%* > > Amean lu 882.84 ( 0.00%) 595.14 * 32.59%* > > Amean mg 84.10 ( 0.00%) 80.02 * 4.84%* > > Amean sp 1353.88 ( 0.00%) 1384.10 * -2.23%* > > Yeah, so what I wanted to suggest is a parallel numeric throughput test > with few inter-process data dependencies, and see whether HT actually > improves total throughput versus the no-HT case. > > No over-saturation - but exactly as many threads as logical CPUs. > > I.e. with 20 physical cores and 40 logical CPUs the numbers to compare > would be a 'nosmt' benchmark running 20 threads, versus a SMT test > running 40 threads. > > I.e. how much does SMT improve total throughput when the workload's > parallelism is tuned to utilize 100% of the available CPUs? > > Does this make sense? > Yes. Here is the comparison. Amean bt 678.75 ( 0.00%) 789.13 * -16.26%* Amean cg 261.22 ( 0.00%) 428.82 * -64.16%* Amean ep 55.36 ( 0.00%) 84.41 * -52.48%* Amean is 13.25 ( 0.00%) 17.82 * -34.47%* Amean lu 1065.08 ( 0.00%) 1090.44 ( -2.38%) Amean mg 89.96 ( 0.00%) 84.28 * 6.31%* Amean sp 1579.52 ( 0.00%) 1506.16 * 4.64%* Amean ua 611.87 ( 0.00%) 663.26 * -8.40%* This is the socket machine and with HT On, there are 80 logical CPUs versus HT Off with 40 logical CPUs. -- Mel Gorman SUSE Labs