Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752562AbdHKHUF (ORCPT ); Fri, 11 Aug 2017 03:20:05 -0400 Received: from mx1.redhat.com ([209.132.183.28]:52416 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751370AbdHKHUD (ORCPT ); Fri, 11 Aug 2017 03:20:03 -0400 DMARC-Filter: OpenDMARC Filter v1.3.2 mx1.redhat.com 1A78E52142 Authentication-Results: ext-mx01.extmail.prod.ext.phx2.redhat.com; dmarc=none (p=none dis=none) header.from=redhat.com Authentication-Results: ext-mx01.extmail.prod.ext.phx2.redhat.com; spf=fail smtp.mailfrom=cohuck@redhat.com Date: Fri, 11 Aug 2017 09:19:53 +0200 From: Cornelia Huck To: Eric Farman Cc: "Longpeng (Mike)" , pbonzini@redhat.com, rkrcmar@redhat.com, agraf@suse.com, borntraeger@de.ibm.com, christoffer.dall@linaro.org, marc.zyngier@arm.com, james.hogan@imgtec.com, kvm@vger.kernel.org, linux-kernel@vger.kernel.org, weidong.huang@huawei.com, arei.gonglei@huawei.com, wangxinxin.wang@huawei.com, longpeng.mike@gmail.com, david@redhat.com Subject: Re: [PATCH v2 0/4] KVM: optimize the kvm_vcpu_on_spin Message-ID: <20170811091953.75a8376e.cohuck@redhat.com> In-Reply-To: <65ce708e-480a-6173-f678-d7934c630439@linux.vnet.ibm.com> References: <1502165135-4784-1-git-send-email-longpeng2@huawei.com> <20170808094153.1b5bf8f4@gondolin> <598972E3.1030807@huawei.com> <65ce708e-480a-6173-f678-d7934c630439@linux.vnet.ibm.com> Organization: Red Hat GmbH MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.25]); Fri, 11 Aug 2017 07:20:03 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2350 Lines: 67 On Thu, 10 Aug 2017 09:18:09 -0400 Eric Farman wrote: > On 08/08/2017 04:14 AM, Longpeng (Mike) wrote: > > > > > > On 2017/8/8 15:41, Cornelia Huck wrote: > > > >> On Tue, 8 Aug 2017 12:05:31 +0800 > >> "Longpeng(Mike)" wrote: > >> > >>> This is a simple optimization for kvm_vcpu_on_spin, the > >>> main idea is described in patch-1's commit msg. > >> > >> I think this generally looks good now. > >> > >>> > >>> I did some tests base on the RFC version, the result shows > >>> that it can improves the performance slightly. > >> > >> Did you re-run tests on this version? > > > > > > Hi Cornelia, > > > > I didn't re-run tests on V2. But the major difference between RFC and V2 > > is that V2 only cache result for X86 (s390/arm needn't) and V2 saves a > > expensive operation ( 440-1400 cycles on my test machine ) for X86/VMX. > > > > So I think V2's performance is at least the same as RFC or even slightly > > better. :) > > > >> > >> I would also like to see some s390 numbers; unfortunately I only have a > >> z/VM environment and any performance numbers would be nearly useless > >> there. Maybe somebody within IBM with a better setup can run a quick > >> test? > > Won't swear I didn't screw something up, but here's some quick numbers. > Host was 4.12.0 with and without this series, running QEMU 2.10.0-rc0. > Created 4 guests, each with 4 CPU (unpinned) and 4GB RAM. VM1 did full > kernel compiles with kernbench, which took averages of 5 runs of > different job sizes (I threw away the "-j 1" numbers). VM2-VM4 ran cpu > burners on 2 of their 4 cpus. > > Numbers from VM1 kernbench output, and the delta between runs: > > load -j 3 before after delta > Elapsed Time 183.178 182.58 -0.598 > User Time 534.19 531.52 -2.67 > System Time 32.538 33.37 0.832 > Percent CPU 308.8 309 0.2 > Context Switches 98484.6 99001 516.4 > Sleeps 227347 228752 1405 > > load -j 16 before after delta > Elapsed Time 153.352 147.59 -5.762 > User Time 545.829 533.41 -12.419 > System Time 34.289 34.85 0.561 > Percent CPU 347.6 348 0.4 > Context Switches 160518 159120 -1398 > Sleeps 240740 240536 -204 Thanks a lot, Eric! The decreases in elapsed time look nice, and we probably should not care about the increases reported.