Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp6533313ybl; Wed, 15 Jan 2020 06:16:01 -0800 (PST) X-Google-Smtp-Source: APXvYqy0BUJaY0ME41Sk0JlaF+NVgtk/x142/9UQb5fqQ3YY8V3uO0rNf+KzZ5/kufhobVeWx4cm X-Received: by 2002:a9d:51c1:: with SMTP id d1mr2760558oth.136.1579097760922; Wed, 15 Jan 2020 06:16:00 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579097760; cv=none; d=google.com; s=arc-20160816; b=hnxQLhANCzJQO/XGcaBJE8QHm9hp3x3t40+zHw7NV0Jq9tTJGtZGd8HZrU3Sx0HoOM sloFyl53QkPWO/Hu8OGeWBIEsnbqiSstXkoSlJXxkUm86ftMKaeZ+Wle57JBf5nF0peU +H44k67FUR2H3y65loVJlalEamus9CBqWyupR3ecyHcZUGzF9W/nfR84GVvJoaP6kgIx k7LN7gZebJO65nihUL6SSydKGibuPRfECYjrTfRKe+5l7dPs2DXAUWwTaufDQibO2RME /PfcObKyz/Chm4OhicyzCeyaig9rAQeADhBVzdsBVuvxoJZsilUtx0xnULiF6AZU+4hH T15Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:message-id:references :in-reply-to:subject:cc:to:from:date:content-transfer-encoding :mime-version:dkim-signature; bh=/eK3xkjQ6nslWoRZpNhOw9XGUQHZNxndTXHT0zGiiWQ=; b=pbFit9Qrd+6hN+tgV/YfJkPK7CBAbpFE6EE68b+XBtvpz3y0GlHGrgyvNxrDLya1iO H5jLRyvA1wE1E6PxgrNEmd3R7RVq8zy/c/yz4a8HDDKGlCRNSn79xas6MRiDkvfyPqAi I38w6GlrU8juN3+HO0Y3Ic34eUrJRG546CYGPFD5BiNiBDnRTWMRksnWmG32Qqg8Koqy zvfI75TcdrAez6/63804Kc+WxPZUBU7rC/UgMsfJic8TXvuieuuhXnyHPN4viVtRNE3d Y3MCyCdugUCeML+Pdnl7UjyeM0a3ZVrSqJ4Qg9RprJ7oSFgLPmws3v8dpWGEGzRDEU6J aPPg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=IfVjmj7K; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r10si9951175otn.241.2020.01.15.06.15.48; Wed, 15 Jan 2020 06:16:00 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=IfVjmj7K; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730667AbgAOOOO (ORCPT + 99 others); Wed, 15 Jan 2020 09:14:14 -0500 Received: from mail.kernel.org ([198.145.29.99]:39642 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1729154AbgAOOON (ORCPT ); Wed, 15 Jan 2020 09:14:13 -0500 Received: from disco-boy.misterjones.org (disco-boy.misterjones.org [51.254.78.96]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id B7CDC24679; Wed, 15 Jan 2020 14:14:11 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1579097651; bh=CaXoAkqOajL0kmg6h715f9PHn0XAZVLVKqsyXgOXacM=; h=Date:From:To:Cc:Subject:In-Reply-To:References:From; b=IfVjmj7KSiyixL26Y39CJZiiWPKnwpOP90yCrThZ4afM3dMYPowazyc4h9gC+j/q/ AW95yu14rcNeC/TEWOUi3I0du1tfRtRn00bWjyiSecTPMTfwJDnMJG09iFY87NdN1S yZ3p6nWb+/NtCMwUmPpa43pUBiyDYqjj1Lq8ET1I= Received: from disco-boy.misterjones.org ([51.254.78.96] helo=www.loen.fr) by disco-boy.misterjones.org with esmtpsa (TLS1.2:ECDHE_RSA_AES_128_GCM_SHA256:128) (Exim 4.92) (envelope-from ) id 1irjQw-0000fF-2L; Wed, 15 Jan 2020 14:14:10 +0000 MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII; format=flowed Content-Transfer-Encoding: 7bit Date: Wed, 15 Jan 2020 14:14:10 +0000 From: Marc Zyngier To: Will Deacon Cc: Zengruan Ye , linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, kvmarm@lists.cs.columbia.edu, kvm@vger.kernel.org, linux-doc@vger.kernel.org, virtualization@lists.linux-foundation.org, james.morse@arm.com, linux@armlinux.org.uk, suzuki.poulose@arm.com, julien.thierry.kdev@gmail.com, catalin.marinas@arm.com, mark.rutland@arm.com, steven.price@arm.com, daniel.lezcano@linaro.org, peterz@infradead.org Subject: Re: [PATCH v2 0/6] KVM: arm64: VCPU preempted check support In-Reply-To: <20200113121240.GC3260@willie-the-truck> References: <20191226135833.1052-1-yezengruan@huawei.com> <20200113121240.GC3260@willie-the-truck> Message-ID: X-Sender: maz@kernel.org User-Agent: Roundcube Webmail/1.3.8 X-SA-Exim-Connect-IP: 51.254.78.96 X-SA-Exim-Rcpt-To: will@kernel.org, yezengruan@huawei.com, linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org, kvmarm@lists.cs.columbia.edu, kvm@vger.kernel.org, linux-doc@vger.kernel.org, virtualization@lists.linux-foundation.org, james.morse@arm.com, linux@armlinux.org.uk, suzuki.poulose@arm.com, julien.thierry.kdev@gmail.com, catalin.marinas@arm.com, mark.rutland@arm.com, steven.price@arm.com, daniel.lezcano@linaro.org, peterz@infradead.org X-SA-Exim-Mail-From: maz@kernel.org X-SA-Exim-Scanned: No (on disco-boy.misterjones.org); SAEximRunCond expanded to false Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2020-01-13 12:12, Will Deacon wrote: > [+PeterZ] > > On Thu, Dec 26, 2019 at 09:58:27PM +0800, Zengruan Ye wrote: >> This patch set aims to support the vcpu_is_preempted() functionality >> under KVM/arm64, which allowing the guest to obtain the VCPU is >> currently running or not. This will enhance lock performance on >> overcommitted hosts (more runnable VCPUs than physical CPUs in the >> system) as doing busy waits for preempted VCPUs will hurt system >> performance far worse than early yielding. >> >> We have observed some performace improvements in uninx benchmark >> tests. >> >> unix benchmark result: >> host: kernel 5.5.0-rc1, HiSilicon Kunpeng920, 8 CPUs >> guest: kernel 5.5.0-rc1, 16 VCPUs >> >> test-case | after-patch | >> before-patch >> ----------------------------------------+-------------------+------------------ >> Dhrystone 2 using register variables | 334600751.0 lps | >> 335319028.3 lps >> Double-Precision Whetstone | 32856.1 MWIPS | >> 32849.6 MWIPS >> Execl Throughput | 3662.1 lps | >> 2718.0 lps >> File Copy 1024 bufsize 2000 maxblocks | 432906.4 KBps | >> 158011.8 KBps >> File Copy 256 bufsize 500 maxblocks | 116023.0 KBps | >> 37664.0 KBps >> File Copy 4096 bufsize 8000 maxblocks | 1432769.8 KBps | >> 441108.8 KBps >> Pipe Throughput | 6405029.6 lps | >> 6021457.6 lps >> Pipe-based Context Switching | 185872.7 lps | >> 184255.3 lps >> Process Creation | 4025.7 lps | >> 3706.6 lps >> Shell Scripts (1 concurrent) | 6745.6 lpm | >> 6436.1 lpm >> Shell Scripts (8 concurrent) | 998.7 lpm | >> 931.1 lpm >> System Call Overhead | 3913363.1 lps | >> 3883287.8 lps >> ----------------------------------------+-------------------+------------------ >> System Benchmarks Index Score | 1835.1 | >> 1327.6 > > Interesting, thanks for the numbers. > > So it looks like there is a decent improvement to be had from targetted > vCPU > wakeup, but I really dislike the explicit PV interface and it's already > been > shown to interact badly with the WFE-based polling in > smp_cond_load_*(). > > Rather than expose a divergent interface, I would instead like to > explore an > improvement to smp_cond_load_*() and see how that performs before we > commit > to something more intrusive. Marc and I looked at this very briefly in > the > past, and the basic idea is to register all of the WFE sites with the > hypervisor, indicating which register contains the address being spun > on > and which register contains the "bad" value. That way, you don't bother > rescheduling a vCPU if the value at the address is still bad, because > you > know it will exit immediately. > > Of course, the devil is in the details because when I say "address", > that's > a guest virtual address, so you need to play some tricks in the > hypervisor > so that you have a separate mapping for the lockword (it's enough to > keep > track of the physical address). > > Our hacks are here but we basically ran out of time to work on them > beyond > an unoptimised and hacky prototype: > > https://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms.git/log/?h=kvm-arm64/pvcy > > Marc -- how would you prefer to handle this? Let me try and rebase this thing to a modern kernel (I doubt it applies without conflicts to mainline). We can then have discussion about its merit on the list once I post it. It'd be good to have a pointer to the benchamrks that have been used here. Thanks, M. -- Jazz is not dead. It just smells funny...