Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752604AbcKPLns (ORCPT ); Wed, 16 Nov 2016 06:43:48 -0500 Received: from merlin.infradead.org ([205.233.59.134]:39786 "EHLO merlin.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751464AbcKPLno (ORCPT ); Wed, 16 Nov 2016 06:43:44 -0500 Date: Wed, 16 Nov 2016 12:43:13 +0100 From: Peter Zijlstra To: Christian Borntraeger Cc: Pan Xinhui , Pan Xinhui , linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, virtualization@lists.linux-foundation.org, linux-s390@vger.kernel.org, xen-devel-request@lists.xenproject.org, kvm@vger.kernel.org, xen-devel@lists.xenproject.org, x86@kernel.org, benh@kernel.crashing.org, paulus@samba.org, mpe@ellerman.id.au, mingo@redhat.com, paulmck@linux.vnet.ibm.com, will.deacon@arm.com, kernellwp@gmail.com, jgross@suse.com, pbonzini@redhat.com, bsingharora@gmail.com, boqun.feng@gmail.com, rkrcmar@redhat.com, David.Laight@ACULAB.COM, dave@stgolabs.net, konrad.wilk@oracle.com Subject: Re: [PATCH v7 06/11] x86, paravirt: Add interface to support kvm/xen vcpu preempted check Message-ID: <20161116114313.GS3142@twins.programming.kicks-ass.net> References: <1478077718-37424-1-git-send-email-xinhui.pan@linux.vnet.ibm.com> <1478077718-37424-7-git-send-email-xinhui.pan@linux.vnet.ibm.com> <20161115154706.GF11311@worktop.programming.kicks-ass.net> <20161116102355.GP3142@twins.programming.kicks-ass.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.23.1 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1958 Lines: 48 On Wed, Nov 16, 2016 at 12:29:44PM +0100, Christian Borntraeger wrote: > On 11/16/2016 11:23 AM, Peter Zijlstra wrote: > > On Wed, Nov 16, 2016 at 12:19:09PM +0800, Pan Xinhui wrote: > >> Hi, Peter. > >> I think we can avoid a function call in a simpler way. How about below > >> > >> static inline bool vcpu_is_preempted(int cpu) > >> { > >> /* only set in pv case*/ > >> if (pv_lock_ops.vcpu_is_preempted) > >> return pv_lock_ops.vcpu_is_preempted(cpu); > >> return false; > >> } > > > > That is still more expensive. It needs to do an actual load and makes it > > hard to predict the branch, you'd have to actually wait for the load to > > complete etc. > > Out of curiosity, why is that hard to predict? > On s390 the branch prediction runs asynchronously ahead of the downstream > pipeline (e.g. search for "IBM z Systems Processor Optimization Primer" page 11). > given enough capacity, I would assume that modern x86 processors would do the same > and be able to predict this is as soon as it becomes hot (and otherwise you would > not notice the branch miss anyway). Is x86 behaving differently here? Not sure how exactly it works, but it seems to me that an immediate assignment to the value you're going to compare would leave very little doubt. Then again, maybe cores aren't that smart and only look at the hysterical btb for prediction. > > Also, it generates more code. > > > > Paravirt muck should strive to be as cheap as possible when ran on > > native hardware. > > As I am interested in this series from the s390 point of view, this is > the only thing that block this series? Ingo was rewriting the changelog, other than that, no, I can do this on top. Just spotted this because Ingo and me talked it over. > Is there a chance to add a static key around the paravirt ops somehow? More code generation still, replacing the call with an immediate assignment to the return register is the shortest possible option I think.