Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752324AbdIVM4D (ORCPT ); Fri, 22 Sep 2017 08:56:03 -0400 Received: from bombadil.infradead.org ([65.50.211.133]:57877 "EHLO bombadil.infradead.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751986AbdIVM4C (ORCPT ); Fri, 22 Sep 2017 08:56:02 -0400 Date: Fri, 22 Sep 2017 14:55:56 +0200 From: Peter Zijlstra To: Marcelo Tosatti Cc: Konrad Rzeszutek Wilk , mingo@redhat.com, kvm@vger.kernel.org, linux-kernel@vger.kernel.org, Thomas Gleixner Subject: Re: [patch 3/3] x86: kvm guest side support for KVM_HC_RT_PRIO hypercall Message-ID: <20170922125556.cyzybj6c7jqypbmo@hirez.programming.kicks-ass.net> References: <20170921113835.031375194@redhat.com> <20170921114039.466130276@redhat.com> <20170921133653.GO26248@char.us.oracle.com> <20170921140628.zliqlz7mrlqs5pzz@hirez.programming.kicks-ass.net> <20170922011039.GB20133@amt.cnet> <20170922100004.ydmaxvgpc2zx7j25@hirez.programming.kicks-ass.net> <20170922105609.deln6kylvvpaijg7@hirez.programming.kicks-ass.net> <20170922123305.GB29608@amt.cnet> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20170922123305.GB29608@amt.cnet> User-Agent: NeoMutt/20170609 (1.8.3) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1683 Lines: 49 On Fri, Sep 22, 2017 at 09:33:05AM -0300, Marcelo Tosatti wrote: > > That is, running a RT guest and not having _all_ VCPUs being RT tasks on > > the host is absolutely and completely insane and broken. > > Can you explain why, please? You just explained it yourself. If the thread that needs to complete what you're waiting on has lower priority, it will _never_ get to run if you're busy waiting on it. This is _trivial_. And even for !RT it can be quite costly, because you can end up having to burn your entire slot of CPU time before you run the other task. Userspace spinning is _bad_, do not do this. (the one exception where it works is where you have a single thread per cpu, because then there's effectively no scheduling). > > Fix whatever needs fixing to allow your VCPU0 to be RT, don't do insane > > things like this. > > VCPU0 can be RT, but you'll get the following hang, if the emulator > thread is sharing a pCPU with VCPU0: > > 1. submit IO. > 2. busy spin. > > As executed by the guest vcpu (its a natural problem). > > Do you have a better suggestion as how to fix the problem? Yes, not busy wait. Go to sleep and make sure you're woken up once the IO completes. > We can fix the BIOS, but userspace will still be allowed to > generate the code pattern above. What does the BIOS have to do with anything? > And increasing the priority of the emulator thread, at random times > (so it can inject interrupts to vcpu-0), can cause it to interrupt > vcpu-0 in a spinlock protected section. You can equally boost the emulator thread while you're spin-waiting, but that's ugly as heck too. The normal, sane solution is to not spin-wait but block.