Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753877Ab0LFQrJ (ORCPT ); Mon, 6 Dec 2010 11:47:09 -0500 Received: from david.siemens.de ([192.35.17.14]:17771 "EHLO david.siemens.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752307Ab0LFQrI (ORCPT ); Mon, 6 Dec 2010 11:47:08 -0500 Message-ID: <4CFD1377.7030008@siemens.com> Date: Mon, 06 Dec 2010 17:46:47 +0100 From: Jan Kiszka User-Agent: Mozilla/5.0 (X11; U; Linux i686 (x86_64); de; rv:1.8.1.12) Gecko/20080226 SUSE/2.0.0.12-1.1 Thunderbird/2.0.0.12 Mnenhy/0.7.5.666 MIME-Version: 1.0 To: Avi Kivity CC: Jan Kiszka , Thomas Gleixner , Marcelo Tosatti , "linux-kernel@vger.kernel.org" , kvm , Tom Lyon , Alex Williamson , "Michael S. Tsirkin" Subject: Re: [PATCH 5/5] KVM: Allow host IRQ sharing for passed-through PCI 2.3 devices References: <10b9265d513950f2574edf4e79be3b808e741a78.1291419444.git.jan.kiszka@web.de> <4CFD0D91.3050303@redhat.com> <4CFD108F.5000200@siemens.com> <4CFD1208.4070600@redhat.com> In-Reply-To: <4CFD1208.4070600@redhat.com> Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2307 Lines: 53 Am 06.12.2010 17:40, Avi Kivity wrote: > On 12/06/2010 06:34 PM, Jan Kiszka wrote: >>> >>> What's the protocol for doing this? I suppose userspace has to disable >>> interrupts, ioctl(SET_INTX_MASK, masked), ..., ioctl(SET_INTX_MASK, >>> unmasked), enable interrupts? >> >> Userspace just has to synchronize against itself - what it already does: >> qemu_mutex, and masking/unmasking is synchronous /wrt the the executing >> VCPU. Otherwise, masking/unmasking is naturally racy, also in Real Life. >> The guest resolves the remaining races. > > I meant when qemu sets INTX_MASK and the kernel clears it immediately > afterwards because the two are not synchronized. I guess that won't > happen in practice because playing with INTX_MASK is very rare. Ah, there is indeed a race, and the qemu-kvm patches I did not post yet (to wait for the kernel interface to settle) actually suffer from it: userspace needs to set the kernel mask before writing the config space (it's the other way around ATM). This avoids that the kernel overwrites what userspace just wrote out. We always suffer from the race the other way around, see below. > > >>> >>> Isn't there a race window between the two operations? >>> >>> Maybe we should give the kernel full ownership of that bit. >> >> I think this is what VFIO does and is surely cleaner than this approach. >> But it's not possible with the existing interface (sysfs + KVM ioctls) - >> or can you restrict the sysfs access to the config space in such details? > > I'm sure you can, not sure it's worth it. Can the situation be > exploited? what if userspace lies? That's also the above scenario inverted: Userspace can mask or unmask at any time. If it unmasks a yet unhandled, thus raise interrupt, it will trigger another one. The kernel will catch it and mask it again. That can repeat forever with the frequency userspace is able to run its unmasking code. Not nice, but nothing to leverage for a DoS. Jan -- Siemens AG, Corporate Technology, CT T DE IT 1 Corporate Competence Center Embedded Linux -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/