Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1410479imm; Wed, 1 Aug 2018 15:42:45 -0700 (PDT) X-Google-Smtp-Source: AAOMgpekktWwNal8tWgIMhuelurvx4XomM8GmEM0K2piZBKodccGEowufly+1jHZ6ieampxc7kTh X-Received: by 2002:a62:6cc7:: with SMTP id h190-v6mr245589pfc.113.1533163365595; Wed, 01 Aug 2018 15:42:45 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1533163365; cv=none; d=google.com; s=arc-20160816; b=DXExjHp3U5OlXt3khP9jo49Is+2zeJBl/h5sxTW8DPdlOukMCcFwIXlMkkMDaXLcOF 8bGl1zeHP08k4UCJDc1hjP5OI68VYrQ7kRJVjxqofLOt1/7WapkR3iyKkiYqQgZQmmw8 4PTNoCTMfhBn0UKzmAzreaoL1O26DbIuG7zR7K6w+WM8G5B5KF56vo2EUNOnSr2B0PwC V5Hwz6ESQL9pRZvIKhxiNf4LFl3kMts0RFzX/2QiqdVQTsYBOX5l0GbW7gjyrgSWJ13T fzvY8fbHATXyhOGYsqyEf0U/L9WZ74wDWzCKTfLpwiqDxKV7lc2uZWu3pdYUEIISfOSu 3eNA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :arc-authentication-results; bh=+tug1WmS+YVwK3NKu4Lnsgk606KH4Ff0SbIJQzXhf48=; b=em9jVJaKumtznSKIbne7opmD4NIKcqSdwJvK7qnm4jitYpVdP0HHrmP2Uvkn0k39e0 hfdbHM9wOLxfHFR99acbE+uUD2BHccfejW1SiHoWviuvoQD+0dox3uw52FGxsxCX4Z4u OeJsWHzntatD7FiTZ8mjVXpqo9W+q3mHv7K1MwlLLcbin/8vGMfb0SluVQH3m12b3mvz 9uRYiarW2Wr1VvPX9ItrAn7oK+C+fqCyhAWlx43U4UCZaBsiy1pDBzA8knULsJL+yC8p N2sE3Drf0X/VjIKbLEFe2xd3CDPe2DR4TXqJMjrnvUK7l2wTjkMJXRXQoJXmSzh7oBp8 UTcw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d22-v6si111322plr.318.2018.08.01.15.42.31; Wed, 01 Aug 2018 15:42:45 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1732261AbeHBA32 (ORCPT + 99 others); Wed, 1 Aug 2018 20:29:28 -0400 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:50500 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1729647AbeHBA31 (ORCPT ); Wed, 1 Aug 2018 20:29:27 -0400 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.rdu2.redhat.com [10.11.54.3]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id DCAFB7262D; Wed, 1 Aug 2018 22:41:23 +0000 (UTC) Received: from redhat.com (ovpn-121-8.rdu2.redhat.com [10.10.121.8]) by smtp.corp.redhat.com (Postfix) with SMTP id E02C71006EB5; Wed, 1 Aug 2018 22:41:20 +0000 (UTC) Date: Thu, 2 Aug 2018 01:41:20 +0300 From: "Michael S. Tsirkin" To: Will Deacon Cc: Christoph Hellwig , Benjamin Herrenschmidt , Anshuman Khandual , virtualization@lists.linux-foundation.org, linux-kernel@vger.kernel.org, linuxppc-dev@lists.ozlabs.org, aik@ozlabs.ru, robh@kernel.org, joe@perches.com, elfring@users.sourceforge.net, david@gibson.dropbear.id.au, jasowang@redhat.com, mpe@ellerman.id.au, linuxram@us.ibm.com, haren@linux.vnet.ibm.com, paulus@samba.org, srikar@linux.vnet.ibm.com, robin.murphy@arm.com, jean-philippe.brucker@arm.com, marc.zyngier@arm.com Subject: Re: [RFC 0/4] Virtio uses DMA API for all devices Message-ID: <20180802014028-mutt-send-email-mst@kernel.org> References: <20180727095804.GA25592@arm.com> <20180730093414.GD26245@infradead.org> <20180730125100-mutt-send-email-mst@kernel.org> <20180730111802.GA9830@infradead.org> <20180730155633-mutt-send-email-mst@kernel.org> <20180731173052.GA17153@infradead.org> <3d6e81511571260de1c8047aaffa8ac4df093d2e.camel@kernel.crashing.org> <20180801081637.GA14438@arm.com> <20180801083639.GF26378@infradead.org> <20180801090535.GB14438@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20180801090535.GB14438@arm.com> X-Scanned-By: MIMEDefang 2.78 on 10.11.54.3 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Wed, 01 Aug 2018 22:41:24 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.2]); Wed, 01 Aug 2018 22:41:24 +0000 (UTC) for IP:'10.11.54.3' DOMAIN:'int-mx03.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'mst@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Aug 01, 2018 at 10:05:35AM +0100, Will Deacon wrote: > Hi Christoph, > > On Wed, Aug 01, 2018 at 01:36:39AM -0700, Christoph Hellwig wrote: > > On Wed, Aug 01, 2018 at 09:16:38AM +0100, Will Deacon wrote: > > > On arm/arm64, the problem we have is that legacy virtio devices on the MMIO > > > transport (so definitely not PCI) have historically been advertised by qemu > > > as not being cache coherent, but because the virtio core has bypassed DMA > > > ops then everything has happened to work. If we blindly enable the arch DMA > > > ops, > > > > No one is suggesting that as far as I can tell. > > Apologies: it's me that wants the DMA ops enabled to handle legacy devices > behind an IOMMU, but see below. > > > > we'll plumb in the non-coherent ops and start getting data corruption, > > > so we do need a way to quirk virtio as being "always coherent" if we want to > > > use the DMA ops (which we do, because our emulation platforms have an IOMMU > > > for all virtio devices). > > > > From all that I've gather so far: no you do not want that. We really > > need to figure out virtio "dma" interacts with the host / device. > > > > If you look at the current iommu spec it does talk of physical address > > with a little careveout for VIRTIO_F_IOMMU_PLATFORM. > > That's true, although that doesn't exist in the legacy virtio spec, and we > have an existing emulation platform which puts legacy virtio devices behind > an IOMMU. Currently, Linux is unable to boot on this platform unless the > IOMMU is configured as bypass. If we can use the coherent IOMMU DMA ops, > then it works perfectly. > > > So between that and our discussion in this thread and its previous > > iterations I think we need to stick to the current always physical, > > bypass system dma ops mode of virtio operation as the default. > > As above -- that means we hang during boot because we get stuck trying to > bring up a virtio-block device whose DMA is aborted by the IOMMU. The easy > answer is "just upgrade to latest virtio and advertise the presence of the > IOMMU". I'm pushing for that in future platforms, but it seems a shame not > to support the current platform, especially given that other systems do have > hacks in mainline to get virtio working. > > > We just need to figure out how to deal with devices that deviate > > from the default. One things is that VIRTIO_F_IOMMU_PLATFORM really > > should become VIRTIO_F_PLATFORM_DMA to cover the cases of non-iommu > > dma tweaks (offsets, cache flushing), which seems well in spirit of > > the original design. The other issue is VIRTIO_F_IO_BARRIER > > which is very vaguely defined, and which needs a better definition. > > And last but not least we'll need some text explaining the challenges > > of hardware devices - I think VIRTIO_F_PLATFORM_DMA + VIRTIO_F_IO_BARRIER > > is what would basically cover them, but a good description including > > an explanation of why these matter. > > I agree that this makes sense for future revisions of virtio (or perhaps > it can just be a clarification to virtio 1.0), but we're still left in the > dark with legacy devices and it would be nice to have them work on the > systems which currently exist, even if it's a legacy-only hack in the arch > code. > > Will Myself I'm sympathetic to this use-case and I see more uses to this than just legacy support. But more work is required IMHO. Will post tomorrow though - it's late here ... -- MST