Received: by 2002:ad5:474a:0:0:0:0:0 with SMTP id i10csp1335397imu; Thu, 13 Dec 2018 13:19:36 -0800 (PST) X-Google-Smtp-Source: AFSGD/U0e3OoKloOGSPziqQQ9xxaiYsAmQZ0ki6k6QucjQzR0wDxnnd8HQKTgTkGLImREghQTsHM X-Received: by 2002:a17:902:690c:: with SMTP id j12mr394537plk.206.1544735976929; Thu, 13 Dec 2018 13:19:36 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1544735976; cv=none; d=google.com; s=arc-20160816; b=tMRUWKdEKSyEfzrLYHDKmsciP6XbEBn/dTaCIdq4qwprNM+1ngp/RJCJBK7WUnzg8u Ftf7jFp2UmpxTa5/uRLOEUmgJKwFYnHrMQuv47f876fslU/4p992NocPQRS6ChEXlxeZ BVijrjawHasn6QSkwd7LUvi5NFUTPJbwfHAuMiql5pGjwfKJkBuGiuRgm/HHv351TRLb MlrABa4528bNyiNWkuI9VMc1n3fzBhdUrTO0out9m5v1/0p4pnw8zbRlgeThL3yVsWEq b4BVL1g3h3qJ1SThh4vycwXlUZhBTTcRMcA9Ln6BbIPcn3lStmf2XeaeWD06e4LCkOxl jxGA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=grGrepbdccr5K8f4a4DntqmKq5Puoelfvx/ehoWUq/k=; b=nbjTaQlS9fnt1zWMmaFFPO/JiOPa2DODP1briieThleyMWEnc1AcoSZM8ihVb4ZOK6 8glocRwjjjPI/uj5L30KGwcgxDJA4zLUX5mCWc0sYfvnlCsPS4wPf2oCmG9wWGXegjdM AfAe1dp9UjxydqS+CCnpM6HmpZ2ueoj3DvPVTneraB4tm0uES+MXbM0zDpkuTgce9gQI 0qygLz8SOe4qiyzMCOpBcfAOUHO+ZhbyCOySjEgdyfaA2Q5LFGK1Hd1uA5HcT9MXAXnv 4jnQnf4UYt2iObV8rIrZCaV+JsAMav9w+P1JqYPneu5uYTFpMKCWILLsnzLIOpF2qukL hkVw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id z9si2281074pgf.54.2018.12.13.13.19.20; Thu, 13 Dec 2018 13:19:36 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727749AbeLMVSY (ORCPT + 99 others); Thu, 13 Dec 2018 16:18:24 -0500 Received: from mx1.redhat.com ([209.132.183.28]:38558 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726435AbeLMVSX (ORCPT ); Thu, 13 Dec 2018 16:18:23 -0500 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.phx2.redhat.com [10.5.11.13]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id D218130001E3; Thu, 13 Dec 2018 21:18:22 +0000 (UTC) Received: from horse.redhat.com (unknown [10.18.25.234]) by smtp.corp.redhat.com (Postfix) with ESMTP id D10816764A; Thu, 13 Dec 2018 21:18:19 +0000 (UTC) Received: by horse.redhat.com (Postfix, from userid 10451) id 698F82208FC; Thu, 13 Dec 2018 16:18:19 -0500 (EST) Date: Thu, 13 Dec 2018 16:18:19 -0500 From: Vivek Goyal To: Dan Williams Cc: "Dr. David Alan Gilbert" , linux-fsdevel , Linux Kernel Mailing List , KVM list , Miklos Szeredi , Stefan Hajnoczi , sweil@redhat.com, Steven Whitehouse Subject: Re: [PATCH 15/52] fuse: map virtio_fs DAX window BAR Message-ID: <20181213211819.GF4384@redhat.com> References: <20181210171318.16998-1-vgoyal@redhat.com> <20181210171318.16998-16-vgoyal@redhat.com> <20181213200936.GU2313@work-vm> <20181213204052.GE4384@redhat.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20181213204052.GE4384@redhat.com> User-Agent: Mutt/1.9.1 (2017-09-22) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.13 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.43]); Thu, 13 Dec 2018 21:18:23 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Dec 13, 2018 at 03:40:52PM -0500, Vivek Goyal wrote: > On Thu, Dec 13, 2018 at 12:15:51PM -0800, Dan Williams wrote: > > On Thu, Dec 13, 2018 at 12:09 PM Dr. David Alan Gilbert > > wrote: > > > > > > * Dan Williams (dan.j.williams@intel.com) wrote: > > > > On Mon, Dec 10, 2018 at 9:22 AM Vivek Goyal wrote: > > > > > > > > > > From: Stefan Hajnoczi > > > > > > > > > > Experimental QEMU code introduces an MMIO BAR for mapping portions of > > > > > files in the virtio-fs device. Map this BAR so that FUSE DAX can access > > > > > file contents from the host page cache. > > > > > > > > FUSE DAX sounds terrifying, can you explain a bit more about what this is? > > > > > > We've got a guest running in QEMU, it sees an emulated PCI device; > > > that runs a FUSE protocol over virtio on that PCI device, but also has > > > a trick where via commands sent over the virtio queue associated with that device, > > > (fragments of) host files get mmap'd into the qemu virtual memory that corresponds > > > to the kvm slot exposed to the guest for that bar. > > > > > > The guest sees those chunks in that BAR, and thus you can read/write > > > to the host file by directly writing into that BAR. > > > > Ok so it's all software emulated and there won't be hardware DMA > > initiated by the guest to that address? > > That's my understanding. > > > I.e. if the host file gets > > truncated / hole-punched the guest would just cause a refault and the > > filesystem could fill in the block, > > Right > > > or the guest is expected to die if > > the fault to the truncated file range results in SIGBUS. > > Are you referring to the case where a file page is mapped in qemu and > another guest/process trucates that page and when qemu tries to access it it > will get SIGBUS. Have not tried it, will give it a try. Not sure what > happens when QEMU receives SIGBUS. > > Having said that, this is not different from the case of one process > mapping a file and another process truncating the file and first process > getting SIGBUS, right? Ok, tried this and guest process hangs. Stefan, dgilbert, this reminds me that we have faced this issue during our testing and we decided that this will need some fixing in KVM. I even put this in as part of changelog of patch with subject "fuse: Take inode lock for dax inode truncation" "Another problem is, if we setup a mapping in fuse_iomap_begin(), and file gets truncated and dax read/write happens, KVM currently hangs. It tries to fault in a page which does not exist on host (file got truncated). It probably requries fixing in KVM." Not sure what should happen though when qemu receives SIGBUS in this case. Thanks Vivek