Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp878063pxf; Thu, 8 Apr 2021 15:12:54 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzb/sSlwyMMl32e9oL6qOr57xAcd+KgeE9lGXPajPKjigFZTz7czk9mwU++FEe45Du/4cmI X-Received: by 2002:a05:6402:10c9:: with SMTP id p9mr14618709edu.268.1617919973840; Thu, 08 Apr 2021 15:12:53 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617919973; cv=none; d=google.com; s=arc-20160816; b=xixribP4wEgLVSSwhxwljmVF164qvm5zq0z5W11Mvsy6w+F6rJbHrVdq5CXTg/qkrM IamEYvJGWxBqKi4g+dU/9jD3c3vo41+VvucHNEgoHRsu6e6jn0GAlz0uQDwYUTYfGHaZ GWJFW1CWBCjSRHFq8HUVWeIcTq07vlmkBScwaqLEhE30Cb9NJiRA8Ye4L8+gyPmD39hi 49HrBENWizoYaRWh6JMHFOMqChe3l5icKj2gHaCvW7fWpszODkMjzOcpCgxW6BABy0UE h6hxJjz7Zh7ki3umqKCwsV8JmIAtrx8RVA/VeD2L//IrbC7nxEfMUiIUKLbMPDLfEKcg mhnA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=F+kgsyAW6ZGwRHaK6+HRecYZ+mozGyP8XvJ47iAkS/s=; b=ZoLPkRSxKGOeAwMwgXuefckOFmzLlllN18LXtciqjByg/ovh9r41YChUkMZP3q6Wx3 ws5BL+WH6KHRMmLVRmQQ+MY5v5BLtKxQ3l+PZh9h26mcSPZEUsFvBuTGqy6OFj9eG0uH TvSjInxP9+3zPIO5gqlCwW2EenFGpyTiSpaVOyKfUvdPr1xY5amY2yLL92BCrVjIey7L xKuiQYMOH6B94Km4w7e7sHIr58vBVlQmn4zS0A0qaDOby/4ay5z8k6wYcZo71K0oIfsL oFzr+/3zLPS7jfeQn/HQanksEk/XHdELKWmkPxvTnkRPd/mkLbMgtvbfrfAqz5phe9oe cf2A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=aDiatDdZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id q19si439099ejr.701.2021.04.08.15.12.30; Thu, 08 Apr 2021 15:12:53 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=aDiatDdZ; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232728AbhDHWLr (ORCPT + 99 others); Thu, 8 Apr 2021 18:11:47 -0400 Received: from mail.kernel.org ([198.145.29.99]:40174 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232265AbhDHWLp (ORCPT ); Thu, 8 Apr 2021 18:11:45 -0400 Received: by mail.kernel.org (Postfix) with ESMTPSA id C668361107; Thu, 8 Apr 2021 22:11:33 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1617919893; bh=JHtxDsKdHaTfSByGIt0LBvw1nnygzJIyZWR4wvxXFYU=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=aDiatDdZBUiYkGFU31JmrQVsKVFW+mN8umpeZ/t6bTNxSRnI5VgIPtNZ0MqMtEDqS vilyR6P1dNLbbfLdTouNeKeSrC2k04mpG8ByCKDq/o+Ygtyj/AKYpYSD6Fbnz5oe7y pkn3Xw+QZS8m0zNkxyRlaIeSJa5zWPvsxTYUxeExxOugb3cD79zY4yodH+5lpUWA56 s+G4R/UyRUko2Lw4YPpOwRyprb3A5jk4wbKXMlSAQ+dtkvuCalCzujDGYGQUy0I+DF S2h8jZiMahtkDhjLIK7CDFpWe7FLBYBDXAItu5WnWcA93WgFhkFAr6NpuD/ZebOjsA kNL51WVseojZQ== Date: Thu, 8 Apr 2021 15:11:31 -0700 From: "Darrick J. Wong" To: Shiyang Ruan Cc: linux-kernel@vger.kernel.org, linux-xfs@vger.kernel.org, linux-nvdimm@lists.01.org, linux-fsdevel@vger.kernel.org, darrick.wong@oracle.com, dan.j.williams@intel.com, willy@infradead.org, jack@suse.cz, viro@zeniv.linux.org.uk, linux-btrfs@vger.kernel.org, david@fromorbit.com, hch@lst.de, rgoldwyn@suse.de, Goldwyn Rodrigues , Ritesh Harjani Subject: Re: [PATCH v4 2/7] fsdax: Replace mmap entry in case of CoW Message-ID: <20210408221131.GZ3957620@magnolia> References: <20210408120432.1063608-1-ruansy.fnst@fujitsu.com> <20210408120432.1063608-3-ruansy.fnst@fujitsu.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210408120432.1063608-3-ruansy.fnst@fujitsu.com> Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Apr 08, 2021 at 08:04:27PM +0800, Shiyang Ruan wrote: > We replace the existing entry to the newly allocated one in case of CoW. > Also, we mark the entry as PAGECACHE_TAG_TOWRITE so writeback marks this > entry as writeprotected. This helps us snapshots so new write > pagefaults after snapshots trigger a CoW. > > Signed-off-by: Goldwyn Rodrigues > Signed-off-by: Shiyang Ruan > Reviewed-by: Christoph Hellwig > Reviewed-by: Ritesh Harjani > --- > fs/dax.c | 39 ++++++++++++++++++++++++++++----------- > 1 file changed, 28 insertions(+), 11 deletions(-) > > diff --git a/fs/dax.c b/fs/dax.c > index b4fd3813457a..e6c1354b27a8 100644 > --- a/fs/dax.c > +++ b/fs/dax.c > @@ -722,6 +722,10 @@ static int copy_cow_page_dax(struct block_device *bdev, struct dax_device *dax_d > return 0; > } > > +/* DAX Insert Flag for the entry we insert */ Might be worth mentioning that these are xarray marks for the inserted entry, since this comment didn't help much. > +#define DAX_IF_DIRTY (1 << 0) > +#define DAX_IF_COW (1 << 1) > + > /* > * By this point grab_mapping_entry() has ensured that we have a locked entry > * of the appropriate size so we don't have to worry about downgrading PMDs to > @@ -729,16 +733,19 @@ static int copy_cow_page_dax(struct block_device *bdev, struct dax_device *dax_d > * already in the tree, we will skip the insertion and just dirty the PMD as > * appropriate. > */ > -static void *dax_insert_entry(struct xa_state *xas, > - struct address_space *mapping, struct vm_fault *vmf, > - void *entry, pfn_t pfn, unsigned long flags, bool dirty) > +static void *dax_insert_entry(struct xa_state *xas, struct vm_fault *vmf, > + void *entry, pfn_t pfn, unsigned long flags, > + unsigned int insert_flags) Urk, two flags arguments. Oh, I see. We insert (shifted) pfn_t values into the mapping as xarray values, so @flags determines the state flags of the new entry value, whereas @insert_flags determines what xarray mark we're going to attach (if any) to the inserted value. --D > { > + struct address_space *mapping = vmf->vma->vm_file->f_mapping; > void *new_entry = dax_make_entry(pfn, flags); > + bool dirty = insert_flags & DAX_IF_DIRTY; > + bool cow = insert_flags & DAX_IF_COW; > > if (dirty) > __mark_inode_dirty(mapping->host, I_DIRTY_PAGES); > > - if (dax_is_zero_entry(entry) && !(flags & DAX_ZERO_PAGE)) { > + if (cow || (dax_is_zero_entry(entry) && !(flags & DAX_ZERO_PAGE))) { > unsigned long index = xas->xa_index; > /* we are replacing a zero page with block mapping */ > if (dax_is_pmd_entry(entry)) > @@ -750,7 +757,7 @@ static void *dax_insert_entry(struct xa_state *xas, > > xas_reset(xas); > xas_lock_irq(xas); > - if (dax_is_zero_entry(entry) || dax_is_empty_entry(entry)) { > + if (cow || dax_is_zero_entry(entry) || dax_is_empty_entry(entry)) { > void *old; > > dax_disassociate_entry(entry, mapping, false); > @@ -774,6 +781,9 @@ static void *dax_insert_entry(struct xa_state *xas, > if (dirty) > xas_set_mark(xas, PAGECACHE_TAG_DIRTY); > > + if (cow) > + xas_set_mark(xas, PAGECACHE_TAG_TOWRITE); > + > xas_unlock_irq(xas); > return entry; > } > @@ -1109,8 +1119,7 @@ static vm_fault_t dax_load_hole(struct xa_state *xas, > pfn_t pfn = pfn_to_pfn_t(my_zero_pfn(vaddr)); > vm_fault_t ret; > > - *entry = dax_insert_entry(xas, mapping, vmf, *entry, pfn, > - DAX_ZERO_PAGE, false); > + *entry = dax_insert_entry(xas, vmf, *entry, pfn, DAX_ZERO_PAGE, 0); > > ret = vmf_insert_mixed(vmf->vma, vaddr, pfn); > trace_dax_load_hole(inode, vmf, ret); > @@ -1137,8 +1146,8 @@ static vm_fault_t dax_pmd_load_hole(struct xa_state *xas, struct vm_fault *vmf, > goto fallback; > > pfn = page_to_pfn_t(zero_page); > - *entry = dax_insert_entry(xas, mapping, vmf, *entry, pfn, > - DAX_PMD | DAX_ZERO_PAGE, false); > + *entry = dax_insert_entry(xas, vmf, *entry, pfn, > + DAX_PMD | DAX_ZERO_PAGE, 0); > > if (arch_needs_pgtable_deposit()) { > pgtable = pte_alloc_one(vma->vm_mm); > @@ -1444,6 +1453,7 @@ static vm_fault_t dax_fault_actor(struct vm_fault *vmf, pfn_t *pfnp, > bool write = vmf->flags & FAULT_FLAG_WRITE; > bool sync = dax_fault_is_synchronous(flags, vmf->vma, iomap); > unsigned long entry_flags = pmd ? DAX_PMD : 0; > + unsigned int insert_flags = 0; > int err = 0; > pfn_t pfn; > void *kaddr; > @@ -1466,8 +1476,15 @@ static vm_fault_t dax_fault_actor(struct vm_fault *vmf, pfn_t *pfnp, > if (err) > return pmd ? VM_FAULT_FALLBACK : dax_fault_return(err); > > - *entry = dax_insert_entry(xas, mapping, vmf, *entry, pfn, entry_flags, > - write && !sync); > + if (write) { > + if (!sync) > + insert_flags |= DAX_IF_DIRTY; > + if (iomap->flags & IOMAP_F_SHARED) > + insert_flags |= DAX_IF_COW; > + } > + > + *entry = dax_insert_entry(xas, vmf, *entry, pfn, entry_flags, > + insert_flags); > > if (write && > srcmap->addr != IOMAP_HOLE && srcmap->addr != iomap->addr) { > -- > 2.31.0 > > >