Received: by 2002:a05:6a10:7420:0:0:0:0 with SMTP id hk32csp527628pxb; Tue, 15 Feb 2022 20:57:24 -0800 (PST) X-Google-Smtp-Source: ABdhPJw3urydEtbs8cc/OIW8YMarsV1ZwZDV58uXs4JanHTIeh2RrhaE+hD7XruaQAPJQQWjbhJF X-Received: by 2002:a05:6a00:1911:b0:4e0:f0ed:282e with SMTP id y17-20020a056a00191100b004e0f0ed282emr1449605pfi.43.1644987444164; Tue, 15 Feb 2022 20:57:24 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644987444; cv=none; d=google.com; s=arc-20160816; b=aPAsIqdx5GttyJpl9paBlu/gVmrw8c7vxqeeAwnWOC+IXEfCLNnThlh40R20MEWzGc P5eUfDEz0Ah1fHxPzUhXOzZIyhxDJrC+xNI0bU2ioIkQ/fZ74kdGQZmu0J7ZK5g1owkY eVH+xj0rXPAkFwyzGUg3Bm9TIvhe9F87M9renjegfiQgxV9YGFuYAnjZcxa8UvOvNIKQ 3eZelWUD6UQIhkbYqzmNYB4XviMCzvJ7wg77gp/a7uM+QsShLo6syI7UitL6NUmJ/yK0 su7esrTvB2SNvoRKW3X1ByQgoOZ/KklpaHVw1ziRL1MibE4eBjrjuIhGRMBEc2isQNxj /F3Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=Q+Ju8G+K3yl0rXewZnv2N1IC9X1msq/8ymspEttmFB0=; b=Oy4s/+lVCGMviQGsSj2lEAMVepYjpmzAzPJtD55Z5W2eRalhmPzCz70rCl7m33KCtM mEG1/Ikbe2poayQb+iLzaFiq1AIWzquLFEQOIVNZrTHedrYxM9KKvfpDLO9PLIdEWmVv C+qhHZ2QgjAHen3J0YS+DTXy3Q3sCWSkrb11NLZB4UTiEAcACZtCUY5M7XLfZuuWy6JZ y1lMcHatuEO60xd2bbmaDZrEErMydjEtEzI2tpIOsm6mveJw0Y1rEA3dTnVYDGyM58G7 xuraX/206RWqtRfMbfiKTLN6XJuA6r1JMoodsGTnjK9r5DvCcvcEDQYWb/9F4KoWKhwZ XxEQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@intel-com.20210112.gappssmtp.com header.s=20210112 header.b=g0pX+9pa; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id k3si21010124pfd.217.2022.02.15.20.57.07; Tue, 15 Feb 2022 20:57:24 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@intel-com.20210112.gappssmtp.com header.s=20210112 header.b=g0pX+9pa; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1343537AbiBPCKH (ORCPT + 99 others); Tue, 15 Feb 2022 21:10:07 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:49094 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1343530AbiBPCKE (ORCPT ); Tue, 15 Feb 2022 21:10:04 -0500 Received: from mail-pj1-x102a.google.com (mail-pj1-x102a.google.com [IPv6:2607:f8b0:4864:20::102a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4C6BBFBA59 for ; Tue, 15 Feb 2022 18:09:53 -0800 (PST) Received: by mail-pj1-x102a.google.com with SMTP id v4so1155928pjh.2 for ; Tue, 15 Feb 2022 18:09:53 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=intel-com.20210112.gappssmtp.com; s=20210112; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=Q+Ju8G+K3yl0rXewZnv2N1IC9X1msq/8ymspEttmFB0=; b=g0pX+9paDLIsFTJvf2Iwxzc5R+2Rg/M68J6vnIomnLZCJeTgykYN4I9DF6JoQ/58EO ewBbo66r7K5E9FQVu8/isDoqbnf054RXWsWWuL1Jrm/fckkK3fQ2euKN7xMyWXRR/y89 1vp+hdgXJvlZLsQ159STnVV/30QoM8mBNMZWL8PsuXrmif2YeiPdoF2nqr12WWN/6D0T ZAN8Cy7+m55QT6oqbw7RZXcXjb/34BhJ00YPvdKeXsSkJ3yh95N1Oiz8LdcD7g6gGtUw uSBzOBGFL+ngPatwlg3p7PuELehciDFJqC12bD0Dym4214TAZAxI7z8RBhTFsuVU/0Ur riOg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=Q+Ju8G+K3yl0rXewZnv2N1IC9X1msq/8ymspEttmFB0=; b=uZd0uvgNJzxlkUe5T0If/tvyMnKe/MgtjamwhRSxfHs2OJte3oKQMQZKk/YxJ7HJxT r+xGvkL2UBd8ukl9C8OCt3xp/nM3OxI5XNuwourgIyDqIPbejUE5I0qZZbetAJlhEySR tOzzMwjkrbhj1DY0wXrAwWmySLQ8eX75jimU3JPSJD5XamscQmb5WhdbVvKKjEMtv80V nsWkj1ziWGzLI/V96hPmgEnBhP3iJEOSLIevK9mAJsfrE3JCXSyJ/l2kisMFWnehzB++ KeEMmbkXJbbt6Q0IDeYqzxuSCyHSxS0B7lL4nLs3OlZn/CVeT9IYScV0c+Z/38qYSZ7o msug== X-Gm-Message-State: AOAM5316vXwcJXSMvRcZBtIcfPx0TgzSdWCfPNreyiC9BE/0PfwgftLO J7R4J5qfydcdI/c4UGF9B8AsB8YPW18yzdEa/9NyrA== X-Received: by 2002:a17:90b:1a92:b0:1b9:8094:446b with SMTP id ng18-20020a17090b1a9200b001b98094446bmr357580pjb.93.1644977392787; Tue, 15 Feb 2022 18:09:52 -0800 (PST) MIME-Version: 1.0 References: <20220127124058.1172422-1-ruansy.fnst@fujitsu.com> <20220127124058.1172422-10-ruansy.fnst@fujitsu.com> In-Reply-To: <20220127124058.1172422-10-ruansy.fnst@fujitsu.com> From: Dan Williams Date: Tue, 15 Feb 2022 18:09:46 -0800 Message-ID: Subject: Re: [PATCH v10 9/9] fsdax: set a CoW flag when associate reflink mappings To: Shiyang Ruan Cc: Linux Kernel Mailing List , linux-xfs , Linux NVDIMM , Linux MM , linux-fsdevel , "Darrick J. Wong" , david , Christoph Hellwig , Jane Chu Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_NONE, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Jan 27, 2022 at 4:41 AM Shiyang Ruan wrote: > > Introduce a PAGE_MAPPING_DAX_COW flag to support association with CoW file > mappings. In this case, the dax-RMAP already takes the responsibility > to look up for shared files by given dax page. The page->mapping is no > longer to used for rmap but for marking that this dax page is shared. > And to make sure disassociation works fine, we use page->index as > refcount, and clear page->mapping to the initial state when page->index > is decreased to 0. > > With the help of this new flag, it is able to distinguish normal case > and CoW case, and keep the warning in normal case. > > Signed-off-by: Shiyang Ruan > --- > fs/dax.c | 65 ++++++++++++++++++++++++++++++++------ > include/linux/page-flags.h | 6 ++++ > 2 files changed, 62 insertions(+), 9 deletions(-) > > diff --git a/fs/dax.c b/fs/dax.c > index 250794a5b789..88879c579c1f 100644 > --- a/fs/dax.c > +++ b/fs/dax.c > @@ -334,13 +334,46 @@ static unsigned long dax_end_pfn(void *entry) > for (pfn = dax_to_pfn(entry); \ > pfn < dax_end_pfn(entry); pfn++) > > +static inline void dax_mapping_set_cow_flag(struct address_space *mapping) > +{ > + mapping = (struct address_space *)PAGE_MAPPING_DAX_COW; > +} > + > +static inline bool dax_mapping_is_cow(struct address_space *mapping) > +{ > + return (unsigned long)mapping == PAGE_MAPPING_DAX_COW; > +} > + > /* > - * TODO: for reflink+dax we need a way to associate a single page with > - * multiple address_space instances at different linear_page_index() > - * offsets. > + * Set or Update the page->mapping with FS_DAX_MAPPING_COW flag. > + * Return true if it is an Update. > + */ > +static inline bool dax_mapping_set_cow(struct page *page) > +{ > + if (page->mapping) { > + /* flag already set */ > + if (dax_mapping_is_cow(page->mapping)) > + return false; > + > + /* > + * This page has been mapped even before it is shared, just > + * need to set this FS_DAX_MAPPING_COW flag. > + */ > + dax_mapping_set_cow_flag(page->mapping); > + return true; > + } > + /* Newly associate CoW mapping */ > + dax_mapping_set_cow_flag(page->mapping); > + return false; > +} > + > +/* > + * When it is called in dax_insert_entry(), the cow flag will indicate that > + * whether this entry is shared by multiple files. If so, set the page->mapping > + * to be FS_DAX_MAPPING_COW, and use page->index as refcount. > */ > static void dax_associate_entry(void *entry, struct address_space *mapping, > - struct vm_area_struct *vma, unsigned long address) > + struct vm_area_struct *vma, unsigned long address, bool cow) > { > unsigned long size = dax_entry_size(entry), pfn, index; > int i = 0; > @@ -352,9 +385,17 @@ static void dax_associate_entry(void *entry, struct address_space *mapping, > for_each_mapped_pfn(entry, pfn) { > struct page *page = pfn_to_page(pfn); > > - WARN_ON_ONCE(page->mapping); > - page->mapping = mapping; > - page->index = index + i++; > + if (cow) { > + if (dax_mapping_set_cow(page)) { > + /* Was normal, now updated to CoW */ > + page->index = 2; > + } else > + page->index++; > + } else { > + WARN_ON_ONCE(page->mapping); > + page->mapping = mapping; > + page->index = index + i++; > + } > } > } > > @@ -370,7 +411,12 @@ static void dax_disassociate_entry(void *entry, struct address_space *mapping, > struct page *page = pfn_to_page(pfn); > > WARN_ON_ONCE(trunc && page_ref_count(page) > 1); > - WARN_ON_ONCE(page->mapping && page->mapping != mapping); > + if (!dax_mapping_is_cow(page->mapping)) { > + /* keep the CoW flag if this page is still shared */ > + if (page->index-- > 0) > + continue; > + } else > + WARN_ON_ONCE(page->mapping && page->mapping != mapping); > page->mapping = NULL; > page->index = 0; > } > @@ -810,7 +856,8 @@ static void *dax_insert_entry(struct xa_state *xas, > void *old; > > dax_disassociate_entry(entry, mapping, false); > - dax_associate_entry(new_entry, mapping, vmf->vma, vmf->address); > + dax_associate_entry(new_entry, mapping, vmf->vma, vmf->address, > + false); Where is the caller that passes 'true'? Also when that caller arrives introduce a separate dax_associate_cow_entry() as that's easier to read than dax_associate_entry(..., true) in case someone does not remember what that boolean flag means. However, it's not clear to me that this approach is a good idea given that the filesystem is the source of truth for how many address_spaces this page mapping might be duplicated. What about a iomap_page_ops for fsdax to ask the filesystem when it is ok to clear the mapping association for a page? > /* > * Only swap our new entry into the page cache if the current > * entry is a zero page or an empty entry. If a normal PTE or > diff --git a/include/linux/page-flags.h b/include/linux/page-flags.h > index 1c3b6e5c8bfd..6370d279795a 100644 > --- a/include/linux/page-flags.h > +++ b/include/linux/page-flags.h > @@ -572,6 +572,12 @@ __PAGEFLAG(Reported, reported, PF_NO_COMPOUND) > #define PAGE_MAPPING_KSM (PAGE_MAPPING_ANON | PAGE_MAPPING_MOVABLE) > #define PAGE_MAPPING_FLAGS (PAGE_MAPPING_ANON | PAGE_MAPPING_MOVABLE) > > +/* > + * Different with flags above, this flag is used only for fsdax mode. It > + * indicates that this page->mapping is now under reflink case. > + */ > +#define PAGE_MAPPING_DAX_COW 0x1 > + > static __always_inline int PageMappingFlags(struct page *page) > { > return ((unsigned long)page->mapping & PAGE_MAPPING_FLAGS) != 0; > -- > 2.34.1 > > >