Received: by 2002:a25:6193:0:0:0:0:0 with SMTP id v141csp2876658ybb; Fri, 27 Mar 2020 13:48:21 -0700 (PDT) X-Google-Smtp-Source: ADFU+vuwOJR/G6Rb0r+muvQuedT3UKRZbQcTsDwcSpY9P7tMS4X2ymIhgsP6vQJGxWvCJsnHwoEH X-Received: by 2002:aca:d40f:: with SMTP id l15mr575376oig.90.1585342101035; Fri, 27 Mar 2020 13:48:21 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1585342101; cv=none; d=google.com; s=arc-20160816; b=aUu6J0jECWolOhUh3XUqMGstuYeZZ5/jS5F3gfs7Zzvs9p8LCNucAG0sQ3nr6ixOE7 HYHzSnKK/Gzrmlv4qy1OkpCmG/3NMyycvAGTRKNkEukyzguvcbtD+yuEFITCYK4vd3dU 0U/oVQQqh5kU8ATioC+qKBLutfmBpXvXRJ9vch8kWRSVnnSgg9rf1SXGn+/ZJvNo1LhI EixuJpL/ZL2rPJ53XE7BFyWxquGXovpFVvD1rfvrMy9YLWLOYwcjzIWblkBtHfFWWNed 3advN15QZtcaO3sdLyuqDwPWeXkd8MnfXI7lkP/3aZI5h9vXIZ8aVGuL3+nXywk43w8d 5YkQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:dkim-signature; bh=7vL3iO9JiDFVO+iu/iS2D/h6W6favjm/f3dgl2deRQE=; b=tQDcebrsY5jKU05pw4afvc7trIEXcAWGLh8YnTAynu63upJRoUotsizmUfLUuA5vPl A5E6KyI4OG+802CvWQBnytU0rdLFuqd/Cem89VDygPvytn8B+mR5eEws+ffJpky/vB3X JdSMRmvx2C9T6LCodepd2UK4JZ04kM5d86WFCD/Os/8BtD68ku4xmSz0mF7jP5+JAttK aulS0F5ZfgciYbNWdM7d4C7iScjhywHb2rmHwiiYB8zJaniCc+SUdvQNw2WZuADp7uWg qeyTkSdu15/Otx8Lfv1r1CU1FmKAzT7j1llgphTHFvYQEK0za2GyVUa2DCdwh/1B3bkg Onnw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="EsFr/UuU"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y9si2962345oia.47.2020.03.27.13.48.08; Fri, 27 Mar 2020 13:48:21 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b="EsFr/UuU"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727593AbgC0UqK (ORCPT + 99 others); Fri, 27 Mar 2020 16:46:10 -0400 Received: from mail-ed1-f68.google.com ([209.85.208.68]:41078 "EHLO mail-ed1-f68.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727322AbgC0UqJ (ORCPT ); Fri, 27 Mar 2020 16:46:09 -0400 Received: by mail-ed1-f68.google.com with SMTP id v1so12903812edq.8 for ; Fri, 27 Mar 2020 13:46:08 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc; bh=7vL3iO9JiDFVO+iu/iS2D/h6W6favjm/f3dgl2deRQE=; b=EsFr/UuU5dpDMXpnrhEE6kHwwiZjficTgF7BSwp1Dn9JpshoQsB7rmDiaZp59n5Bnn 6gY8Wj9Nmx/3TswWk8mvjZpZDKsJCZeHQDOMJrvxi5Yne61Tv8XNt2vYUOwSFnj035vs tsyaZL2NBzj2l1JBLlhYuZk6Mhw0qpsleTgXSNMmnpfPs5wvNN/8TqJZbDWRbHwK+cw/ MM7yWrhpldEa6yYILefsbOP3CnfF3vc48QTK1iHyZJ+rE24iBeA2t9vHjKWOIYMFAesX F3kU+hNBt3I9mk25YP0CzEtNMcZSQA/qmdyjwxH6Cc+GWYygPqm+UZ9Ig+60wTXFxQQq TKgA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=7vL3iO9JiDFVO+iu/iS2D/h6W6favjm/f3dgl2deRQE=; b=AVwnf4ZDvvsY7yPiN100ySpC4ZSVmGTEIHG+d/VdqVS7qFviXueLEHUiBHW9DPgz7b qYFjcGlKzPw4tRlINISK7XA4DM5C0BZUGNTGuw75K8EPmr4f5ZP0wOOLIdKw4SAbYOLY wwRormKouvGRJkxKpEqXvBLXXcljJhujHQiGdV1hhxqmc1qzc18vEHpStWQf4cefjUCz AHY78sDwHPbyNKq/0q/zfYMWe3kUvIzoKReNEWy/P2I2uD6xcQryVcRZ40cy+EvUIS5b VJw/2O5XBEYAbIic4rvyqlbmMA78EPnvo32skHHxe8OH6bn95vJsP03Jc7WmhnV7CpIY 0IRQ== X-Gm-Message-State: ANhLgQ3Nzx2GecUEucVs46L+X1Yyho404oYTdqaOPWZbQwX7KbnRv3GM Rbjg/h1YKbLAaZ37EhroXD+81TffLaWql85KXVs= X-Received: by 2002:a50:930e:: with SMTP id m14mr976638eda.256.1585341967651; Fri, 27 Mar 2020 13:46:07 -0700 (PDT) MIME-Version: 1.0 References: <20200327170601.18563-1-kirill.shutemov@linux.intel.com> <20200327170601.18563-6-kirill.shutemov@linux.intel.com> In-Reply-To: <20200327170601.18563-6-kirill.shutemov@linux.intel.com> From: Yang Shi Date: Fri, 27 Mar 2020 13:45:55 -0700 Message-ID: Subject: Re: [PATCH 5/7] khugepaged: Allow to collapse PTE-mapped compound pages To: "Kirill A. Shutemov" Cc: Andrew Morton , Andrea Arcangeli , Linux MM , Linux Kernel Mailing List , "Kirill A. Shutemov" Content-Type: text/plain; charset="UTF-8" Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Fri, Mar 27, 2020 at 10:06 AM Kirill A. Shutemov wrote: > > We can collapse PTE-mapped compound pages. We only need to avoid > handling them more than once: lock/unlock page only once if it's present > in the PMD range multiple times as it handled on compound level. The > same goes for LRU isolation and putpack. > > Signed-off-by: Kirill A. Shutemov > --- > mm/khugepaged.c | 41 +++++++++++++++++++++++++++++++---------- > 1 file changed, 31 insertions(+), 10 deletions(-) > > diff --git a/mm/khugepaged.c b/mm/khugepaged.c > index b47edfe57f7b..c8c2c463095c 100644 > --- a/mm/khugepaged.c > +++ b/mm/khugepaged.c > @@ -515,6 +515,17 @@ void __khugepaged_exit(struct mm_struct *mm) > > static void release_pte_page(struct page *page) > { > + /* > + * We need to unlock and put compound page on LRU only once. > + * The rest of the pages have to be locked and not on LRU here. > + */ > + VM_BUG_ON_PAGE(!PageCompound(page) && > + (!PageLocked(page) && PageLRU(page)), page); > + > + if (!PageLocked(page)) > + return; > + > + page = compound_head(page); > dec_node_page_state(page, NR_ISOLATED_ANON + page_is_file_cache(page)); > unlock_page(page); > putback_lru_page(page); BTW, wouldn't this unlock the whole THP and put it back to LRU? Then we may copy the following PTE mapped pages with page unlocked and on LRU. I don't see critical problem, just the pages might be on and off LRU by others, i.e. vmscan, compaction, migration, etc. But no one could take the page away since try_to_unmap() would fail, but not very productive. > @@ -537,6 +548,7 @@ static int __collapse_huge_page_isolate(struct vm_area_struct *vma, > pte_t *_pte; > int none_or_zero = 0, result = 0, referenced = 0; > bool writable = false; > + LIST_HEAD(compound_pagelist); > > for (_pte = pte; _pte < pte+HPAGE_PMD_NR; > _pte++, address += PAGE_SIZE) { > @@ -561,13 +573,23 @@ static int __collapse_huge_page_isolate(struct vm_area_struct *vma, > goto out; > } > > - /* TODO: teach khugepaged to collapse THP mapped with pte */ > + VM_BUG_ON_PAGE(!PageAnon(page), page); > + > if (PageCompound(page)) { > - result = SCAN_PAGE_COMPOUND; > - goto out; > - } > + struct page *p; > + page = compound_head(page); > > - VM_BUG_ON_PAGE(!PageAnon(page), page); > + /* > + * Check if we have dealt with the compount page > + * already > + */ > + list_for_each_entry(p, &compound_pagelist, lru) { > + if (page == p) > + break; > + } > + if (page == p) > + continue; > + } > > /* > * We can do it before isolate_lru_page because the > @@ -640,6 +662,9 @@ static int __collapse_huge_page_isolate(struct vm_area_struct *vma, > page_is_young(page) || PageReferenced(page) || > mmu_notifier_test_young(vma->vm_mm, address)) > referenced++; > + > + if (PageCompound(page)) > + list_add_tail(&page->lru, &compound_pagelist); > } > if (likely(writable)) { > if (likely(referenced)) { > @@ -1185,11 +1210,7 @@ static int khugepaged_scan_pmd(struct mm_struct *mm, > goto out_unmap; > } > > - /* TODO: teach khugepaged to collapse THP mapped with pte */ > - if (PageCompound(page)) { > - result = SCAN_PAGE_COMPOUND; > - goto out_unmap; > - } > + page = compound_head(page); > > /* > * Record which node the original page is from and save this > -- > 2.26.0 > >