Received: by 2002:ab2:710b:0:b0:1ef:a325:1205 with SMTP id z11csp890326lql; Tue, 12 Mar 2024 00:27:37 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCVQ+0woupA+rLydauQNZwVsgwcsMAahyluyjo/W6AyltiNu8xnk73B++7o8qY58ukAbi6n6ra0A5RoxDKgf1Z+n8rG+qu2qhlPnhtcLrg== X-Google-Smtp-Source: AGHT+IEcb12whyMbC8ROXqPAu8kL+Myirr4M4GnIposCLJMgMmTtuY5EQ5Oun8XbEBc/TIDeFy5j X-Received: by 2002:a05:6808:11c4:b0:3c2:423a:5563 with SMTP id p4-20020a05680811c400b003c2423a5563mr1226854oiv.45.1710228457744; Tue, 12 Mar 2024 00:27:37 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1710228456; cv=pass; d=google.com; s=arc-20160816; b=aYM3IEEcRxgjZYjUYautUHpojtPV3kQDLdkKFTR4V5+VTG+7GYsYA5ruamgekDfiYJ BtRQrG+Pnp7rQAKhEL/LxofynRs8FWifkrL1S7qgUfqC/xlWHjhA0ed82lyZcIjBpjBk 0EqWAjenx3ZOsnxExnspMs1tNJE9aggTYzIE5ZsBYk/Ta/LxG1VNWshOAoL4SRoqQczI GW3+P4stDssRIsalcqdBwSzmyzXksmT0gGVw4JPNsCRxqQnHrkcI+uceJpVKmLDVyKnF ByAV5w4YYg5tqh6+S0HoOR+Jt+o7jBtMzcFyr4xPl7iLfH9BzGNjxY7+4JeV3IUcR22v 8UKg== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:in-reply-to:from:references:cc:to:subject :user-agent:mime-version:list-unsubscribe:list-subscribe:list-id :precedence:date:message-id:dkim-signature; bh=kSSt8SHpYapetnfBjwiq6UQNHHnNk1Xqt0pSKtWzsiU=; fh=v+w+ae0UzDF72uj2ouqJtUvEEDEs2QUpWe4vseUbtxg=; b=XBOJef08ZjHx8UxX8pNOemtMTzVLAd8LMbS0fH6K6u2e4QyH67lt82Qs06kq0oGpmr 3ba8Uq59mhoS7FMd/4bn28esuq8yuENW1tf5ZK4GLyyqMUVQ4/4LM/6kO4ysWnPMUFH4 IPxC/lerQZa/+PJMmGA1IM03ZsZr/1frym2PfV/9seF2vs4MpZ+5JcRKr2nTWorYWojw fCRuGL4AoTWWzx8dInXOOO9N+P7lXzXWtsAZMPeTuTFpuA4HfVx9bkunN0HYLYDkOp5H aX1VLgyni9fZDiMruWGH1jUtZu1kg4zrbzcf7bmjnOtq1rchM+oUMb/Cd+TJ6yTGuF6+ d0vQ==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b=TEIceE4V; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-99910-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-99910-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Return-Path: Received: from sy.mirrors.kernel.org (sy.mirrors.kernel.org. [2604:1380:40f1:3f00::1]) by mx.google.com with ESMTPS id x6-20020aa79a46000000b006e6a93e7a25si240917pfj.44.2024.03.12.00.27.36 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Tue, 12 Mar 2024 00:27:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-99910-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) client-ip=2604:1380:40f1:3f00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.alibaba.com header.s=default header.b=TEIceE4V; arc=pass (i=1 spf=pass spfdomain=linux.alibaba.com dkim=pass dkdomain=linux.alibaba.com dmarc=pass fromdomain=linux.alibaba.com); spf=pass (google.com: domain of linux-kernel+bounces-99910-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-99910-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.alibaba.com Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sy.mirrors.kernel.org (Postfix) with ESMTPS id A6D32B2117D for ; Tue, 12 Mar 2024 07:27:32 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 73B0F6995B; Tue, 12 Mar 2024 07:27:26 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b="TEIceE4V" Received: from out30-110.freemail.mail.aliyun.com (out30-110.freemail.mail.aliyun.com [115.124.30.110]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 0690069958 for ; Tue, 12 Mar 2024 07:27:21 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=115.124.30.110 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710228445; cv=none; b=M4rOcP0MMvhdWo3Fs418YU67QiC4RfrEwN3veAKeMs3Zz54IIpTSFT0SZmt0EFFvgPA6K1W2aDCxmvL9fMhtnquiU6BS9FCZ6bbNzJJDxB6AcHkoyVygjzNqWQmFHZ0o23wp2SabU5CtbBS9wu0kVLbbBdHaeBrB7V5pGi6o9BA= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1710228445; c=relaxed/simple; bh=rau2ZXhw2u4iwC+3dkjnjkUaq/e+8lozHDLSMOcuGUQ=; h=Message-ID:Date:MIME-Version:Subject:To:Cc:References:From: In-Reply-To:Content-Type; b=AdLqMQ6PfmqCL3ivZvejudv84XMkQW8XS9mFfpiBOwUi1k93g1XGNFb6b2WGMLnUyqc3uP6x/K+vyKOGZ1RPg4X5PFf5PZ7OnieksnTpOPl6sTwdEJv8oEmhjrlca3Q4nS5TZvEFVudwIEbl37PfCbFe6YHuq/T8f6x3OYwmgbk= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com; spf=pass smtp.mailfrom=linux.alibaba.com; dkim=pass (1024-bit key) header.d=linux.alibaba.com header.i=@linux.alibaba.com header.b=TEIceE4V; arc=none smtp.client-ip=115.124.30.110 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linux.alibaba.com Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=linux.alibaba.com DKIM-Signature:v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.alibaba.com; s=default; t=1710228434; h=Message-ID:Date:MIME-Version:Subject:To:From:Content-Type; bh=kSSt8SHpYapetnfBjwiq6UQNHHnNk1Xqt0pSKtWzsiU=; b=TEIceE4VZQ95NcQh+ctSRTGybQZ9NWOK/7Kupl7/+DC+pwCEgu/Nsokqu+uN2sIzqJhGhuMXQ2PU9Ca5vpSuwtfkgf7mzFYAjXViEu49F+t6DQv0lK23MDz/rmX2IFtqq7oNpADK4zkx0ZicFluYdEaHrM+OzT00TgZy10yvhcA= X-Alimail-AntiSpam:AC=PASS;BC=-1|-1;BR=01201311R141e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=ay29a033018046051;MF=baolin.wang@linux.alibaba.com;NM=1;PH=DS;RN=9;SR=0;TI=SMTPD_---0W2KyR9a_1710228432; Received: from 30.97.56.54(mailfrom:baolin.wang@linux.alibaba.com fp:SMTPD_---0W2KyR9a_1710228432) by smtp.aliyun-inc.com; Tue, 12 Mar 2024 15:27:14 +0800 Message-ID: <081dc7bb-ae60-4a38-b9c8-560280cf5cf8@linux.alibaba.com> Date: Tue, 12 Mar 2024 15:27:12 +0800 Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: [PATCH v2] mm/migrate: put dest folio on deferred split list if source was there. To: Zi Yan , linux-mm@kvack.org Cc: Andrew Morton , "Matthew Wilcox (Oracle)" , Yang Shi , Huang Ying , "Kirill A . Shutemov" , Ryan Roberts , linux-kernel@vger.kernel.org References: <20240311195848.135067-1-zi.yan@sent.com> From: Baolin Wang In-Reply-To: <20240311195848.135067-1-zi.yan@sent.com> Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit On 2024/3/12 03:58, Zi Yan wrote: > From: Zi Yan > > Commit 616b8371539a6 ("mm: thp: enable thp migration in generic path") > did not check if a THP is on deferred split list before migration, thus, > the destination THP is never put on deferred split list even if the source > THP might be. The opportunity of reclaiming free pages in a partially > mapped THP during deferred list scanning is lost, but no other harmful > consequence is present[1]. Checking source folio deferred split list > status before page unmapped and add destination folio to the list if > source was after migration. > > [1]: https://lore.kernel.org/linux-mm/03CE3A00-917C-48CC-8E1C-6A98713C817C@nvidia.com/ > > From v1: > 1. Used dst to get correct deferred split list after migration > (per Ryan Roberts). > > Fixes: 616b8371539a ("mm: thp: enable thp migration in generic path") > Signed-off-by: Zi Yan > --- > mm/huge_memory.c | 22 ---------------------- > mm/internal.h | 23 +++++++++++++++++++++++ > mm/migrate.c | 26 +++++++++++++++++++++++++- > 3 files changed, 48 insertions(+), 23 deletions(-) > > diff --git a/mm/huge_memory.c b/mm/huge_memory.c > index 9859aa4f7553..c6d4d0cdf4b3 100644 > --- a/mm/huge_memory.c > +++ b/mm/huge_memory.c > @@ -766,28 +766,6 @@ pmd_t maybe_pmd_mkwrite(pmd_t pmd, struct vm_area_struct *vma) > return pmd; > } > > -#ifdef CONFIG_MEMCG > -static inline > -struct deferred_split *get_deferred_split_queue(struct folio *folio) > -{ > - struct mem_cgroup *memcg = folio_memcg(folio); > - struct pglist_data *pgdat = NODE_DATA(folio_nid(folio)); > - > - if (memcg) > - return &memcg->deferred_split_queue; > - else > - return &pgdat->deferred_split_queue; > -} > -#else > -static inline > -struct deferred_split *get_deferred_split_queue(struct folio *folio) > -{ > - struct pglist_data *pgdat = NODE_DATA(folio_nid(folio)); > - > - return &pgdat->deferred_split_queue; > -} > -#endif > - > void folio_prep_large_rmappable(struct folio *folio) > { > if (!folio || !folio_test_large(folio)) > diff --git a/mm/internal.h b/mm/internal.h > index d1c69119b24f..8fa36e84463a 100644 > --- a/mm/internal.h > +++ b/mm/internal.h > @@ -1107,6 +1107,29 @@ struct page *follow_trans_huge_pmd(struct vm_area_struct *vma, > unsigned long addr, pmd_t *pmd, > unsigned int flags); > > +#ifdef CONFIG_MEMCG > +static inline > +struct deferred_split *get_deferred_split_queue(struct folio *folio) > +{ > + struct mem_cgroup *memcg = folio_memcg(folio); > + struct pglist_data *pgdat = NODE_DATA(folio_nid(folio)); > + > + if (memcg) > + return &memcg->deferred_split_queue; > + else > + return &pgdat->deferred_split_queue; > +} > +#else > +static inline > +struct deferred_split *get_deferred_split_queue(struct folio *folio) > +{ > + struct pglist_data *pgdat = NODE_DATA(folio_nid(folio)); > + > + return &pgdat->deferred_split_queue; > +} > +#endif > + > + > /* > * mm/mmap.c > */ > diff --git a/mm/migrate.c b/mm/migrate.c > index 73a052a382f1..591e65658535 100644 > --- a/mm/migrate.c > +++ b/mm/migrate.c > @@ -20,6 +20,7 @@ > #include > #include > #include > +#include > #include > #include > #include > @@ -1037,7 +1038,10 @@ static int move_to_new_folio(struct folio *dst, struct folio *src, > enum { > PAGE_WAS_MAPPED = BIT(0), > PAGE_WAS_MLOCKED = BIT(1), > - PAGE_OLD_STATES = PAGE_WAS_MAPPED | PAGE_WAS_MLOCKED, > + PAGE_WAS_ON_DEFERRED_LIST = BIT(2), > + PAGE_OLD_STATES = PAGE_WAS_MAPPED | > + PAGE_WAS_MLOCKED | > + PAGE_WAS_ON_DEFERRED_LIST, > }; > > static void __migrate_folio_record(struct folio *dst, > @@ -1168,6 +1172,17 @@ static int migrate_folio_unmap(new_folio_t get_new_folio, > folio_lock(src); > } > locked = true; > + if (folio_test_large_rmappable(src) && IMO, you should check folio_test_large() before calling folio_test_large_rmappable(), since the PG_large_rmappable flag is stored in the first tail page. > + !list_empty(&src->_deferred_list)) { > + struct deferred_split *ds_queue = get_deferred_split_queue(src); > + > + spin_lock(&ds_queue->split_queue_lock); > + ds_queue->split_queue_len--; > + list_del_init(&src->_deferred_list); > + spin_unlock(&ds_queue->split_queue_lock); > + old_page_state |= PAGE_WAS_ON_DEFERRED_LIST; > + } > + > if (folio_test_mlocked(src)) > old_page_state |= PAGE_WAS_MLOCKED; > > @@ -1307,6 +1322,15 @@ static int migrate_folio_move(free_folio_t put_new_folio, unsigned long private, > if (old_page_state & PAGE_WAS_MAPPED) > remove_migration_ptes(src, dst, false); > > + if (old_page_state & PAGE_WAS_ON_DEFERRED_LIST) { > + struct deferred_split *ds_queue = get_deferred_split_queue(dst); > + > + spin_lock(&ds_queue->split_queue_lock); > + ds_queue->split_queue_len++; > + list_add(&dst->_deferred_list, &ds_queue->split_queue); > + spin_unlock(&ds_queue->split_queue_lock); > + } > + > out_unlock_both: > folio_unlock(dst); > set_page_owner_migrate_reason(&dst->page, reason);