From: Kairui Song
To: linux-mm@kvack.org
Cc: "Huang, Ying", Chris Li, Minchan Kim, Barry Song, Ryan Roberts,
	Yu Zhao, SeongJae Park, David Hildenbrand, Yosry Ahmed,
	Johannes Weiner, Matthew Wilcox, Nhat Pham, Chengming Zhou,
	Andrew Morton, linux-kernel@vger.kernel.org, Kairui Song
Subject: [RFC PATCH 06/10] mm/swap: switch to use multi index entries
Date: Wed, 27 Mar 2024 02:50:28 +0800
Message-ID: <20240326185032.72159-7-ryncsn@gmail.com>
X-Mailer: git-send-email 2.43.0
In-Reply-To: <20240326185032.72159-1-ryncsn@gmail.com>
References: <20240326185032.72159-1-ryncsn@gmail.com>
Reply-To: Kairui Song
MIME-Version: 1.0
Content-Transfer-Encoding: 8bit

From: Kairui Song

Now that all explicit shadow clearing is gone and every swapin/swapout
path goes through the swap cache, switch the swap cache to multi-index
entries, so swapping out THPs is faster and uses less memory.
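
For readers less familiar with the xarray multi-index API, here is a
simplified, illustrative sketch (not part of this patch; the helper names
are made up, and locking retries, shadow lookup, error handling and
statistics updates are all omitted) contrasting the old per-subpage stores
in add_to_swap_cache() with the single multi-index store that the new
__filemap_add_swapcache() path boils down to:

#include <linux/fs.h>
#include <linux/mm.h>
#include <linux/xarray.h>

/* Old scheme: one xarray slot is stored for every subpage of the folio. */
static void swapcache_store_per_page(struct address_space *address_space,
				     struct folio *folio, pgoff_t idx)
{
	XA_STATE_ORDER(xas, &address_space->i_pages, idx, folio_order(folio));
	unsigned long i, nr = folio_nr_pages(folio);

	xas_lock_irq(&xas);
	xas_create_range(&xas);		/* allocate nodes for [idx, idx + nr) */
	for (i = 0; i < nr; i++) {
		xas_store(&xas, folio);	/* nr entries, nr slots */
		xas_next(&xas);
	}
	xas_unlock_irq(&xas);
}

/*
 * New scheme: a single multi-index entry covers the whole 2^order range,
 * so one store is enough and far fewer radix_tree_node allocations are
 * needed (compare the radix_tree_node numbers below).
 */
static void swapcache_store_multi_index(struct address_space *address_space,
					struct folio *folio, pgoff_t idx)
{
	XA_STATE_ORDER(xas, &address_space->i_pages, idx, folio_order(folio));

	xas_lock_irq(&xas);
	xas_store(&xas, folio);		/* one entry for idx ... idx + nr - 1 */
	xas_unlock_irq(&xas);
}
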
Test result of sequential swapin/out of 30G of zero pages on ZRAM:

               Before (us)        After (us)
Swapout:       33648529           33713283
Swapin:        40667696           40954646
Swapout (THP): 7658664            6921176 (+9.7%)
Swapin (THP):  40602278           40891953

And after swapping out 30G with THP, radix tree node usage dropped by a lot:

Before: radix_tree_node 73728K
After:  radix_tree_node  7056K (-94%)

Signed-off-by: Kairui Song
---
 mm/filemap.c     | 27 ++++++++++++++++
 mm/huge_memory.c | 77 +++++++++++++++++++-----------------------------
 mm/internal.h    |  2 ++
 mm/swap_state.c  | 54 ++++++++++------------------------
 4 files changed, 75 insertions(+), 85 deletions(-)

diff --git a/mm/filemap.c b/mm/filemap.c
index 0ccdc9e92764..5e8e3fd26b8d 100644
--- a/mm/filemap.c
+++ b/mm/filemap.c
@@ -919,6 +919,33 @@ static int __filemap_lock_store(struct xa_state *xas, struct folio *folio,
 	return xas_error(xas);
 }
 
+int __filemap_add_swapcache(struct address_space *mapping, struct folio *folio,
+		pgoff_t index, gfp_t gfp, void **shadowp)
+{
+	XA_STATE_ORDER(xas, &mapping->i_pages, index, folio_order(folio));
+	long nr;
+	int ret;
+
+	VM_BUG_ON_FOLIO(!folio_test_locked(folio), folio);
+	VM_BUG_ON_FOLIO(!folio_test_swapcache(folio), folio);
+	mapping_set_update(&xas, mapping);
+
+	nr = folio_nr_pages(folio);
+	folio_ref_add(folio, nr);
+
+	ret = __filemap_lock_store(&xas, folio, index, gfp, shadowp);
+	if (likely(!ret)) {
+		mapping->nrpages += nr;
+		__node_stat_mod_folio(folio, NR_FILE_PAGES, nr);
+		__lruvec_stat_mod_folio(folio, NR_SWAPCACHE, nr);
+		xas_unlock_irq(&xas);
+	} else {
+		folio_put_refs(folio, nr);
+	}
+
+	return ret;
+}
+
 noinline int __filemap_add_folio(struct address_space *mapping,
 		struct folio *folio, pgoff_t index, gfp_t gfp, void **shadowp)
 {
diff --git a/mm/huge_memory.c b/mm/huge_memory.c
index 9859aa4f7553..4fd2f74b94a9 100644
--- a/mm/huge_memory.c
+++ b/mm/huge_memory.c
@@ -2886,14 +2886,12 @@ static void __split_huge_page_tail(struct folio *folio, int tail,
 	lru_add_page_tail(head, page_tail, lruvec, list);
 }
 
-static void __split_huge_page(struct page *page, struct list_head *list,
-		pgoff_t end, unsigned int new_order)
+static void __split_huge_page(struct address_space *mapping, struct page *page,
+		struct list_head *list, pgoff_t end, unsigned int new_order)
 {
 	struct folio *folio = page_folio(page);
 	struct page *head = &folio->page;
 	struct lruvec *lruvec;
-	struct address_space *swap_cache = NULL;
-	unsigned long offset = 0;
 	int i, nr_dropped = 0;
 	unsigned int new_nr = 1 << new_order;
 	int order = folio_order(folio);
@@ -2902,12 +2900,6 @@ static void __split_huge_page(struct page *page, struct list_head *list,
 	/* complete memcg works before add pages to LRU */
 	split_page_memcg(head, order, new_order);
 
-	if (folio_test_anon(folio) && folio_test_swapcache(folio)) {
-		offset = swp_offset(folio->swap);
-		swap_cache = swap_address_space(folio->swap);
-		xa_lock(&swap_cache->i_pages);
-	}
-
 	/* lock lru list/PageCompound, ref frozen by page_ref_freeze */
 	lruvec = folio_lruvec_lock(folio);
 
@@ -2919,18 +2911,18 @@ static void __split_huge_page(struct page *page, struct list_head *list,
 		if (head[i].index >= end) {
 			struct folio *tail = page_folio(head + i);
 
-			if (shmem_mapping(folio->mapping))
+			if (shmem_mapping(mapping))
 				nr_dropped++;
 			else if (folio_test_clear_dirty(tail))
 				folio_account_cleaned(tail,
-					inode_to_wb(folio->mapping->host));
+					inode_to_wb(mapping->host));
 			__filemap_remove_folio(tail, NULL);
 			folio_put(tail);
 		} else if (!PageAnon(page)) {
-			__xa_store(&folio->mapping->i_pages, head[i].index,
+			__xa_store(&mapping->i_pages, head[i].index,
 					head + i, 0);
-		} else if (swap_cache) {
-			__xa_store(&swap_cache->i_pages, offset + i,
+		} else if (folio_test_swapcache(folio)) {
+			__xa_store(&mapping->i_pages, swp_offset(folio->swap) + i,
 					head + i, 0);
 		}
 	}
@@ -2948,23 +2940,17 @@ static void __split_huge_page(struct page *page, struct list_head *list,
 	split_page_owner(head, order, new_order);
 
 	/* See comment in __split_huge_page_tail() */
-	if (folio_test_anon(folio)) {
+	if (mapping) {
 		/* Additional pin to swap cache */
-		if (folio_test_swapcache(folio)) {
-			folio_ref_add(folio, 1 + new_nr);
-			xa_unlock(&swap_cache->i_pages);
-		} else {
-			folio_ref_inc(folio);
-		}
-	} else {
-		/* Additional pin to page cache */
 		folio_ref_add(folio, 1 + new_nr);
-		xa_unlock(&folio->mapping->i_pages);
+		xa_unlock(&mapping->i_pages);
+	} else {
+		folio_ref_inc(folio);
 	}
 
 	local_irq_enable();
 
 	if (nr_dropped)
-		shmem_uncharge(folio->mapping->host, nr_dropped);
+		shmem_uncharge(mapping->host, nr_dropped);
 	remap_page(folio, nr);
 
 	if (folio_test_swapcache(folio))
@@ -3043,11 +3029,12 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 	struct deferred_split *ds_queue = get_deferred_split_queue(folio);
 	/* reset xarray order to new order after split */
 	XA_STATE_ORDER(xas, &folio->mapping->i_pages, folio->index, new_order);
+	struct address_space *mapping = folio_mapping(folio);
 	struct anon_vma *anon_vma = NULL;
-	struct address_space *mapping = NULL;
 	int extra_pins, ret;
 	pgoff_t end;
 	bool is_hzp;
+	gfp_t gfp;
 
 	VM_BUG_ON_FOLIO(!folio_test_locked(folio), folio);
 	VM_BUG_ON_FOLIO(!folio_test_large(folio), folio);
@@ -3079,7 +3066,6 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 		}
 	}
 
-
 	is_hzp = is_huge_zero_page(&folio->page);
 	if (is_hzp) {
 		pr_warn_ratelimited("Called split_huge_page for huge zero page\n");
@@ -3089,6 +3075,17 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 	if (folio_test_writeback(folio))
 		return -EBUSY;
 
+	if (mapping) {
+		gfp = current_gfp_context(mapping_gfp_mask(mapping) &
+					  GFP_RECLAIM_MASK);
+
+		xas_split_alloc(&xas, folio, folio_order(folio), gfp);
+		if (xas_error(&xas)) {
+			ret = xas_error(&xas);
+			goto out;
+		}
+	}
+
 	if (folio_test_anon(folio)) {
 		/*
 		 * The caller does not necessarily hold an mmap_lock that would
@@ -3104,33 +3101,19 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 			goto out;
 		}
 		end = -1;
-		mapping = NULL;
 		anon_vma_lock_write(anon_vma);
 	} else {
-		gfp_t gfp;
-
-		mapping = folio->mapping;
-
 		/* Truncated ? */
 		if (!mapping) {
 			ret = -EBUSY;
			goto out;
 		}
 
-		gfp = current_gfp_context(mapping_gfp_mask(mapping) &
-					  GFP_RECLAIM_MASK);
-
 		if (!filemap_release_folio(folio, gfp)) {
 			ret = -EBUSY;
 			goto out;
 		}
 
-		xas_split_alloc(&xas, folio, folio_order(folio), gfp);
-		if (xas_error(&xas)) {
-			ret = xas_error(&xas);
-			goto out;
-		}
-
 		anon_vma = NULL;
 		i_mmap_lock_read(mapping);
 
@@ -3189,7 +3172,9 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 			int nr = folio_nr_pages(folio);
 
 			xas_split(&xas, folio, folio_order(folio));
-			if (folio_test_pmd_mappable(folio) &&
+
+			if (!folio_test_anon(folio) &&
+			    folio_test_pmd_mappable(folio) &&
 			    new_order < HPAGE_PMD_ORDER) {
 				if (folio_test_swapbacked(folio)) {
 					__lruvec_stat_mod_folio(folio,
@@ -3202,7 +3187,7 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 			}
 		}
 
-		__split_huge_page(page, list, end, new_order);
+		__split_huge_page(mapping, page, list, end, new_order);
 		ret = 0;
 	} else {
 		spin_unlock(&ds_queue->split_queue_lock);
@@ -3218,9 +3203,9 @@ int split_huge_page_to_list_to_order(struct page *page, struct list_head *list,
 	if (anon_vma) {
 		anon_vma_unlock_write(anon_vma);
 		put_anon_vma(anon_vma);
-	}
-	if (mapping)
+	} else {
 		i_mmap_unlock_read(mapping);
+	}
 out:
 	xas_destroy(&xas);
 	count_vm_event(!ret ? THP_SPLIT_PAGE : THP_SPLIT_PAGE_FAILED);
diff --git a/mm/internal.h b/mm/internal.h
index 7e486f2c502c..b2bbfd3c2b50 100644
--- a/mm/internal.h
+++ b/mm/internal.h
@@ -1059,6 +1059,8 @@ struct migration_target_control {
  */
 size_t splice_folio_into_pipe(struct pipe_inode_info *pipe,
 			      struct folio *folio, loff_t fpos, size_t size);
+int __filemap_add_swapcache(struct address_space *mapping, struct folio *folio,
+		pgoff_t index, gfp_t gfp, void **shadowp);
 
 /*
  * mm/vmalloc.c
diff --git a/mm/swap_state.c b/mm/swap_state.c
index b84e7b0ea4a5..caf69696f47c 100644
--- a/mm/swap_state.c
+++ b/mm/swap_state.c
@@ -90,48 +90,22 @@ int add_to_swap_cache(struct folio *folio, swp_entry_t entry,
 {
 	struct address_space *address_space = swap_address_space(entry);
 	pgoff_t idx = swp_offset(entry);
-	XA_STATE_ORDER(xas, &address_space->i_pages, idx, folio_order(folio));
-	unsigned long i, nr = folio_nr_pages(folio);
-	void *old;
-
-	xas_set_update(&xas, workingset_update_node);
+	int ret;
 
 	VM_BUG_ON_FOLIO(!folio_test_locked(folio), folio);
 	VM_BUG_ON_FOLIO(folio_test_swapcache(folio), folio);
 	VM_BUG_ON_FOLIO(!folio_test_swapbacked(folio), folio);
 
-	folio_ref_add(folio, nr);
 	folio_set_swapcache(folio);
 	folio->swap = entry;
 
-	do {
-		xas_lock_irq(&xas);
-		xas_create_range(&xas);
-		if (xas_error(&xas))
-			goto unlock;
-		for (i = 0; i < nr; i++) {
-			VM_BUG_ON_FOLIO(xas.xa_index != idx + i, folio);
-			if (shadowp) {
-				old = xas_load(&xas);
-				if (xa_is_value(old))
-					*shadowp = old;
-			}
-			xas_store(&xas, folio);
-			xas_next(&xas);
-		}
-		address_space->nrpages += nr;
-		__node_stat_mod_folio(folio, NR_FILE_PAGES, nr);
-		__lruvec_stat_mod_folio(folio, NR_SWAPCACHE, nr);
-unlock:
-		xas_unlock_irq(&xas);
-	} while (xas_nomem(&xas, gfp));
-
-	if (!xas_error(&xas))
-		return 0;
+	ret = __filemap_add_swapcache(address_space, folio, idx, gfp, shadowp);
+	if (ret) {
+		folio_clear_swapcache(folio);
+		folio->swap.val = 0;
+	}
 
-	folio_clear_swapcache(folio);
-	folio_ref_sub(folio, nr);
-	return xas_error(&xas);
+	return ret;
 }
 
 /*
@@ -142,7 +116,6 @@ void __delete_from_swap_cache(struct folio *folio,
 			swp_entry_t entry, void *shadow)
 {
 	struct address_space *address_space = swap_address_space(entry);
-	int i;
 	long nr = folio_nr_pages(folio);
 	pgoff_t idx = swp_offset(entry);
 	XA_STATE(xas, &address_space->i_pages, idx);
@@ -153,11 +126,9 @@ void __delete_from_swap_cache(struct folio *folio,
 	VM_BUG_ON_FOLIO(!folio_test_swapcache(folio), folio);
 	VM_BUG_ON_FOLIO(folio_test_writeback(folio), folio);
 
-	for (i = 0; i < nr; i++) {
-		void *entry = xas_store(&xas, shadow);
-		VM_BUG_ON_PAGE(entry != folio, entry);
-		xas_next(&xas);
-	}
+	xas_set_order(&xas, idx, folio_order(folio));
+	xas_store(&xas, shadow);
+
 	folio->swap.val = 0;
 	folio_clear_swapcache(folio);
 	address_space->nrpages -= nr;
@@ -252,6 +223,11 @@ void clear_shadow_from_swap_cache(swp_entry_t entry)
 
 	xas_set_update(&xas, workingset_update_node);
 
+	/*
+	 * On unmap, this may delete a larger order shadow here. It's mostly
+	 * fine since partially mapped folios are split on swap out and
+	 * leave order-0 shadows behind.
+	 */
 	xa_lock_irq(&address_space->i_pages);
 	if (xa_is_value(xas_load(&xas)))
 		xas_store(&xas, NULL);
-- 
2.43.0