From: Miaohe Lin
Subject: [PATCH v5 2/4] swap: fix do_swap_page() race with swapoff
Date: Sun, 25 Apr 2021 17:54:17 +0800
Message-ID: <20210425095419.3830298-3-linmiaohe@huawei.com>
X-Mailer: git-send-email 2.23.0
In-Reply-To: <20210425095419.3830298-1-linmiaohe@huawei.com>
References: <20210425095419.3830298-1-linmiaohe@huawei.com>
MIME-Version: 1.0
Content-Transfer-Encoding: 7BIT
Content-Type: text/plain; charset=US-ASCII
Precedence: bulk
X-Mailing-List:
 linux-kernel@vger.kernel.org

When I was investigating the swap code, I found the following possible
race window:

CPU 1                                   CPU 2
-----                                   -----
do_swap_page
  if (data_race(si->flags & SWP_SYNCHRONOUS_IO)
  swap_readpage
    if (data_race(sis->flags & SWP_FS_OPS)) {
                                        swapoff
                                          ..
                                          p->swap_file = NULL;
                                          ..
    struct file *swap_file = sis->swap_file;
    struct address_space *mapping = swap_file->f_mapping;[oops!]

Note that for pages that are swapped in through the swap cache, this
isn't an issue: the page is locked and the swap entry is marked with
SWAP_HAS_CACHE, so swapoff() cannot proceed until the page has been
unlocked.

Fix this race by using get/put_swap_device() to guard against concurrent
swapoff.

Fixes: 0bcac06f27d7 ("mm,swap: skip swapcache for swapin of synchronous device")
Reported-by: kernel test robot (auto build test ERROR)
Reviewed-by: "Huang, Ying"
Signed-off-by: Miaohe Lin
---
 include/linux/swap.h |  9 +++++++++
 mm/memory.c          | 11 +++++++++--
 2 files changed, 18 insertions(+), 2 deletions(-)

diff --git a/include/linux/swap.h b/include/linux/swap.h
index c9e7fea10b83..46d51d058d05 100644
--- a/include/linux/swap.h
+++ b/include/linux/swap.h
@@ -527,6 +527,15 @@ static inline struct swap_info_struct *swp_swap_info(swp_entry_t entry)
 	return NULL;
 }
 
+static inline struct swap_info_struct *get_swap_device(swp_entry_t entry)
+{
+	return NULL;
+}
+
+static inline void put_swap_device(struct swap_info_struct *si)
+{
+}
+
 #define swap_address_space(entry)		(NULL)
 #define get_nr_swap_pages()			0L
 #define total_swap_pages			0L
diff --git a/mm/memory.c b/mm/memory.c
index 27014c3bde9f..39c910678387 100644
--- a/mm/memory.c
+++ b/mm/memory.c
@@ -3311,6 +3311,7 @@ vm_fault_t do_swap_page(struct vm_fault *vmf)
 {
 	struct vm_area_struct *vma = vmf->vma;
 	struct page *page = NULL, *swapcache;
+	struct swap_info_struct *si = NULL;
 	swp_entry_t entry;
 	pte_t pte;
 	int locked;
@@ -3338,14 +3339,16 @@ vm_fault_t do_swap_page(struct vm_fault *vmf)
 		goto out;
 	}
 
+	/* Prevent swapoff from happening to us. */
+	si = get_swap_device(entry);
+	if (unlikely(!si))
+		goto out;
 
 	delayacct_set_flag(current, DELAYACCT_PF_SWAPIN);
 	page = lookup_swap_cache(entry, vma, vmf->address);
 	swapcache = page;
 
 	if (!page) {
-		struct swap_info_struct *si = swp_swap_info(entry);
-
 		if (data_race(si->flags & SWP_SYNCHRONOUS_IO) &&
 		    __swap_count(entry) == 1) {
 			/* skip swapcache */
@@ -3514,6 +3517,8 @@ vm_fault_t do_swap_page(struct vm_fault *vmf)
 unlock:
 	pte_unmap_unlock(vmf->pte, vmf->ptl);
 out:
+	if (si)
+		put_swap_device(si);
 	return ret;
 out_nomap:
 	pte_unmap_unlock(vmf->pte, vmf->ptl);
@@ -3525,6 +3530,8 @@ vm_fault_t do_swap_page(struct vm_fault *vmf)
 		unlock_page(swapcache);
 		put_page(swapcache);
 	}
+	if (si)
+		put_swap_device(si);
 	return ret;
 }
 
-- 
2.23.0
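
For readers who want to experiment with the get/put guard outside the
kernel, here is a minimal user-space sketch of the same idea using C11
atomics. It is an analogy only and not how get_swap_device() is
implemented: every toy_* name is hypothetical, the plain counter stands in
for whatever reference mechanism the kernel uses, and toy_try_swapoff()
gives up where a real swapoff would wait for readers to drain.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for struct swap_info_struct. */
struct toy_swap_device {
	atomic_int users;	/* 1 = initial reference, 0 = shut down */
	const char *swap_file;	/* stand-in for si->swap_file */
};

/* Take a reference unless the device has already been shut down. */
static struct toy_swap_device *toy_get_device(struct toy_swap_device *dev)
{
	int old = atomic_load(&dev->users);

	while (old > 0) {
		/* On failure 'old' is reloaded and the loop re-checks it. */
		if (atomic_compare_exchange_weak(&dev->users, &old, old + 1))
			return dev;
	}
	return NULL;	/* swapoff won the race; caller must back off */
}

/* Drop a reference taken by toy_get_device(). */
static void toy_put_device(struct toy_swap_device *dev)
{
	int prev = atomic_fetch_sub(&dev->users, 1);

	assert(prev > 0);	/* catches an unbalanced put */
}

/*
 * swapoff analogue: succeeds only when nothing but the initial reference
 * is left, then clears swap_file. Assumption: failing instead of waiting
 * keeps the sketch short.
 */
static bool toy_try_swapoff(struct toy_swap_device *dev)
{
	int expected = 1;

	if (!atomic_compare_exchange_strong(&dev->users, &expected, 0))
		return false;	/* a reader still holds a reference */
	dev->swap_file = NULL;
	return true;
}

/* Fault-path analogue: swap_file is only read while a reference is held. */
static const char *toy_fault(struct toy_swap_device *dev)
{
	struct toy_swap_device *si = toy_get_device(dev);
	const char *file;

	if (!si)
		return NULL;
	file = si->swap_file;
	toy_put_device(si);
	return file;
}
```

A caller would initialise users to 1 with atomic_init(), after which
toy_fault() either sees a valid swap_file or gets NULL, but never
dereferences a pointer that toy_try_swapoff() has already cleared.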