Received: by 2002:ad5:4acb:0:0:0:0:0 with SMTP id n11csp3946774imw; Mon, 11 Jul 2022 20:47:07 -0700 (PDT) X-Google-Smtp-Source: AGRyM1uMshC3skbdzzmT23OdyBtZZe5GBwtczZdePYRASwKJ7EP9AG3FpiUlVhDY3f+Yww0CmOt9 X-Received: by 2002:a17:906:c152:b0:726:35bd:b3bd with SMTP id dp18-20020a170906c15200b0072635bdb3bdmr22033288ejc.201.1657597627188; Mon, 11 Jul 2022 20:47:07 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1657597627; cv=none; d=google.com; s=arc-20160816; b=qN6D8MzJMrqltS5Jt4L/hXzlDWCE0cByULxZYN4VmCA46dFMygnCujbaNOuKWJeCq8 jpkA2MDxjUb9rXvHjHrgkIOmNhMta/+0cX+cYyY4IEAlGLw95iGird712jR7ECSvNvbi 0u2+JSmXk/eLgrLahtY85jNmJ9QjrTCDcjcG6o4pZnexKFD9b3fA/Rdd5LYrtIH/mzj0 ibrKrzJixBjKfzLw/rY8JKmnpO8Ws+5WFZmwZcgR/ffXgycEEVGLdJeKJvQJltJaqnRA k1lgblJtlbWM2Y0S9NdjhiIKcG3MvbDi4wpjg4tTcQeW77JsNq65WFU/bg2vvTdCTxMb EKNA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=fetpvhLLFbX033QEZ0y/ojPyygR2BZFVpqQBMwNEgGo=; b=I33iwhUSL3Q6YDpmA2k4L5d2YPz6bTXVujAL25Tc7J7oO+BxBV6XXpQFGZH8sOYJFr Y8dWnLnRyDLUdEM6pf15YctMtR/ILkkFc3XasAqHO581BlmJkOgMsqz/T1yk2V5VvtvS hFuYs2FSWX1XGPenXvGflXidy3EUgWjY0aVMOfe//og1jK6Mg/Bqx5V8oycpZ558xC9s hQ6sIq+0T4WItVrP0g9p6YPKFhkiZ07cnV8IzvGwAfsMbE4j6hKGscgxVo7NhObQQX4z 9lXO88OFms+5qXuN9VB7y786BCMejEaleNW648fh3W+Re4+/nSvYUrCMd96tlJN2RUN4 dDRg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linux.dev header.s=key1 header.b=AEjM1vDM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.dev Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id co6-20020a0564020c0600b0043a71e0c8d4si11427226edb.256.2022.07.11.20.46.43; Mon, 11 Jul 2022 20:47:07 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linux.dev header.s=key1 header.b=AEjM1vDM; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linux.dev Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231990AbiGLD3o (ORCPT + 99 others); Mon, 11 Jul 2022 23:29:44 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46446 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232053AbiGLD3c (ORCPT ); Mon, 11 Jul 2022 23:29:32 -0400 Received: from out1.migadu.com (out1.migadu.com [IPv6:2001:41d0:2:863f::]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9D6792250C for ; Mon, 11 Jul 2022 20:29:28 -0700 (PDT) X-Report-Abuse: Please report any abuse attempt to abuse@migadu.com and include these headers. DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linux.dev; s=key1; t=1657596566; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=fetpvhLLFbX033QEZ0y/ojPyygR2BZFVpqQBMwNEgGo=; b=AEjM1vDMNL98lSukj6e3L2un/X0Hu9UbY95wD4fO7CevAFs5qrYMVEHkYTAWD5zN07Lz21 KI0qO92xIuC6N35lhA+8jQ5C0foS/CGsTkWWzEr/rbFy42kobOHQORu5Z7hnlY1n0U8ppk emjKpcixzB5QdXBl5fSZr6ALyBct24w= From: Naoya Horiguchi To: linux-mm@kvack.org Cc: Andrew Morton , David Hildenbrand , Mike Kravetz , Miaohe Lin , Liu Shixin , Yang Shi , Oscar Salvador , Muchun Song , Naoya Horiguchi , linux-kernel@vger.kernel.org Subject: [mm-unstable PATCH v6 4/8] mm, hwpoison: make unpoison aware of raw error info in hwpoisoned hugepage Date: Tue, 12 Jul 2022 12:28:54 +0900 Message-Id: <20220712032858.170414-5-naoya.horiguchi@linux.dev> In-Reply-To: <20220712032858.170414-1-naoya.horiguchi@linux.dev> References: <20220712032858.170414-1-naoya.horiguchi@linux.dev> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Migadu-Flow: FLOW_OUT X-Migadu-Auth-User: linux.dev X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_PASS,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Naoya Horiguchi Raw error info list needs to be removed when hwpoisoned hugetlb is unpoisoned. And unpoison handler needs to know how many errors there are in the target hugepage. So add them. HPageVmemmapOptimized(hpage) and HPageRawHwpUnreliable(hpage)) can't be unpoisoned, so let's skip them. Signed-off-by: Naoya Horiguchi Reported-by: kernel test robot --- v5 -> v6: - set type of return value of hugetlb_clear_page_hwpoison() to void - change type of return value of hugetlb_clear_page_hwpoison() to unsigned long v4 -> v5: - fix type of return value of free_raw_hwp_pages() (found by kernel test robot), - prevent unpoison for HPageVmemmapOptimized and HPageRawHwpUnreliable. --- include/linux/swapops.h | 9 ++++++++ mm/memory-failure.c | 46 ++++++++++++++++++++++++++++++++++++----- 2 files changed, 50 insertions(+), 5 deletions(-) diff --git a/include/linux/swapops.h b/include/linux/swapops.h index a01aeb3fcc0b..ddc98f96ad2c 100644 --- a/include/linux/swapops.h +++ b/include/linux/swapops.h @@ -498,6 +498,11 @@ static inline void num_poisoned_pages_dec(void) atomic_long_dec(&num_poisoned_pages); } +static inline void num_poisoned_pages_sub(long i) +{ + atomic_long_sub(i, &num_poisoned_pages); +} + #else static inline swp_entry_t make_hwpoison_entry(struct page *page) @@ -518,6 +523,10 @@ static inline struct page *hwpoison_entry_to_page(swp_entry_t entry) static inline void num_poisoned_pages_inc(void) { } + +static inline void num_poisoned_pages_sub(long i) +{ +} #endif static inline int non_swap_entry(swp_entry_t entry) diff --git a/mm/memory-failure.c b/mm/memory-failure.c index 74195c181d69..56fd9d809013 100644 --- a/mm/memory-failure.c +++ b/mm/memory-failure.c @@ -1720,21 +1720,38 @@ static int hugetlb_set_page_hwpoison(struct page *hpage, struct page *page) return ret; } -void hugetlb_clear_page_hwpoison(struct page *hpage) +static unsigned long free_raw_hwp_pages(struct page *hpage, bool move_flag) { struct llist_head *head; struct llist_node *t, *tnode; + unsigned long count = 0; - if (!HPageRawHwpUnreliable(hpage)) - ClearPageHWPoison(hpage); + /* + * HPageVmemmapOptimized hugepages can't be unpoisoned because + * struct pages for tail pages are required to free hwpoisoned + * hugepages. HPageRawHwpUnreliable hugepages shouldn't be + * unpoisoned by definition. + */ + if (HPageVmemmapOptimized(hpage) || HPageRawHwpUnreliable(hpage)) + return 0; head = raw_hwp_list_head(hpage); llist_for_each_safe(tnode, t, head->first) { struct raw_hwp_page *p = container_of(tnode, struct raw_hwp_page, node); - SetPageHWPoison(p->page); + if (move_flag) + SetPageHWPoison(p->page); kfree(p); + count++; } llist_del_all(head); + return count; +} + +void hugetlb_clear_page_hwpoison(struct page *hpage) +{ + if (!HPageRawHwpUnreliable(hpage)) + ClearPageHWPoison(hpage); + free_raw_hwp_pages(hpage, true); } /* @@ -1878,6 +1895,10 @@ static inline int try_memory_failure_hugetlb(unsigned long pfn, int flags, int * return 0; } +static inline unsigned long free_raw_hwp_pages(struct page *hpage, bool flag) +{ + return 0; +} #endif /* CONFIG_HUGETLB_PAGE */ static int memory_failure_dev_pagemap(unsigned long pfn, int flags, @@ -2283,6 +2304,7 @@ int unpoison_memory(unsigned long pfn) struct page *p; int ret = -EBUSY; int freeit = 0; + unsigned long count = 1; static DEFINE_RATELIMIT_STATE(unpoison_rs, DEFAULT_RATELIMIT_INTERVAL, DEFAULT_RATELIMIT_BURST); @@ -2330,6 +2352,13 @@ int unpoison_memory(unsigned long pfn) ret = get_hwpoison_page(p, MF_UNPOISON); if (!ret) { + if (PageHuge(p)) { + count = free_raw_hwp_pages(page, false); + if (count == 0) { + ret = -EBUSY; + goto unlock_mutex; + } + } ret = TestClearPageHWPoison(page) ? 0 : -EBUSY; } else if (ret < 0) { if (ret == -EHWPOISON) { @@ -2338,6 +2367,13 @@ int unpoison_memory(unsigned long pfn) unpoison_pr_info("Unpoison: failed to grab page %#lx\n", pfn, &unpoison_rs); } else { + if (PageHuge(p)) { + count = free_raw_hwp_pages(page, false); + if (count == 0) { + ret = -EBUSY; + goto unlock_mutex; + } + } freeit = !!TestClearPageHWPoison(p); put_page(page); @@ -2350,7 +2386,7 @@ int unpoison_memory(unsigned long pfn) unlock_mutex: mutex_unlock(&mf_mutex); if (!ret || freeit) { - num_poisoned_pages_dec(); + num_poisoned_pages_sub(count); unpoison_pr_info("Unpoison: Software-unpoisoned page %#lx\n", page_to_pfn(p), &unpoison_rs); } -- 2.25.1