Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752115AbcDSFyf (ORCPT ); Tue, 19 Apr 2016 01:54:35 -0400 Received: from mail-wm0-f47.google.com ([74.125.82.47]:36502 "EHLO mail-wm0-f47.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751780AbcDSFye (ORCPT ); Tue, 19 Apr 2016 01:54:34 -0400 MIME-Version: 1.0 In-Reply-To: <20160418231551.GA18493@hori1.linux.bs1.fc.nec.co.jp> References: <146097982568.15733.13924990169211134049.stgit@buzz> <20160418231551.GA18493@hori1.linux.bs1.fc.nec.co.jp> Date: Tue, 19 Apr 2016 08:54:32 +0300 Message-ID: Subject: Re: [PATCH] mm/memory-failure: fix race with compound page split/merge From: Konstantin Khlebnikov To: Naoya Horiguchi Cc: Konstantin Khlebnikov , "linux-mm@kvack.org" , "linux-kernel@vger.kernel.org" , "Kirill A. Shutemov" , Andrew Morton Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2048 Lines: 62 On Tue, Apr 19, 2016 at 2:15 AM, Naoya Horiguchi wrote: > # CCed Andrew, > > On Mon, Apr 18, 2016 at 02:43:45PM +0300, Konstantin Khlebnikov wrote: >> Get_hwpoison_page() must recheck relation between head and tail pages. >> >> Signed-off-by: Konstantin Khlebnikov > > Looks good to me. Without this recheck, the race causes kernel to pin > an irrelevant page, and finally makes kernel crash for refcount mismcach... Yep. I seen that a lot. Unfortunately that was in 3.18 branch and it'll took several months to verify this fix. This code and page reference counting overall have changed significantly since then, so probably here is more bugs. For example, I'm not sure about races with atomic set for page reference counting, I've found and removed couple in mellanox driver but there're more in mm and net. > > Acked-by: Naoya Horiguchi > >> --- >> mm/memory-failure.c | 10 +++++++++- >> 1 file changed, 9 insertions(+), 1 deletion(-) >> >> diff --git a/mm/memory-failure.c b/mm/memory-failure.c >> index 78f5f2641b91..ca5acee53b7a 100644 >> --- a/mm/memory-failure.c >> +++ b/mm/memory-failure.c >> @@ -888,7 +888,15 @@ int get_hwpoison_page(struct page *page) >> } >> } >> >> - return get_page_unless_zero(head); >> + if (get_page_unless_zero(head)) { >> + if (head == compound_head(page)) >> + return 1; >> + >> + pr_info("MCE: %#lx cannot catch tail\n", page_to_pfn(page)); > > Recently Chen Yucong replaced the label "MCE:" with "Memory failure:", > but the resolution is trivial, I think. > > Thanks, > Naoya Horiguchi > >> + put_page(head); >> + } >> + >> + return 0; >> } >> EXPORT_SYMBOL_GPL(get_hwpoison_page); >> >> > -- > To unsubscribe, send a message with 'unsubscribe linux-mm' in > the body to majordomo@kvack.org. For more info on Linux MM, > see: http://www.linux-mm.org/ . > Don't email: email@kvack.org