From: Yang Shi
Date: Wed, 22 Sep 2021 13:37:12 -0700
Subject: Re: [v2 PATCH 3/3] mm: hwpoison: dump page for unhandlable page
To: "Luck, Tony"
Cc: HORIGUCHI NAOYA(堀口 直也), Oscar Salvador, tdmackey@twitter.com, David Hildenbrand, Matthew Wilcox, Andrew Morton, Jonathan Corbet, Linux MM, Linux Kernel Mailing List
References: <20210819054116.266126-1-shy828301@gmail.com> <20210819054116.266126-3-shy828301@gmail.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Wed, Sep 22, 2021 at 12:58 PM Yang Shi wrote:
>
> On Wed, Sep 22, 2021 at 12:37 PM Luck, Tony wrote:
> >
> > On Wed, Aug 18, 2021 at 10:41:16PM -0700, Yang Shi wrote:
> > > Currently just very simple message is shown for unhandlable page, e.g.
> > > non-LRU page, like:
> > > soft_offline: 0x1469f2: unknown non LRU page type 5ffff0000000000 ()
> > >
> > > It is not very helpful for further debug, calling dump_page() could show
> > > more useful information.
> >
> > Looks like your code already caught something. An error injection
> > test may have injected into a shared library. Though I'm not sure that
> > the refcount/mapcount in the dump agrees with that diagnosis from the
> > author of this test.
>
> The messages from dump_page() are (unwind them from mce logs):
>
> [ 4817.630520] page:000000003ab9dca4 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0xcef2747
> [ 4817.646860] flags: 0x57ffffc0801000(reserved|hwpoison|node=1|zone=2|lastcpupid=0x1fffff)
> [ 4818.033689] raw: 0057ffffc0801000 ffd400033bc9d1c8 ffd400033bc9d1c8 0000000000000000
> [ 4818.280640] raw: 0000000000000000 0000000000000000 00000001ffffffff 0000000000000000

I missed one line from the dump:

[ 4818.321804] page dumped because: hwpoison: unhandlable page

Anyway, dump_page() is only called when an unhandlable page is encountered.

>
> The page flags tell it is a "reserved" page and mapping is NULL. It
> doesn't seem like a user page or movable page, so hwpoison can't
> handle it so that the messages are dumped.
>
> >
> > Here's what appeared on the console:
> >
> > [ 4817.622254] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4817.630520] page:000000003ab9dca4 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0xcef2747
> > [ 4817.638651] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4817.646860] flags: 0x57ffffc0801000(reserved|hwpoison|node=1|zone=2|lastcpupid=0x1fffff)
> > [ 4818.025515] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4818.033689] raw: 0057ffffc0801000 ffd400033bc9d1c8 ffd400033bc9d1c8 0000000000000000
> > [ 4818.272435] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4818.280640] raw: 0000000000000000 0000000000000000 00000001ffffffff 0000000000000000
> > [ 4818.280658] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4818.313606] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4818.321804] page dumped because: hwpoison: unhandlable page
> > [ 4818.564802] mce: Uncorrected hardware memory error in user-access at cef2747000
> > [ 4818.573043] Memory failure: 0xcef2747: recovery action for unknown page: Ignored
> > [ 4818.595837] Memory failure: 0xcef2747: already hardware poisoned
> > [ 4818.603245] Memory failure: 0xcef2747: Sending SIGBUS to multichase:67460 due to hardware memory corruption
> > [ 4818.614297] Memory failure: 0xcef2747: already hardware poisoned
> >
> > -Tony
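
For anyone who wants to repeat the untangling of the dump_page() lines from the interleaved mce messages, here is a minimal sketch. It assumes the console format quoted above and simply drops lines whose message starts with a known prefix from another source. The prefix list and the function name dump_page_lines() are made up for illustration and are not part of any kernel tooling.

```python
import re

# Console excerpt copied from the report quoted above.
CONSOLE = """\
[ 4817.622254] mce: Uncorrected hardware memory error in user-access at cef2747000
[ 4817.630520] page:000000003ab9dca4 refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0xcef2747
[ 4817.638651] mce: Uncorrected hardware memory error in user-access at cef2747000
[ 4817.646860] flags: 0x57ffffc0801000(reserved|hwpoison|node=1|zone=2|lastcpupid=0x1fffff)
[ 4818.025515] mce: Uncorrected hardware memory error in user-access at cef2747000
[ 4818.033689] raw: 0057ffffc0801000 ffd400033bc9d1c8 ffd400033bc9d1c8 0000000000000000
[ 4818.272435] mce: Uncorrected hardware memory error in user-access at cef2747000
[ 4818.280640] raw: 0000000000000000 0000000000000000 00000001ffffffff 0000000000000000
[ 4818.280658] mce: Uncorrected hardware memory error in user-access at cef2747000
[ 4818.321804] page dumped because: hwpoison: unhandlable page
[ 4818.573043] Memory failure: 0xcef2747: recovery action for unknown page: Ignored
"""

# "[ <seconds>.<micros>] <message>" as printed on the console.
LINE_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+(.*)$")

# Assumption: in this log, everything that is not part of the page dump
# starts with one of these prefixes. Other logs may need a different list.
OTHER_PREFIXES = ("mce:", "Memory failure:")


def dump_page_lines(console):
    """Return the console lines that do not start with a known other prefix."""
    kept = []
    for line in console.splitlines():
        match = LINE_RE.match(line.strip())
        if match is None:
            continue  # not a timestamped console line
        if match.group(2).startswith(OTHER_PREFIXES):
            continue  # message from mce or the memory-failure handler
        kept.append(line.strip())
    return kept


if __name__ == "__main__":
    for line in dump_page_lines(CONSOLE):
        print(line)
```

Filtering by prefix is only good enough for this particular log. It would keep any unrelated message that happens to land in the same window, so the result still needs a quick look by eye.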