Received: by 2002:a05:6a10:8c0a:0:0:0:0 with SMTP id go10csp904862pxb; Thu, 25 Feb 2021 19:29:43 -0800 (PST) X-Google-Smtp-Source: ABdhPJy+Onw3329YTSNYHwEKqAbHkjHewJTgGkmvjuFyp5CWERq+um2m6h+gf1JPGTYpxjyoK5Bd X-Received: by 2002:a17:906:63c2:: with SMTP id u2mr990950ejk.346.1614310183412; Thu, 25 Feb 2021 19:29:43 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1614310183; cv=none; d=google.com; s=arc-20160816; b=GiZWEHnUFvg8ZekdWXqkNIkn3tte3w+STpirQaETUdk5sFDvDAby3xtSTs+BnJa9t0 BSDyJbh/6g12qifk2GARK4PZTP+7B/FTo4j/mHzXra0U3OV2XNbMNFcugI5HjLWQOFs1 CMnLORZdOlaqGOOaXeC8jhIL0FPp5+k2klDsl1TFpvUo26j1kZz/HbQyqtc5f4yEzm5L u8z+v6df60LbT7T7+CX2Vk4/Mo5+CBIj87wynEhpGiNRiSnrtpBIu8HumdOpVLjBERt5 JydHtAWUSYZtf5EeWFeDpV/kwyhEz0JX5LfSdlEh9uIIzCYB0q8UpsH7HGswC3tpZHr5 QmjQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:cc:to:subject :message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=78pTGGTVx/VDUeMbmDair/QeL261AqryHviFTtBRyxQ=; b=u4cPEgRSZgxF5U9rtJKjetIWWDUBdGb/4hAfzuqiqqOQ38WWuMBNqoKVPsiYuuNtaU htY9V+r7nGmPYPZ3JTXa5kkx+9Pad15mOpDwO3l79s+5rGVtfrNdaYoWaB219mCqMuyO udzjQMGllpn6Crg9y6Y2FjifrA3Gpc5HJpgfYaMGJ8oAxAFgxQG4R9TrCTHp3xUd+XMS OknrccMESE1vdDptMQ7iHG2qZk4otW0SiB2hX3gzoh2LjFgK61MNqbR0OeP2/1Px1aPt JsnUU2726Yj9+4qCugQatqwpAyjV4TKlmkt89Wrsy2bE6BJg6bqhsjqC1QciVhTxqmcR VRKw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lzD5E2Oe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id j8si4905600ejd.623.2021.02.25.19.29.21; Thu, 25 Feb 2021 19:29:43 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=lzD5E2Oe; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229586AbhBZD1l (ORCPT + 99 others); Thu, 25 Feb 2021 22:27:41 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:51036 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229492AbhBZD1j (ORCPT ); Thu, 25 Feb 2021 22:27:39 -0500 Received: from mail-pj1-x102a.google.com (mail-pj1-x102a.google.com [IPv6:2607:f8b0:4864:20::102a]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 04928C06174A; Thu, 25 Feb 2021 19:26:59 -0800 (PST) Received: by mail-pj1-x102a.google.com with SMTP id b15so5199844pjb.0; Thu, 25 Feb 2021 19:26:59 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:references:in-reply-to:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=78pTGGTVx/VDUeMbmDair/QeL261AqryHviFTtBRyxQ=; b=lzD5E2Oep+L+vhd/Q/RENatwuCz++QZ0QwHAhUFQuN+h247ey8hI2Ip3V32+HYHDWG AVADNEmksL6cd5g0x0wgXYkdDBD5g7JrDg8btmO2a3rNaVEUxXqTF38/XgYUMQnc1h6F i/vsAz0n7XkxZx/1x83xCSBpkZTS8JhKfkgjnY1qwY2p9maP5gug2S2mxZmIeWuqF7zi c3NZpbCMpGTbHvJratl4DojAjMf3wcbc74dQ3sYTDTAlMIThl7k02umgA4Aa56MVsfJf qFofPk+NUwJONgTiWwWs7OfALK5WwmhdEz6GMJlBI2J+wuhrDchRU/lkdr8rqHfCbC9n aV2w== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=78pTGGTVx/VDUeMbmDair/QeL261AqryHviFTtBRyxQ=; b=qpZLdBlJJ8jPSh9oz/jhPh7QEzw+1+wK5tSeiZ/teCL7oP0UXM2nPKcpUtMYzb8DZs fvo4JUaupQaj/QSrOerMgfQ3LnzwxNBTWUUcHpXUNo3MQL61MrRKTuli3TF/IGkfwRoW WcSlwf98VCsGuc4UbjELcv1qIGMO5pd5X7HEmSZPWEMeXTKzI/7YIjW3gr+8EQvlqCqX 7casv+MCawdYRzuo23ypLIe8kd3vp1MG/9LxG+e5ujS+CdtPIyYGgpPC8q1BQamnfv/3 aEc+364FBzeguRLXboGpBwONdXcq9fB6MqtRSLk8ichPI74SuJng6AXC/v2H54K9wnz2 +dlw== X-Gm-Message-State: AOAM533n0BQ6lImYe1V8xabxQSY3pqVNppiuS2FUJFOf78VgL+HiUDQs om3B4EVtGF8tWp+8rMHVuudP8nBheVg1TmVta0Q= X-Received: by 2002:a17:90a:cb12:: with SMTP id z18mr1186334pjt.163.1614310018579; Thu, 25 Feb 2021 19:26:58 -0800 (PST) MIME-Version: 1.0 References: <20210224151619.67c29731@alex-virtual-machine> <20210224103105.GA16368@linux> <20210225114329.4e1a41c6@alex-virtual-machine> <20210225112818.GA10141@hori.linux.bs1.fc.nec.co.jp> <20210225113930.GA7227@localhost.localdomain> <20210225123806.GA15006@hori.linux.bs1.fc.nec.co.jp> <20210225181542.GA178925@agluck-desk2.amr.corp.intel.com> <20210226021907.GA27861@hori.linux.bs1.fc.nec.co.jp> In-Reply-To: <20210226021907.GA27861@hori.linux.bs1.fc.nec.co.jp> From: Tony Luck Date: Thu, 25 Feb 2021 19:26:47 -0800 Message-ID: Subject: Re: [PATCH] mm,hwpoison: return -EBUSY when page already poisoned To: =?UTF-8?B?SE9SSUdVQ0hJIE5BT1lBKOWggOWPoyDnm7TkuZ8p?= Cc: Oscar Salvador , Aili Yao , "david@redhat.com" , "akpm@linux-foundation.org" , "bp@alien8.de" , "tglx@linutronix.de" , "mingo@redhat.com" , "hpa@zytor.com" , "x86@kernel.org" , "inux-edac@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "linux-mm@kvack.org" , "yangfeng1@kingsoft.com" Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Feb 25, 2021 at 6:23 PM HORIGUCHI NAOYA(=E5=A0=80=E5=8F=A3=E3=80=80= =E7=9B=B4=E4=B9=9F) wrote: > > On Thu, Feb 25, 2021 at 10:15:42AM -0800, Luck, Tony wrote: > > CPU3 reads the poison and starts along same path that CPU2 > > did. > > I think that the MCE loop happening on CPU2 and CPU3 is unexpected > and these threads should immediately kill the current process on > each CPU. force_sig_mceerr() in kill_me_maybe() is supposed to do it, > so Aili's patch would fix this issue too? It would stop the looping. But for the case where the error came from user code we don't have the virtual address that was accessed at this point (normally this address is found during the reverse lokup from the physical address inside memory_failure()). So we can send a generic SIGBUS, but not one with the usual extra information about the location of the error. -Tony