Received: by 2002:a05:6a10:1a4d:0:0:0:0 with SMTP id nk13csp5362878pxb; Sun, 13 Feb 2022 18:32:33 -0800 (PST) X-Google-Smtp-Source: ABdhPJwvFCW3l0eMtDtJ/pl47jDU5YDoqdtMSqykQiQRD7++K22M1TThGzF1aYo5neOqML4TXg0U X-Received: by 2002:a17:906:d555:: with SMTP id cr21mr10496600ejc.458.1644805952744; Sun, 13 Feb 2022 18:32:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1644805952; cv=none; d=google.com; s=arc-20160816; b=bU07S9iwvAiPV8GlPhkUtkVPSflPTw+K5dPM7AMK47DwPguDwt8ItMmJc4JJJBQcPT 3aKLO9mUv1ByHxu83/wDbPd67sqiG+N/RnVpRIIOa5ihE9msRwOYFUZxOpI276fyt5J1 pjsXmu8C/sjy6hl1UUlk2vfHoNBGkX66Gk2btXe6aYdN5zELDfm/j8xbB+BQ9R41bbVg 27ea6FM/TesWAFLLyPltWQ3A6rOrgK5i1ol/8QL7hNFhe6ni1oImmIvedgZxraZxFsxw KYd7vAm8Y2s4TvIXRI1s/1SkTau+lmlwz7JzmyGjg5BFDqJcJoit9F/JTEZGrgdMhk8q q9xg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :in-reply-to:date:cc:to:from:subject:message-id; bh=pQWTjsDYzqeEOTVPDcMdyVVxq6r2Qx4KzzPNaFtfKuw=; b=l2tlZRl2aEQQN+U5cJKES976hmXB+So53VHcwjJtpxX2PNuYfhwRFaqwekblxMFt04 w5FQ5NhHsuQvbFqE2koRZ5GurN4rpw/AmVvU6CX5X9CfTUrtH9BlJiJFQCfISkZbcXBt HDh9/s5EMGCgx4uDjx1E9VrhMOzCO8q0tD6fdFesdjxP8QfIpJI4Ar03YBK/taJzqNy7 qaMRlQOFvTFBgYHW6RmHNz+kHDmTN95fCtbmJlQ5rQPWUnjQlwjtHqnUk9ZyCaGxKW2I 4nrnJg3IlMrOAjsrfcLnGmRrXOXyab67xWbBRT54LNY3l88o8Tmh7iA0DiMUceB8qrkq huJw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id b37si4114886edf.286.2022.02.13.18.32.09; Sun, 13 Feb 2022 18:32:31 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S236886AbiBMTVq (ORCPT + 99 others); Sun, 13 Feb 2022 14:21:46 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:35206 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229840AbiBMTVq (ORCPT ); Sun, 13 Feb 2022 14:21:46 -0500 Received: from shelob.surriel.com (shelob.surriel.com [96.67.55.147]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B19DC310 for ; Sun, 13 Feb 2022 11:21:39 -0800 (PST) Received: from imladris.surriel.com ([96.67.55.152]) by shelob.surriel.com with esmtpsa (TLS1.2) tls TLS_ECDHE_ECDSA_WITH_AES_256_GCM_SHA384 (Exim 4.94.2) (envelope-from ) id 1nJKR4-0000LX-Ui; Sun, 13 Feb 2022 14:21:26 -0500 Message-ID: <8aafa00865f564d58dfa39a1e2816a8ec0eab097.camel@surriel.com> Subject: Re: [PATCH] mm: clean up hwpoison page cache page in fault path From: Rik van Riel To: John Hubbard Cc: linux-kernel@vger.kernel.org, kernel-team@fb.com, linux-mm@kvack.org, Andrew Morton , Mel Gorman , Johannes Weiner , Matthew Wilcox Date: Sun, 13 Feb 2022 14:21:26 -0500 In-Reply-To: <10f4319c-45fe-2a7b-db6f-2d5fe8ae98a0@nvidia.com> References: <20220211170557.7964a301@imladris.surriel.com> <10f4319c-45fe-2a7b-db6f-2d5fe8ae98a0@nvidia.com> Content-Type: multipart/signed; micalg="pgp-sha256"; protocol="application/pgp-signature"; boundary="=-8nhOn9cXF7OD7/XqPxdK" User-Agent: Evolution 3.42.3 (3.42.3-1.fc35) MIME-Version: 1.0 Sender: riel@shelob.surriel.com X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,RCVD_IN_DNSWL_NONE, RCVD_IN_MSPIKE_H2,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org --=-8nhOn9cXF7OD7/XqPxdK Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Sun, 2022-02-13 at 00:56 -0800, John Hubbard wrote: > On Fri, 11 Feb 2022, Rik van Riel wrote: >=20 > > =C2=A0=C2=A0=C2=A0=20 > > This is particularly embarrassing when the page was offlined due to > > having too many corrected memory errors. Now we are killing tasks > > due to them trying to access memory that probably isn't even > > corrupted. >=20 > I'd recommend deleting that paragraph entirely. It's a separate > question, and it is not necessarily an accurate assessment of that > question either: the engineers who set the thresholds for "too many > corrected errors" may not--in fact, probably *will not*--agree with > your > feeling that the memory is still working and reliable! Fair enough. We try to offline pages before we get to a point where the error correction might no longer be able to correct the error correctly, but I am pretty sure I have seen a few odd kernel crashes following a stream of corrected errors that strongly suggested corruption had in fact happened. I'll take that paragraph out if anybody else asks for further changes for v3 of the patch. --=20 All Rights Reversed. --=-8nhOn9cXF7OD7/XqPxdK Content-Type: application/pgp-signature; name="signature.asc" Content-Description: This is a digitally signed message part Content-Transfer-Encoding: 7bit -----BEGIN PGP SIGNATURE----- iQEzBAABCAAdFiEEKR73pCCtJ5Xj3yADznnekoTE3oMFAmIJWjYACgkQznnekoTE 3oNdogf+Ich/XnBrCMzlpA+55TM6I1e+YVX3wzvOc+1CPhoPe0GwAcxnPBKfDkOd Yh9T7IsM71FRlCHL7pl6P7fppcsrfgqcxm5dcsFwPY9Jcj74GCZs9Fi2jxfo8Exb NOHNFo4mj9X/izCQoKzF887bjoTXZpMhb0RylbNxrm1uxwbw8mSkfRyo7U5kYf24 9gtSCw6Ag/ZKLU5omsYLcvTqeJ5619m3wNwKGXoIKYyYRy74nfTykyD/y+xWIF1z lb3nNMqUJx4B8J0d6J/x1EafCxHLThObqw2dy6/expQpwDT/rzgNipFCdhDOG0rr 1PIkhOQi80oESJc0zmN/ocoz7I2r+A== =wbch -----END PGP SIGNATURE----- --=-8nhOn9cXF7OD7/XqPxdK--