Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756136AbcJMSoY (ORCPT ); Thu, 13 Oct 2016 14:44:24 -0400 Received: from mail-by2nam01on0131.outbound.protection.outlook.com ([104.47.34.131]:65161 "EHLO NAM01-BY2-obe.outbound.protection.outlook.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1753785AbcJMSoP (ORCPT ); Thu, 13 Oct 2016 14:44:15 -0400 X-Greylist: delayed 8491 seconds by postgrey-1.27 at vger.kernel.org; Thu, 13 Oct 2016 14:43:52 EDT From: "Kani, Toshimitsu" To: "dan.j.williams@intel.com" CC: "linux-kernel@vger.kernel.org" , "linux-nvdimm@lists.01.org" , "vishal.l.verma@intel.com" Subject: Re: [PATCH] pmem: report error on clear poison failure Thread-Topic: [PATCH] pmem: report error on clear poison failure Thread-Index: AQHSJWpbK8SDkE3LT06eGc6/iqHfVaCmi1yAgAABUIA= Date: Thu, 13 Oct 2016 16:08:02 +0000 Message-ID: <1476374787.20881.34.camel@hpe.com> References: <1476374061-9080-1-git-send-email-toshi.kani@hpe.com> In-Reply-To: Accept-Language: en-US Content-Language: en-US X-MS-Has-Attach: X-MS-TNEF-Correlator: authentication-results: spf=none (sender IP is ) smtp.mailfrom=toshi.kani@hpe.com; x-originating-ip: [15.219.163.9] x-ms-office365-filtering-correlation-id: 86584ab5-70fb-48d5-628e-08d3f3831c0b x-microsoft-exchange-diagnostics: 1;CS1PR84MB0008;7:1fg5AZIu4p7JTUApkSp9cJQIgQRWGVXsGPS4q+wl0/krndNMvfb4DT9tldjfKZlN6f6I3BQwVB27EdbqQZi1QmPzBwM5yf8xLRB1cefqt239fAdFKwjAW2K09D5Tes+XnXgQ41HTgtiWeDWtHpmkhUhBuRaoXWOuFcx2Leed1SZ8+fc0v7B+MGgeLg9VngG5cvCWte3CHND74II3aHW0bIcZYHsEdmbg/ANFd8kPQr+BEpEcRci/ZMk3gmogsCZcARJ2r7S11ElvuNc/EsfJF6wGsEuu3n8SG/o5Yt8ZkBpEwfelv1ascIGk7PLhj7luAVodDymz900cE+tS2wq7V/lABEprOnspLSDRAVDEThM= x-microsoft-antispam: UriScan:;BCL:0;PCL:0;RULEID:;SRVR:CS1PR84MB0008; x-microsoft-antispam-prvs: x-exchange-antispam-report-test: UriScan:(227479698468861); x-exchange-antispam-report-cfa-test: BCL:0;PCL:0;RULEID:(6040176)(601004)(2401047)(8121501046)(5005006)(3002001)(10201501046)(6055026);SRVR:CS1PR84MB0008;BCL:0;PCL:0;RULEID:;SRVR:CS1PR84MB0008; x-forefront-prvs: 0094E3478A x-forefront-antispam-report: SFV:NSPM;SFS:(10019020)(6009001)(7916002)(377424004)(189002)(199003)(377454003)(24454002)(50986999)(106356001)(103116003)(2351001)(110136003)(97736004)(122556002)(189998001)(105586002)(66066001)(92566002)(106116001)(4001150100001)(76176999)(99286002)(10400500002)(11100500001)(54356999)(77096005)(3280700002)(3660700001)(81156014)(36756003)(8676002)(81166006)(5002640100001)(102836003)(19580405001)(101416001)(6916009)(3846002)(586003)(8936002)(2900100001)(19580395003)(2501003)(2906002)(7846002)(5660300001)(6116002)(33646002)(5640700001)(305945005)(2950100002)(4326007)(86362001)(68736007)(7736002)(87936001);DIR:OUT;SFP:1102;SCL:1;SRVR:CS1PR84MB0008;H:CS1PR84MB0005.NAMPRD84.PROD.OUTLOOK.COM;FPR:;SPF:None;PTR:InfoNoRecords;MX:1;A:1;LANG:en; spamdiagnosticoutput: 1:99 spamdiagnosticmetadata: NSPM Content-Type: text/plain; charset="utf-8" Content-ID: MIME-Version: 1.0 X-OriginatorOrg: hpe.com X-MS-Exchange-CrossTenant-originalarrivaltime: 13 Oct 2016 16:08:02.5004 (UTC) X-MS-Exchange-CrossTenant-fromentityheader: Hosted X-MS-Exchange-CrossTenant-id: 105b2061-b669-4b31-92ac-24d304d195dc X-MS-Exchange-Transport-CrossTenantHeadersStamped: CS1PR84MB0008 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Transfer-Encoding: 8bit X-MIME-Autoconverted: from base64 to 8bit by mail.home.local id u9DIiTiG020213 Content-Length: 802 Lines: 22 On Thu, 2016-10-13 at 09:01 -0700, Dan Williams wrote: > On Thu, Oct 13, 2016 at 8:54 AM, Toshi Kani > wrote: > > > > ACPI Clear Uncorrectable Error DSM function may fail or may be > > unsupported on a platform.  pmem_clear_poison() returns without > > clearing badblocks in such cases, which leads to a silent data > > corruption. > > > > Change pmem_do_bvec() and pmem_clear_poison() to return -EIO > > so that filesystem can log an error message. > > What's the silent data corruption scenario?  If the clear poison > fails I'm assuming that the poison will still be notified on the next > read. I agree that the data is eventually read, but there is no guranteed that when it is read soon enough, i.e. user might not access to the data for a long time. Thanks, -Toshi