Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp935901ybl; Wed, 28 Aug 2019 07:25:38 -0700 (PDT) X-Google-Smtp-Source: APXvYqz/mMuNL8MRIlLguGBhO41hHroZ7JSNkgZ5RE/vsFX45xIea0QqRMbZBGqlgtTnjmceQ7OZ X-Received: by 2002:a17:902:b106:: with SMTP id q6mr4686400plr.333.1567002338368; Wed, 28 Aug 2019 07:25:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1567002338; cv=none; d=google.com; s=arc-20160816; b=J5Qq+OLp2l3izVO6lHnwSGGUfASo/wiJk5QDEFE8FaCjiGm8soq1DcBceq25LiXzK0 P3wy5zwCdcZUX7Jcb+m/qT1FCItHVYgRw4qR55GXzrp16CT9OSTXliba0xVgJZ8FwDhU 5rK59a+6vR+le9PQiLzLDy+IDN+gYYjBkMrwngA3qqfUCQ5WNC8PqLFYMWJjY+CK9zOP e2JodnW3yrJ3rXI9YyEuqI8yXC9yfUr7P4BZRk+X6uLvH8l4gTibjWIRRzXMfq0i1yQv gl3rg9S4v0iYfqgTC77ISNCftntIi+XsP7RE+5kFjtsylpLqd9mubZ+vsvlXSBYDrxXA eZYA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:to:references:message-id :content-transfer-encoding:cc:date:in-reply-to:from:subject :mime-version:dkim-signature; bh=bzoaqBd5bjkPFVuigducMQDLSXZl+fTjlv6yArOUKjw=; b=waLzCkM4shj7EXwqDRZxDtN6bywG4mqO711jDA3AzhP5OWUUIEiIILCJa7SDhlilkL d0co9AxosRQoKkDna7uOPOXxKtSSxF8m4n+iNmKsSHhVU2Jc8x+eqm0Bl2vBvqs6KdBi zAKh1QKmb8WbmK9FyzOrPh/SW74zwYKreG9REhi6fZeC21r4yJILMvctw1P040rHGH0x Xbhx9lE3RDVRPje1w9vt7e4HcPdoqAvimkGuVppUmspwzx1PUQMiiHC5Ml6AZPnxGbi4 i5T+f67z1f5PLee1lYdhMupqkOT+vqy96RE2bfCvJCzIsSAZ3eltT1ZVT+d9qmSy+KOX rmHQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2019-08-05 header.b="Qz/1UnQ9"; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w13si2539517pfi.181.2019.08.28.07.25.22; Wed, 28 Aug 2019 07:25:38 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2019-08-05 header.b="Qz/1UnQ9"; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726432AbfH1OXr (ORCPT + 99 others); Wed, 28 Aug 2019 10:23:47 -0400 Received: from userp2130.oracle.com ([156.151.31.86]:60952 "EHLO userp2130.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726368AbfH1OXp (ORCPT ); Wed, 28 Aug 2019 10:23:45 -0400 Received: from pps.filterd (userp2130.oracle.com [127.0.0.1]) by userp2130.oracle.com (8.16.0.27/8.16.0.27) with SMTP id x7SEJWXZ082702; Wed, 28 Aug 2019 14:23:38 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=content-type : mime-version : subject : from : in-reply-to : date : cc : content-transfer-encoding : message-id : references : to; s=corp-2019-08-05; bh=bzoaqBd5bjkPFVuigducMQDLSXZl+fTjlv6yArOUKjw=; b=Qz/1UnQ95yOY8N7nqMsywYAq+f2jqPU7ZomDMR7npZ2YCOVS1UTe7+gNYwFHiEehwzL3 0u6ctm14dSm04mrm/PHl1Erku5WV5zpa2CAH4v3t2Dh41GZ4G450vtmfSk4MFzBm3x4M uwMkfRfooh4jjw3WRq79lR+MCODHL56lf7HSELTqt4VQFBdL9M9rHxq4iFeXW7puwkbC zfSk8o2AewASBMXN1kIwOoxFKk3gRQrrY457lhAiREOJBAL8W/brV3v5bGBTAgrPe2yx rKq/bUaoJD1aj5OHKJUv83xGCNokXOMjsM/16x5RF+LQFwumC+V2gIsekNDvAWTx9QbA Qw== Received: from userp3020.oracle.com (userp3020.oracle.com [156.151.31.79]) by userp2130.oracle.com with ESMTP id 2unu76829t-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 28 Aug 2019 14:23:38 +0000 Received: from pps.filterd (userp3020.oracle.com [127.0.0.1]) by userp3020.oracle.com (8.16.0.27/8.16.0.27) with SMTP id x7SEJrCQ039343; Wed, 28 Aug 2019 14:21:37 GMT Received: from userv0122.oracle.com (userv0122.oracle.com [156.151.31.75]) by userp3020.oracle.com with ESMTP id 2untetaqq4-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 28 Aug 2019 14:21:37 +0000 Received: from abhmp0020.oracle.com (abhmp0020.oracle.com [141.146.116.26]) by userv0122.oracle.com (8.14.4/8.14.4) with ESMTP id x7SELZS8002541; Wed, 28 Aug 2019 14:21:35 GMT Received: from anon-dhcp-153.1015granger.net (/68.61.232.219) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Wed, 28 Aug 2019 07:21:35 -0700 Content-Type: text/plain; charset=us-ascii Mime-Version: 1.0 (Mac OS X Mail 12.4 \(3445.104.11\)) Subject: Re: [PATCH 0/3] Handling NFSv3 I/O errors in knfsd From: Chuck Lever In-Reply-To: <31658faabfbe3b4c2925bd899e264adf501fbc75.camel@redhat.com> Date: Wed, 28 Aug 2019 10:21:34 -0400 Cc: Bruce Fields , Bruce Fields , Trond Myklebust , Linux NFS Mailing List Content-Transfer-Encoding: quoted-printable Message-Id: References: <20190826205156.GA27834@fieldses.org> <61F77AD6-BD02-4322-B944-0DC263EB9BD8@oracle.com> <20190827145819.GB9804@fieldses.org> <20190827145912.GC9804@fieldses.org> <1ee75165d548b336f5724b6d655aa2545b9270c3.camel@hammerspace.com> <20190828134839.GA26492@fieldses.org> <45582F32-69C7-4DC8-A608-E45038A44D42@oracle.com> <20190828140044.GA14249@parsley.fieldses.org> <990B7B57-53B8-4ECB-A08B-1AFD2FCE13A6@oracle.com> <31658faabfbe3b4c2925bd899e264adf501fbc75.camel@redhat.com> To: Jeff Layton X-Mailer: Apple Mail (2.3445.104.11) X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9363 signatures=668685 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1906280000 definitions=main-1908280150 X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9363 signatures=668685 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1015 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1906280000 definitions=main-1908280150 Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org > On Aug 28, 2019, at 10:16 AM, Jeff Layton wrote: >=20 > On Wed, 2019-08-28 at 10:03 -0400, Chuck Lever wrote: >>> On Aug 28, 2019, at 10:00 AM, J. Bruce Fields = wrote: >>>=20 >>> On Wed, Aug 28, 2019 at 09:57:25AM -0400, Chuck Lever wrote: >>>>=20 >>>>> On Aug 28, 2019, at 9:51 AM, Jeff Layton = wrote: >>>>>=20 >>>>> On Wed, 2019-08-28 at 09:48 -0400, bfields@fieldses.org wrote: >>>>>> On Tue, Aug 27, 2019 at 03:15:35PM +0000, Trond Myklebust wrote: >>>>>>> I'm open to other suggestions, but I'm having trouble finding = one that >>>>>>> can scale correctly (i.e. not require per-client tracking), = prevent >>>>>>> silent corruption (by causing clients to miss errors), while not >>>>>>> relying on optional features that may not be implemented by all = NFSv3 >>>>>>> clients (e.g. per-file write verifiers are not implemented by = *BSD). >>>>>>>=20 >>>>>>> That said, it seems to me that to do nothing should not be an = option, >>>>>>> as that would imply tolerating silent corruption of file data. >>>>>>=20 >>>>>> So should we increment the boot verifier every time we discover = an error >>>>>> on an asynchronous write? >>>>>>=20 >>>>>=20 >>>>> I think so. Otherwise, only one client will ever see that error. >>>>=20 >>>> +1 >>>>=20 >>>> I'm not familiar with the details of how the Linux NFS server >>>> implements the boot verifier: Will a verifier bump be effective >>>> for all file systems that server exports? >>>=20 >>> Yes. It will be per network namespace, but that's the only limit. >>>=20 >>>> If so, is that an acceptable cost? >>>=20 >>> It means clients will resend all their uncommitted writes. That = could >>> certainly make write errors more expensive. But maybe you've = already >>> got bigger problems if you've got a full or failing disk? >>=20 >> One full filesystem will impact the behavior of all other exported >> filesystems. That might be surprising behavior to a server = administrator. >> I don't have any suggestions other than maintaining a separate = verifier >> for each exported filesystem in each net namespace. >>=20 >>=20 >=20 > Yeah, it's not pretty, but I think the alternative is worse. Most = admins > would take rotten performance over data corruption. Again, I'm not saying we should do nothing. It doesn't seem like a per-export verifier would be much more work than a per-net-namespace verifier. > For the most part, these sorts of errors tend to be rare. EIO is certainly going to be rare, agreed. ENOSPC might not be. > If it turns > out to be a problem we could consider moving the verifier into > svc_export or something? -- Chuck Lever