Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp9530834ybl; Fri, 17 Jan 2020 13:46:34 -0800 (PST) X-Google-Smtp-Source: APXvYqzGac7eMzaSHLpxL4P0/zNxOVLX2VL+7Cacl9zlg3qPnYmfuBulBJsizc9KhkSguyRQ8tGp X-Received: by 2002:a05:6830:18d4:: with SMTP id v20mr7837569ote.29.1579297594364; Fri, 17 Jan 2020 13:46:34 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1579297594; cv=none; d=google.com; s=arc-20160816; b=imTkVXgUKUazQjpfjUoMRVfc80luhLMo9LJyRK5Xipourb5FKF3y9jXNJ+l0KTE+21 MCk64fVKAnijoHN0i/Qmdg6STGztUZQSsMSMcG5mw+nWyBs9fW6fdsv3jWWkgrvW5krX yUk6Z0foBzxFlNCL1eumGCd6wJEkkwrt5HjMmIuUTjOY38L6uvNYDf91a0X1XvAgOpxX gwXFOy37hu9MDenNkNBTZQbyr//zTTj9lOpx71IPBnRR5RpfkMvg/zlhWwM/Qi3BeSO6 sCvoWE/o7Yr5vSIlDuJfiJNmd816sjI2z1n7uatb52wnIj9BVbaU8KiUW4mlCIOXRahc YghQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=wMn3ADU3X7T0ZzExO/AGE2GBYRUMgs+Rk2Jnq/t2i3A=; b=NkO5GTEQrf1+bPEdOll2F+1Ug9ZsDfm7lLlFu844rbp+uhz/s5+PLJ/CU42myUbwuz CjIegmrsyOZ/ExqJywrcKudyaAewu5CiVZNKRkAjK4c2rcXgA3UzGvX0b6Ed2PeKQKsb WR3foTKiCCxrIAUtrQOFaJ97hv7IJHuTF770C8fXHaHovCVfY4q6c072x2LWIiWfJzM/ mpQ2EJXSHtNbOxxL7HYa1C1CVauqAZKhF0vpaOHsbasTHOhdzD11/hM48MZUY+xDuUSA 1febplXuKls08zf/OHN9N8ARAz5+XeDyNGSKzc9izP7TdF+x1LcNOymBHI78XSHaVS4m ugaA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id d22si16432396oti.316.2020.01.17.13.46.17; Fri, 17 Jan 2020 13:46:34 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727033AbgAQVqP (ORCPT + 99 others); Fri, 17 Jan 2020 16:46:15 -0500 Received: from fieldses.org ([173.255.197.46]:34008 "EHLO fieldses.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726587AbgAQVqP (ORCPT ); Fri, 17 Jan 2020 16:46:15 -0500 Received: by fieldses.org (Postfix, from userid 2815) id D64511C84; Fri, 17 Jan 2020 16:46:14 -0500 (EST) Date: Fri, 17 Jan 2020 16:46:14 -0500 From: "J. Bruce Fields" To: Chuck Lever Cc: linux-nfs@vger.kernel.org Subject: Re: [PATCH RFC] nfsd: Fix NFSv4 READ on RDMA when using readv Message-ID: <20200117214614.GC27294@fieldses.org> References: <20200115202647.2172.666.stgit@bazille.1015granger.net> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200115202647.2172.666.stgit@bazille.1015granger.net> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Wed, Jan 15, 2020 at 03:37:33PM -0500, Chuck Lever wrote: > svcrdma expects that the READ payload falls precisely into the > xdr_buf's page vector. Adding "xdr->iov = NULL" forces > xdr_reserve_space() to always use pages from xdr->buf->pages when > calling nfsd_readv. > > Also, the XDR padding is problematic. For NFS/RDMA Write chunks, > the padding needs to be in xdr->buf->tail so that the transport can > skip over it. However for NFS/TCP and the NFS/RDMA Reply chunks, > the padding has to be retained. Not yet sure how to add this. > > Fixes: b04209806384 ("nfsd4: allow exotic read compounds") > Buglink: https://bugzilla.kernel.org/show_bug.cgi?id=198053 > Signed-off-by: Chuck Lever > --- > Howdy Bruce- > > I'm struggling with nfsd4_encode_readv(). > > - for NFS/RDMA Write chunks, the READ payload has to be in > buf->pages. I've fixed that. > > - xdr_reserve_space() calls don't need to explicitly align the > @nbytes argument: xdr_reserve_space() already does this? > > - the while loop probably won't work if a later READ in the COMPOUND > doesn't start on a page boundary. This isn't a problem until we > run into a Solaris client in forcedirectio mode. So the Solaris client sends multiple reads per compound in that case? > - the XDR padding doesn't work for NFS/RDMA Write chunks, which are > supposed to skip padding altogether. krb5i/p has to treat read data as padded regardless of the transport, doesn't it? --b. > Do you have suggestions? Thanks in advance. > > > fs/nfsd/nfs4xdr.c | 17 +++++++---------- > 1 file changed, 7 insertions(+), 10 deletions(-) > > diff --git a/fs/nfsd/nfs4xdr.c b/fs/nfsd/nfs4xdr.c > index d2dc4c0e22e8..14c68a136b4e 100644 > --- a/fs/nfsd/nfs4xdr.c > +++ b/fs/nfsd/nfs4xdr.c > @@ -3519,17 +3519,14 @@ static __be32 nfsd4_encode_readv(struct nfsd4_compoundres *resp, > u32 zzz = 0; > int pad; > > + /* Ensure xdr_reserve_space behaves itself */ > + if (xdr->iov == xdr->buf->head) { > + xdr->iov = NULL; > + xdr->end = xdr->p; > + } > + > len = maxcount; > v = 0; > - > - thislen = min_t(long, len, ((void *)xdr->end - (void *)xdr->p)); > - p = xdr_reserve_space(xdr, (thislen+3)&~3); > - WARN_ON_ONCE(!p); > - resp->rqstp->rq_vec[v].iov_base = p; > - resp->rqstp->rq_vec[v].iov_len = thislen; > - v++; > - len -= thislen; > - > while (len) { > thislen = min_t(long, len, PAGE_SIZE); > p = xdr_reserve_space(xdr, (thislen+3)&~3); > @@ -3548,7 +3545,7 @@ static __be32 nfsd4_encode_readv(struct nfsd4_compoundres *resp, > read->rd_length = maxcount; > if (nfserr) > return nfserr; > - xdr_truncate_encode(xdr, starting_len + 8 + ((maxcount+3)&~3)); > + xdr_truncate_encode(xdr, starting_len + 8 + maxcount); > > tmp = htonl(eof); > write_bytes_to_xdr_buf(xdr->buf, starting_len , &tmp, 4);