Received: by 2002:a25:824b:0:0:0:0:0 with SMTP id d11csp2249534ybn; Thu, 26 Sep 2019 09:06:29 -0700 (PDT) X-Google-Smtp-Source: APXvYqz6ZNEvWE4FGSZkpP1VqlLCotUdrsDq0CYCQfkhjFT5E5jJMjlR65f1HN/TKWPBQsRW/RMS X-Received: by 2002:a17:906:4b07:: with SMTP id y7mr3829265eju.126.1569513988887; Thu, 26 Sep 2019 09:06:28 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1569513988; cv=none; d=google.com; s=arc-20160816; b=SNO27zFgH38TfzeZiJ/se89S92LdHJpZudWMri/FtIauCTI330X8cSy57xmLEaWMlV j5jNYCi+Cme8m6dRdJTukNXhhje20dbXIL1N4GLvVLrdZRZN0D0ySRpOjn8NYmymmWhG VDSI4xh7x0DuawFE19uXjvhtFxb/tW5Zd0yOIjTAaDPaR/Mc8gON/gkLGNqFpWXTTjvw HJOssYcXaNVsF2b/vuNYq6GyqiWOE5vrrbXH/AYHGwjjb6tEhASUbLUn7rh2sWIC/gHG lVOAnYCz2WJ31HZ1vGoBid9pQH6a6vOum3ellmy+nGv5a2htyK18iE2CzDbt5lqz40kC UoZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=v8nAjCxT7sExFgUcFti9La+5Yff+3bQ5jwW3e65fA60=; b=flxjs5bvPAl1NhD36R7fwllTT/mRNd0AWRw6tOmBZxMaZPHPZFFM4BW5CRzlqDES4z xpKc4UpUozb8H6O2c+3BfpilXCrAZgrmVCqeNWbDr7BRB91R89Srj9zb9XH4sn/pg2/7 c3rOItpqg1y7PqDlQnIVsZqkN9z/qC3MUQ3d2dIxdvNP65Us7yhmtlHSG9oXOWgVX8EN huw7BLRn2aVOCEjXioT30T7JCgUQuTAGvBM6Zne5oxuK5sAuIHvHm117FGXzvzzozbyc 05UhL05VYlqVdPDld7PTK/vAStzvYEcnTL1cRLssvgkUqi1eat76zildXEZhxGJeWD64 BKCw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w2si1800940edc.386.2019.09.26.09.06.00; Thu, 26 Sep 2019 09:06:28 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727355AbfIZQF5 (ORCPT + 99 others); Thu, 26 Sep 2019 12:05:57 -0400 Received: from fieldses.org ([173.255.197.46]:33998 "EHLO fieldses.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726984AbfIZQF5 (ORCPT ); Thu, 26 Sep 2019 12:05:57 -0400 Received: by fieldses.org (Postfix, from userid 2815) id AC0611513; Thu, 26 Sep 2019 12:05:56 -0400 (EDT) Date: Thu, 26 Sep 2019 12:05:56 -0400 From: Bruce Fields To: Chuck Lever Cc: Trond Myklebust , Kevin Vasko , Linux NFS Mailing List Subject: Re: NFSv4 client locks up on larger writes with Kerberos enabled Message-ID: <20190926160556.GC29192@fieldses.org> References: <20190925164831.GA9366@fieldses.org> <57192382-86BE-4878-9AE0-B22833D56367@oracle.com> <20190925200723.GA11954@fieldses.org> <1BC54D7A-073E-40FD-9AA3-552F1E1BD214@oracle.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <1BC54D7A-073E-40FD-9AA3-552F1E1BD214@oracle.com> User-Agent: Mutt/1.5.21 (2010-09-15) Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Thu, Sep 26, 2019 at 08:55:17AM -0700, Chuck Lever wrote: > > On Sep 25, 2019, at 1:07 PM, Bruce Fields wrote: > > > > On Wed, Sep 25, 2019 at 11:49:14AM -0700, Chuck Lever wrote: > >> Sounds like the NFS server is dropping the connection. With > >> GSS enabled, that's usually a sign that the GSS window has > >> overflowed. > > > > Would that show up in the rpc statistics on the client somehow? > > More likely on the server. The client just sees a disconnect > without any explanation attached. So watching a count of disconnects might at least tell us something? > gss_verify_header is where the checking is done on the server. > Disappointingly, I see some dprintk's in there, but no static > trace events. Kevin, was this a Linux server? --b. > > In that case--I seem to remember there's a way to configure the size of > > the client's slot table, maybe lowering that (decreasing the number of > > rpc's allowed to be outstanding at a time) would work around the > > problem. > > > Should the client be doing something different to avoid or recover from > > overflows of the gss window? > > The client attempts to meter the request stream so that it stays > within the bounds of the GSS sequence number window. The stream > of requests is typically unordered coming out of the transmit > queue. > > There is some new code (since maybe v5.0?) that handles the > metering: gss_xmit_need_reencode().