From: "J. Bruce Fields" Subject: Re: [BUG] All Kerberos mounts stop working, restarting rpc.svcgssd helps Date: Fri, 10 Nov 2006 15:49:17 -0500 Message-ID: <20061110204917.GF9453@fieldses.org> References: <20061106231821.GK12372@fieldses.org> <20061106232224.GL12372@fieldses.org> <20061106234607.GA18236@uio.no> <20061106234922.GM12372@fieldses.org> <20061107004720.GA16731@uio.no> <20061108004440.GL11346@fieldses.org> <20061108005045.GA4524@uio.no> <4d569c330611101211y4846d2b2l2ad4be385688deff@mail.gmail.com> <20061110202521.GE9453@fieldses.org> <4d569c330611101239v46a8e444u15aa06433259d220@mail.gmail.com> Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Cc: "Steinar H. Gunderson" , nfs@lists.sourceforge.net Return-path: Received: from sc8-sf-mx1-b.sourceforge.net ([10.3.1.91] helo=mail.sourceforge.net) by sc8-sf-list2-new.sourceforge.net with esmtp (Exim 4.43) id 1GidJS-0007NR-8A for nfs@lists.sourceforge.net; Fri, 10 Nov 2006 12:49:22 -0800 Received: from mail.fieldses.org ([66.93.2.214] helo=pickle.fieldses.org) by mail.sourceforge.net with esmtps (TLSv1:AES256-SHA:256) (Exim 4.44) id 1GidJT-0000Cb-9F for nfs@lists.sourceforge.net; Fri, 10 Nov 2006 12:49:23 -0800 To: Kevin Coffman In-Reply-To: <4d569c330611101239v46a8e444u15aa06433259d220@mail.gmail.com> List-Id: "Discussion of NFS under Linux development, interoperability, and testing." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Sender: nfs-bounces@lists.sourceforge.net Errors-To: nfs-bounces@lists.sourceforge.net On Fri, Nov 10, 2006 at 03:39:44PM -0500, Kevin Coffman wrote: > On 11/10/06, J. Bruce Fields wrote: > >Well, if we're seeing a NO_CONTEXT error on the wire, and if (as is > >apparent from the strace), svcgssd isn't itself sending a NO_CONTEXT > >error down, then you must be correct that it's coming from > >gss_write_init_verf(). > > > >So gss_svc_searchbyctx() is returning NULL. > > > >I wonder if the kmalloc() in dup_to_netobj() is failing? If it was > >caused by a memory leak of some kind, that would explain why it takes a > >while for the problem to show up, but not why restarting rpc.svcgssd > >would help. > > That is the part that is most confusing to me. Dumb question time: Is > kernel memory allocated by the kernel while running in the svcgssd > task freed up when that task is restarted? No, kmalloc() allocations belong to the kernel, not to any particular process. So it's a little hard to tell how the server auth_gss kernel code can even be affected by svcgssd restarting. Just one obvious thing: svcgssd holds /proc/net/rpc/auth.rpcsec.init/channel open, so when it's restarted, there will be a close and then an open of that file. I think the code in question is net/sunrpc/cache.c:cache_open() and net/sunrpc/cache.c:cache_release(). Maybe there's a clue there. --b. ------------------------------------------------------------------------- Using Tomcat but need to do more? Need to support web services, security? Get stuff done quickly with pre-integrated technology to make your job easier Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 _______________________________________________ NFS maillist - NFS@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nfs