Return-Path: Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:20551 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1752166AbcGAUqz convert rfc822-to-8bit (ORCPT ); Fri, 1 Jul 2016 16:46:55 -0400 Received: from pps.filterd (m0098420.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.11/8.16.0.11) with SMTP id u61Ki6It110511 for ; Fri, 1 Jul 2016 16:46:54 -0400 Received: from e35.co.us.ibm.com (e35.co.us.ibm.com [32.97.110.153]) by mx0b-001b2d01.pphosted.com with ESMTP id 23wpww24bh-1 (version=TLSv1.2 cipher=AES256-SHA bits=256 verify=NOT) for ; Fri, 01 Jul 2016 16:46:54 -0400 Received: from localhost by e35.co.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Fri, 1 Jul 2016 14:46:53 -0600 Received: from b01cxnp22034.gho.pok.ibm.com (b01cxnp22034.gho.pok.ibm.com [9.57.198.24]) by d03dlp01.boulder.ibm.com (Postfix) with ESMTP id 9F7311FF0046 for ; Fri, 1 Jul 2016 14:46:33 -0600 (MDT) Received: from b01ledav005.gho.pok.ibm.com (b01ledav005.gho.pok.ibm.com [9.57.199.110]) by b01cxnp22034.gho.pok.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id u61KknjF65339500 for ; Fri, 1 Jul 2016 20:46:49 GMT Received: from b01ledav005.gho.pok.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 8BAE7AE034 for ; Fri, 1 Jul 2016 16:46:49 -0400 (EDT) Received: from e39.co.us.ibm.com (unknown [9.17.249.49]) by b01ledav005.gho.pok.ibm.com (Postfix) with ESMTPS id 46877AE03B for ; Fri, 1 Jul 2016 16:46:49 -0400 (EDT) Received: from localhost by e39.co.us.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Fri, 1 Jul 2016 14:46:48 -0600 Received: from localhost by smtp.notes.na.collabserv.com with smtp.notes.na.collabserv.com ESMTP for from ; Fri, 1 Jul 2016 20:46:45 -0000 In-Reply-To: <20160701200742.GA24269@fieldses.org> To: Bruce Fields Cc: linux-nfs@vger.kernel.org, "Tomer Perry" Subject: Re: grace period From: "Marc Eshel" Date: Fri, 1 Jul 2016 13:46:42 -0700 References: <1465939516-44769-1-git-send-email-trond.myklebust@primarydata.com> <20160701160857.GB20327@fieldses.org> <20160701200742.GA24269@fieldses.org> MIME-Version: 1.0 Content-Type: text/plain; charset="US-ASCII" Message-Id: Sender: linux-nfs-owner@vger.kernel.org List-ID: This is my v3 test that show the lock still there after echo 0 > /proc/fs/nfsd/threads [root@sonascl21 ~]# cat /etc/redhat-release Red Hat Enterprise Linux Server release 7.2 (Maipo) [root@sonascl21 ~]# uname -a Linux sonascl21.sonasad.almaden.ibm.com 3.10.0-327.el7.x86_64 #1 SMP Thu Oct 29 17:29:29 EDT 2015 x86_64 x86_64 x86_64 GNU/Linux [root@sonascl21 ~]# cat /proc/locks | grep 999 3: POSIX ADVISORY WRITE 2349 00:2a:489486 0 999 [root@sonascl21 ~]# echo 0 > /proc/fs/nfsd/threads [root@sonascl21 ~]# cat /proc/fs/nfsd/threads 0 [root@sonascl21 ~]# cat /proc/locks | grep 999 3: POSIX ADVISORY WRITE 2349 00:2a:489486 0 999 From: Bruce Fields To: Marc Eshel/Almaden/IBM@IBMUS Cc: linux-nfs@vger.kernel.org Date: 07/01/2016 01:07 PM Subject: Re: grace period On Fri, Jul 01, 2016 at 10:31:55AM -0700, Marc Eshel wrote: > It used to be that sending KILL signal to lockd would free locks and start > Grace period, and when setting nfsd threads to zero, nfsd_last_thread() > calls nfsd_shutdown that called lockd_down that I believe was causing both > freeing of locks and starting grace period or maybe it was setting it back > to a value > 0 that started the grace period. OK, apologies, I didn't know (or forgot) that. > Any way starting with the kernels that are in RHEL7.1 and up echo 0 > > /proc/fs/nfsd/threads doesn't do it anymore, I assume going to common > grace period for NLM and NFSv4 changed things. > The question is how to do IP fail-over, so when a node fails and the IP is > moving to another node, we need to go into grace period on all the nodes > in the cluster so the locks of the failed node are not given to anyone > other than the client that is reclaiming his locks. Restarting NFS server > is to distractive. What's the difference? Just that clients don't have to reestablish tcp connections? --b. > For NFSv3 KILL signal to lockd still works but for > NFSv4 have no way to do it for v4. > Marc. > > > > From: Bruce Fields > To: Marc Eshel/Almaden/IBM@IBMUS > Cc: linux-nfs@vger.kernel.org > Date: 07/01/2016 09:09 AM > Subject: Re: grace period > > > > On Thu, Jun 30, 2016 at 02:46:19PM -0700, Marc Eshel wrote: > > I see that setting the number of nfsd threads to 0 (echo 0 > > > /proc/fs/nfsd/threads) is not releasing the locks and putting the server > > > in grace mode. > > Writing 0 to /proc/fs/nfsd/threads shuts down knfsd. So it should > certainly drop locks. If that's not happening, there's a bug, but we'd > need to know more details (version numbers, etc.) to help. > > That alone has never been enough to start a grace period--you'd have to > start knfsd again to do that. > > > What is the best way to go into grace period, in new version of the > > kernel, without restarting the nfs server? > > Restarting the nfs server is the only way. That's true on older kernels > true, as far as I know. (OK, you can apparently make lockd do something > like this with a signal, I don't know if that's used much, and I doubt > it works outside an NFSv3-only environment.) > > So if you want locks dropped and a new grace period, then you should run > "systemctl restart nfs-server", or your distro's equivalent. > > But you're probably doing something more complicated than that. I'm not > sure I understand the question.... > > --b. > > > >