Return-Path: Received: from mx1.redhat.com ([209.132.183.28]:23750 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754147Ab0IRLJg (ORCPT ); Sat, 18 Sep 2010 07:09:36 -0400 Date: Sat, 18 Sep 2010 07:09:32 -0400 From: Jeff Layton To: "J. Bruce Fields" Cc: Sachin Prabhu , linux-nfs Subject: Re: Should we be aggressively invalidating cache when using -onolock? Message-ID: <20100918070932.1c1bb700@corrin.poochiereds.net> In-Reply-To: <20100917174644.GD25515@fieldses.org> References: <1103741.22.1284726314119.JavaMail.sprabhu@dhcp-1-233.fab.redhat.com> <29790688.25.1284726394683.JavaMail.sprabhu@dhcp-1-233.fab.redhat.com> <20100917174644.GD25515@fieldses.org> Content-Type: text/plain; charset=US-ASCII Sender: linux-nfs-owner@vger.kernel.org List-ID: MIME-Version: 1.0 On Fri, 17 Sep 2010 13:46:44 -0400 "J. Bruce Fields" wrote: > On Fri, Sep 17, 2010 at 08:26:39AM -0400, Sachin Prabhu wrote: > > We came across an issue where the performance of an application using flocks on RHEL 4(2.6.9 kernel) was far better when compared to the performance of the same application on RHEL 5(2.6.18 kernel). The nfs client behavior when performing flocks on RHEL 4 and RHEL 5 differ. To ensure we had a level playing field, we repeated the tests using the mount option -o nolock. > > > > The performance on RHEL 5 improved slightly but was still pretty bad when compared to performance on RHEL 4. On closer observation, it was seen that there are a large number of READ requests on RHEL 5 while on RHEL 4 there were hardly any. This difference in behavior was caused by the code which invalidates the cache in the do_setlk() function and results in the RHEL 5 client performing a large number of READ requests. > > > > In this case, the files were only being accessed by one client. This is why the nolock mount option was used. When running such workloads, the aggressive invalidation of the cache is unnecessary. This patch improves the performance in such a scenario. > > Makes sense to me. > Agreed. This is potentially a huge performance win for some workloads. > (Is it possible that somebody might depend on lock/unlock to keep their > meaning of "invalidate cache/flush changes" even when they don't care > bout checking for inter-client lock conflicts? That sounds like an odd > use case to me.) > Aye, there's the rub. It might cause anyone doing this to regress. My gut feeling is that those people are a minority if any exist at all. The consequences of such a change for them could be pretty ugly but I'm not sure we owe them any consistency guarantees in such a case. -- Jeff Layton