Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1757536AbYLPV0z (ORCPT ); Tue, 16 Dec 2008 16:26:55 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1753088AbYLPV0m (ORCPT ); Tue, 16 Dec 2008 16:26:42 -0500 Received: from e1.ny.us.ibm.com ([32.97.182.141]:52415 "EHLO e1.ny.us.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752015AbYLPV0k (ORCPT ); Tue, 16 Dec 2008 16:26:40 -0500 Date: Tue, 16 Dec 2008 13:26:38 -0800 From: "Paul E. McKenney" To: Eric Dumazet Cc: Andrew Morton , Ingo Molnar , Christoph Hellwig , David Miller , "Rafael J. Wysocki" , linux-kernel@vger.kernel.org, "kernel-testers@vger.kernel.org >> Kernel Testers List" , Mike Galbraith , Peter Zijlstra , Linux Netdev List , Christoph Lameter , linux-fsdevel@vger.kernel.org, Al Viro Subject: Re: [PATCH v3 3/7] fs: Introduce a per_cpu last_ino allocator Message-ID: <20081216212638.GN6681@linux.vnet.ibm.com> Reply-To: paulmck@linux.vnet.ibm.com References: <49267694.1030506@cosmosbay.com> <20081121.010508.40225532.davem@davemloft.net> <4926AEDB.10007@cosmosbay.com> <4926D022.5060008@cosmosbay.com> <20081121152148.GA20388@elte.hu> <4926D39D.9050603@cosmosbay.com> <20081121153453.GA23713@elte.hu> <492DDB6A.8090806@cosmosbay.com> <493100B0.6090104@cosmosbay.com> <49419696.9080509@cosmosbay.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <49419696.9080509@cosmosbay.com> User-Agent: Mutt/1.5.15+20070412 (2007-04-11) Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2831 Lines: 95 On Thu, Dec 11, 2008 at 11:39:18PM +0100, Eric Dumazet wrote: > new_inode() dirties a contended cache line to get increasing > inode numbers. > > Solve this problem by providing to each cpu a per_cpu variable, > feeded by the shared last_ino, but once every 1024 allocations. > > This reduce contention on the shared last_ino, and give same > spreading ino numbers than before. > (same wraparound after 2^32 allocations) One question below, but just a clarification. Works correctly as is, though a bit strangely. Reviewed-by: Paul E. McKenney > Signed-off-by: Eric Dumazet > --- > fs/inode.c | 35 ++++++++++++++++++++++++++++++++--- > 1 files changed, 32 insertions(+), 3 deletions(-) > > diff --git a/fs/inode.c b/fs/inode.c > index f94f889..dc8e72a 100644 > --- a/fs/inode.c > +++ b/fs/inode.c > @@ -556,6 +556,36 @@ repeat: > return node ? inode : NULL; > } > > +#ifdef CONFIG_SMP > +/* > + * Each cpu owns a range of 1024 numbers. > + * 'shared_last_ino' is dirtied only once out of 1024 allocations, > + * to renew the exhausted range. > + */ > +static DEFINE_PER_CPU(int, last_ino); > + > +static int last_ino_get(void) > +{ > + static atomic_t shared_last_ino; > + int *p = &get_cpu_var(last_ino); > + int res = *p; > + > + if (unlikely((res & 1023) == 0)) > + res = atomic_add_return(1024, &shared_last_ino) - 1024; > + > + *p = ++res; So the first CPU gets the range [1:1024], the second [1025:2048], and so on, eventually wrapping to [4294966273:0]. Is that the intent? (I don't see a problem with this, just seems a bit strange.) > + put_cpu_var(last_ino); > + return res; > +} > +#else > +static int last_ino_get(void) > +{ > + static int last_ino; > + > + return ++last_ino; > +} > +#endif > + > /** > * new_inode - obtain an inode > * @sb: superblock > @@ -575,7 +605,6 @@ struct inode *new_inode(struct super_block *sb) > * error if st_ino won't fit in target struct field. Use 32bit counter > * here to attempt to avoid that. > */ > - static unsigned int last_ino; > struct inode * inode; > > spin_lock_prefetch(&inode_lock); > @@ -583,11 +612,11 @@ struct inode *new_inode(struct super_block *sb) > inode = alloc_inode(sb); > if (inode) { > percpu_counter_inc(&nr_inodes); > + inode->i_state = 0; > + inode->i_ino = last_ino_get(); > spin_lock(&inode_lock); > list_add(&inode->i_list, &inode_in_use); > list_add(&inode->i_sb_list, &sb->s_inodes); > - inode->i_ino = ++last_ino; > - inode->i_state = 0; > spin_unlock(&inode_lock); > } > return inode; -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/