Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Wed, 21 Feb 2001 18:49:41 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Wed, 21 Feb 2001 18:49:32 -0500 Received: from neon-gw.transmeta.com ([209.10.217.66]:51212 "EHLO neon-gw.transmeta.com") by vger.kernel.org with ESMTP id ; Wed, 21 Feb 2001 18:49:18 -0500 Message-ID: <3A9453E9.4457668C@transmeta.com> Date: Wed, 21 Feb 2001 15:48:57 -0800 From: "H. Peter Anvin" Organization: Transmeta Corporation X-Mailer: Mozilla 4.76 [en] (X11; U; Linux 2.4.1 i686) X-Accept-Language: en, sv, no, da, es, fr, ja MIME-Version: 1.0 To: Daniel Phillips , Linux Kernel Mailing List Subject: Re: [rfc] Near-constant time directory index for Ext2 In-Reply-To: <20010221220835.A8781@atrey.karlin.mff.cuni.cz> <20010221223238.A17903@atrey.karlin.mff.cuni.cz> <971ejs$139$1@cesium.transmeta.com> <20010221233204.A26671@atrey.karlin.mff.cuni.cz> <3A94435D.59A4D729@transmeta.com> <20010221235008.A27924@atrey.karlin.mff.cuni.cz> <3A94470C.2E54EB58@transmeta.com> <20010222000755.A29061@atrey.karlin.mff.cuni.cz> <3A944C05.FC2B623A@transmeta.com> <3A945081.E6EB78F4@innominate.de> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Daniel Phillips wrote: > > Have you looked at the structure and algorithms I'm using? I would not > call this a hash table, nor is it a btree. It's a 'hash-keyed > uniform-depth tree'. It never needs to be rehashed (though it might be > worthwhile compacting it at some point). It also never needs to be > rebalanced - it's only two levels deep for up to 50 million files. > I'm curious how you do that. It seems each level would have to be 64K large in order to do that, with a minimum disk space consumption of 128K for a directory. That seems extremely painful *except* in the case of hysterically large directories, which tend to be the exception even on filesystems where they occur. I think I'd rather take the extra complexity and rebalancing cost of a B-tree. > This thing deserves a name of its own. I call it an 'htree'. The > performance should speak for itself - 150 usec/create across 90,000 > files and still a few optmizations to go. > > Random access runs at similar speeds too, it's not just taking advantage > of a long sequence of insertions into the same directory. > > BTW, the discussion in this thread has been very interesting, it just > isn't entirely relevant to my patch :-) -hpa -- at work, in private! "Unix gives you enough rope to shoot yourself in the foot." http://www.zytor.com/~hpa/puzzle.txt - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/