From: Neil Brown
To: david@lang.hm
Cc: linux-kernel@vger.kernel.org, linux-raid@vger.kernel.org
Date: Fri, 15 Jun 2007 13:58:20 +1000
Subject: Re: limits on raid
Message-ID: <18034.3676.477575.490448@notabene.brown>
In-Reply-To: message from david@lang.hm on Thursday June 14
References: <18034.479.256870.600360@notabene.brown>

> On Fri, 15 Jun 2007, Neil Brown wrote:
>
> > On Thursday June 14, david@lang.hm wrote:
> >> what is the limit for the number of devices that can be in a single
> >> array?
> >>
> >> I'm trying to build a 45x750G array and want to experiment with the
> >> different configurations. I'm trying to start with raid6, but mdadm is
> >> complaining about an invalid number of drives
> >>
> >> David Lang
> >
> > "man mdadm", search for "limits". (forgive typos).
>
> thanks.
>
> why does it still default to the old format after so many new versions?
> (by the way, the documentation said 28 devices, but I couldn't get it to
> accept more than 27)

Dunno - maybe I can't count...

> it's now churning away 'rebuilding' the brand new array.
>
> a few questions/thoughts.
>
> why does it need to do a rebuild when making a new array? couldn't it
> just zero all the drives instead? (or better still, just record most of
> the space as 'unused' and initialize it as it starts using it?)

Yes, it could zero all the drives first. But that would take the same
length of time (unless p/q generation was very, very slow), and you
wouldn't be able to start writing data until it had finished.

You can "dd" /dev/zero onto all the drives and then create the array with
--assume-clean if you want to. You could even write a shell script to do
it for you.
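A rough, untested sketch of that script might look like the following
(the device names and device count here are examples only, not the real
45-drive layout):

  #!/bin/sh
  # Sketch only: substitute the real member devices.
  DISKS="/dev/sdb /dev/sdc /dev/sdd /dev/sde"

  # Zero every member first; all-zero data means all-zero P and Q parity,
  # so the array genuinely is clean afterwards.
  for d in $DISKS; do
      dd if=/dev/zero of="$d" bs=1M &
  done
  wait

  # Create the array without triggering the initial resync.
  mdadm --create /dev/md0 --level=6 --raid-devices=4 \
        --assume-clean $DISKS

As noted above, the dd pass takes about as long as the resync it
replaces, and you can't use the array at all until it finishes.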
Yes, you could record which space is used vs unused, but I really don't
think the complexity is worth it.

> while I consider zfs to be ~80% hype, one advantage it could have (but I
> don't know if it has) is that since the filesystem and raid are integrated
> into one layer they can optimize the case where files are being written
> onto unallocated space, and instead of reading blocks from disk to
> calculate the parity they could just put zeros in the unallocated space,
> potentially speeding up the system by reducing the amount of disk I/O.

Certainly. But the raid doesn't need to be tightly integrated into the
filesystem to achieve this. The filesystem need only know the geometry of
the RAID, and when it comes to write, it tries to write full stripes at a
time. If that means writing some extra blocks full of zeros, it can try
to do that. This would require a little bit better communication between
filesystem and raid, but not much.

If anyone has a filesystem that they want to be able to talk to raid
better, they need only ask...

> is there any way that linux would be able to do this sort of thing? or is
> it impossible due to the layering preventing the necessary knowledge from
> being in the right place?

Linux can do anything we want it to. Interfaces can be changed. All it
takes is a fairly well defined requirement, and the will to make it
happen (and some technical expertise, and lots of time .... and coffee?).

NeilBrown

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/