Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id ; Wed, 13 Nov 2002 06:39:39 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id ; Wed, 13 Nov 2002 06:39:39 -0500 Received: from gate.in-addr.de ([212.8.193.158]:28168 "HELO mx.in-addr.de") by vger.kernel.org with SMTP id ; Wed, 13 Nov 2002 06:39:37 -0500 Date: Wed, 13 Nov 2002 12:46:41 +0100 From: Lars Marowsky-Bree To: Brian Jackson , "linux-kernel@vger.kernel.org" Subject: Re: md on shared storage Message-ID: <20021113114641.GI19811@marowsky-bree.de> References: <20021113002529.7413.qmail@escalade.vistahp.com> Mime-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <20021113002529.7413.qmail@escalade.vistahp.com> User-Agent: Mutt/1.4i X-Ctuhulu: HASTUR Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2224 Lines: 54 On 2002-11-12T18:25:29, Brian Jackson said: > Does the MD driver work with shared storage? I would also be interested to > know if the new DM driver works with shared storage(though I must admit I > didn't really try to answer this one myself, just hoping somebody will > know). The short answer is "Not sanely", as far as I know. RAID0 might be okay, however RAID1/5 get into issues if two nodes update the same data in parallel; they do not coordinate the writes, and thus might stomp over each other. In theory, given a RAID1 with disks {d1,d2}, node n1 might write in order (d2,d1) while n2 writes as (d1,d2), resulting in inconsistent mirrors. This becomes a bigger race window for RAID5, obviously, because more disks are involved. The "multiple nodes beginning to reconstruct the same md device" is also a problem; but even if that was solved that only one node does the recovery, the others would be blocked from doing any IO on that drive for the time being. Another issue that any node will want to update the md superblock regularly. LVM is fine, MD doesn't seem to be. With the MD patches I posted weeks ago, at least MD multipathing should work appropriately; even if the ugliness of multiple nodes scribbling over the superblock remains, it shouldn't matter because the autodetection is based only on the UUID for m-p. In short, you can do "MD", if you don't use it as "shared"; have only one node have a given md device active at any point in time. Thus, no autostart, but manual activation. This rules out "GFS over md", basically. If you want to fix that, it would be cool; it will just require a DLM, membership and communication services in the kernel. ;-) Sincerely, Lars Marowsky-Br?e -- Principal Squirrel SuSE Labs - Research & Development, SuSE Linux AG "If anything can go wrong, it will." "Chance favors the prepared (mind)." -- Capt. Edward A. Murphy -- Louis Pasteur - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/