Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752343Ab0KGTcI (ORCPT ); Sun, 7 Nov 2010 14:32:08 -0500 Received: from mail-fx0-f46.google.com ([209.85.161.46]:54956 "EHLO mail-fx0-f46.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752121Ab0KGTcH convert rfc822-to-8bit (ORCPT ); Sun, 7 Nov 2010 14:32:07 -0500 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=jiY7InHuw1/UiUTB9MP+Nx9hJaVsdEQSGN8ggHU9kFbtu7lQkiwaGx0xnVKkEGnyNg P9w1RxBbnKY2elxJFBBQ/sg64WMPvqwOVt5kwi1+mIpya+y3iRow9IfHbpibLhAnLg7h 9aDOcqnW6yqwwEEEgaQLclkM8lR5Qlte3OLGI= MIME-Version: 1.0 In-Reply-To: References: <4CD6B7FA.3050005@redhat.com> Date: Sun, 7 Nov 2010 20:32:05 +0100 Message-ID: Subject: Re: [PATCH] DM-CRYPT: Scale to multiple CPUs v3 on 2.6.37-rc* ? From: Matt To: Milan Broz Cc: Andi Kleen , Linux Kernel , dm-devel@redhat.com, htd@fancy-poultry.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 4385 Lines: 147 On Sun, Nov 7, 2010 at 6:49 PM, Matt wrote: > On Sun, Nov 7, 2010 at 3:30 PM, Milan Broz wrote: >> >> On 11/06/2010 11:16 PM, Matt wrote: >>> before diving into testing out 2.6.37-rc* kernels I wanted to make >>> sure that the patch: >>> >>> [dm-devel] [PATCH] DM-CRYPT: Scale to multiple CPUs v3 >>> >>> is safe to use with e.g. >2.6.37-rc1 kernels >>> >>> I know that it's not a "fix all" patch but it significantly seems to >>> speed up my backup jobs (by factor 2-3) >>> and 2.6.37* has evolved that much that interactivity isn't hurt too much >> >> yes, it should work for the simple mappings without problems. >> >> I hope we will fix the patch soon to be ready for upstream. >> >> Milan >> > > Hi Milan, > > thanks for your answer ! > > Unfortunately I have to post a "Warning" that it's currently not safe > (at least for me) to use it > > a few hours ago before 2.6.37-rc1 was tagged I already had shortly > tested it with the dm-crypt multi-cpu patch and massive "silent" data > corruption or loss occured: > > fortunately I don't/didn't see any data-corruption on my /home > partition (yet) but everytime I boot into my system things are screwed > up on the root-partition, e.g.: > > where > eselect opengl list would show >>Available OpenGL implementations: >> ?[1] ? ati * >> ?[2] ? xorg-x11 > > normally it's >>cat /etc/env.d/03opengl >># Configuration file for eselect >># This file has been automatically generated. >>LDPATH="/usr/lib32/opengl/ati/lib:/usr/lib64/opengl/ati/lib" >>OPENGL_PROFILE="ati" > > > it currently says: > >>eselect opengl list >>Available OpenGL implementations: >> ?[1] ? ati >> ?[2] ? xorg-x11 > > >>cat /etc/env.d/03opengl >># Configuration file for eselect > > and another example was a corrupted /etc/init.d/killprocs > > so since this (a corrupted killprocs) already had happened in the past > (last time due to a hardlock with fglrx/amd's catalyst driver) I > thought it was some kind of system problem which could be fixed: > I fired up a rebuild-job (emerge -e system) for my system and (surely) > some other stuff disappeared - after 2-3 reboots I wanted to continue > finishing the rebuild and gcc was gone (!) > > I don't have the time to re-test everything since this is my testing & > production machine (I'll play back a system-backup tarball) but this > didn't happen (yet) with 2.6.36 and > the following patches related to multi-cpu dm-crypt: > > * [dm-devel] [PATCH] DM-CRYPT: Scale to multiple CPUs v3 > * [PATCH] Use generic private pointer in per-cpu struct > > so it seems to be safe. > > It has to be changes which got introduced with 2.6.37* which broke > stuff. 2.6.36 seems to work perfectly fine with those 2 patches since > several days already > > I'll stick with 2.6.36 for some time now > > Thanks ! > > Matt > sorry - I forgot to include the most important part: the system-partition is on an LVM/Volume Group that sits on an cryptsetup partition so: cryptsetup (luks) -> LVM/Volume Group (2 partitions, one of them system - the other swap) -> system (ext4) [cryptsetup -> LVM -> ext4-partition] the mount-options were/are: noatime,nodiratime,barrier=1 sometimes also noatime,nodiratime,barrier=1,commit=600 (when the system runs for several hours to make it consume less energy/write less) the other settings are: echo "3000" > /proc/sys/vm/dirty_expire_centisecs echo "1500" > /proc/sys/vm/dirty_writeback_centisecs echo "15" > /proc/sys/vm/dirty_background_ratio echo "50" > /proc/sys/vm/dirty_ratio echo "50" > /proc/sys/vm/vfs_cache_pressure i/o scheduler: cfq as already mentioned - this problem didn't appear or wasn't noticable (yet) with or until 2.6.36 - my system-memory should also be error-free (tested via memtest86), the harddisk too (previously tested several times via badblocks without errors) during every shutdown, reboot, hibernate, etc. I do a manual: sync && sdparm -C sync /dev/foo to make sure data gets to the partition I read about barrier-problems and data getting to the partition when using dm-crypt and several layers so I don't know if that could be related Regards Matt -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/