Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933109Ab1CXKQ5 (ORCPT ); Thu, 24 Mar 2011 06:16:57 -0400 Received: from mail-iy0-f174.google.com ([209.85.210.174]:33650 "EHLO mail-iy0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932341Ab1CXKQz convert rfc822-to-8bit (ORCPT ); Thu, 24 Mar 2011 06:16:55 -0400 DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :cc:content-type:content-transfer-encoding; b=M1+c7N1wj8sD1x9wMlyKAeqdPIjbTDK5mCYimAHPwaZ0uixb7xpQ2V/EM7MEv4w0Me 5T0fPUZAyupMzIgJRYKH5oEHVMHa9hRSctx+pgGWzhEBzoTWsnBmFvAtSi2JvuikvWWq G3EX988pp3PgeNC023oc74v75U7XwC4du3GnE= MIME-Version: 1.0 In-Reply-To: <201103240000.26788.rjw@sisk.pl> References: <201103232322.52485.rjw@sisk.pl> <201103240000.26788.rjw@sisk.pl> Date: Thu, 24 Mar 2011 11:16:54 +0100 Message-ID: Subject: Re: Corrupted files after suspend to disk From: richard -rw- weinberger To: "Rafael J. Wysocki" Cc: LKML , linux-pm@lists.linux-foundation.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 8BIT Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3960 Lines: 100 On Thu, Mar 24, 2011 at 12:00 AM, Rafael J. Wysocki wrote: > On Wednesday, March 23, 2011, richard -rw- weinberger wrote: >> On Wed, Mar 23, 2011 at 11:22 PM, Rafael J. Wysocki wrote: >> > On Wednesday, March 23, 2011, richard -rw- weinberger wrote: >> >> On Wed, Mar 23, 2011 at 11:11 PM, Rafael J. Wysocki wrote: >> >> > On Wednesday, March 23, 2011, richard -rw- weinberger wrote: >> >> >> 2011/3/23 Rafael J. Wysocki : >> >> >> > On Wednesday, March 23, 2011, richard -rw- weinberger wrote: >> >> >> >> Hi, >> >> >> >> >> >> >> >> I'm facing a very strange problem on my netbook (Lenovo Ideapad S10) >> >> >> >> running Linux 2.6.37.4. >> >> >> >> After resuming from s2disk some files are corrupted. >> >> >> >> But when I reboot my netbook everything seems good again. >> >> >> >> >> >> >> >> When I saw the problem the first time the ls command segfaulted always. >> >> >> >> I did a reboot and it worked again. >> >> >> >> >> >> >> >> A few days later zypper crashed. After a reboot it worked again. >> >> >> >> And today ssh crashed. I looked a bit closer and saw it crashed >> >> >> >> somewhere within libcrypto. >> >> >> >> So I made copy libcrypto and rebooted. >> >> >> >> After the reboot ssh worked again but libcrypto and the copy of it hat >> >> >> >> a different sha1 sum! >> >> >> >> WTF?! >> >> >> >> >> >> >> >> Is this a known issue? >> >> >> > >> >> >> > No. >> >> >> > >> >> >> >> dmesgs and config are attached. >> >> >> >> >> >> >> >> The used distribution is openSUSE 11.4 with suspend-0.80.20100129-7.1 >> >> >> >> (default from suse). >> >> >> >> I'm using ext3 as root filesystem. >> >> >> >> What else do you need? >> >> >> > >> >> >> > Whatever you can do to narrow down the problem. ?At the moment I only know >> >> >> > that it's there. >> >> >> >> >> >> I can reproduce the problem now. >> >> >> After ~20 suspend and resume iterations aide finds corrupted files in /lib/. >> >> >> It's always a very basic lib like libcrypto, libglib which is used all >> >> >> the time on my system. >> >> > >> >> > Those files are never intentionally modified, right? >> >> > >> >> >> Maybe it's an issue like this one? >> >> >> https://lkml.org/lkml/2010/12/2/339 >> >> > >> >> > It might have if that patch hadn't been merged before 2.6.37. >> >> > >> >> > Is the system 32-bit or 64-bit? >> >> >> >> It's a 32-bit system. >> >> cmp shows that the corrupted files differ in many bytes (not scattered). >> >> The corrupted bytes are always 0 or 252. >> > >> > Do I understand correctly that the files apparently corrupted after resume >> > are not corrupted any more when you reboot? >> >> Yes. >> Seems like a cache issue. > > There's a couple things you can check before we start asking other people for > help. > > First, it would be good to know if things change when you save the image > into a swap file instead of the swap partition you've been using so far > (I believe it's documented quite well how to do that). > > Second, please verify if using the built-in save/load hibernate code leads > to the same issue (you can hibernate by doing "echo disk > /sys/power/state" > to verify that). > > Of course, please test the above separately. :-) Ok, I'll test this when I'm at home. BTW: dropping the caches helps, when some files seem corrupted. Today /usr/bin/okular was broken. After setting vm.drop_caches=1 it worked again. > Thanks, > Rafael > -- > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in > the body of a message to majordomo@vger.kernel.org > More majordomo info at ?http://vger.kernel.org/majordomo-info.html > Please read the FAQ at ?http://www.tux.org/lkml/ > -- Thanks, //richard -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/