Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752191AbZKIRfI (ORCPT ); Mon, 9 Nov 2009 12:35:08 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751200AbZKIRfH (ORCPT ); Mon, 9 Nov 2009 12:35:07 -0500 Received: from ogre.sisk.pl ([217.79.144.158]:51791 "EHLO ogre.sisk.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751132AbZKIRfG (ORCPT ); Mon, 9 Nov 2009 12:35:06 -0500 From: "Rafael J. Wysocki" To: Mike Galbraith Subject: Re: Help needed: Resume problems in 2.6.32-rc, perhaps related to preempt_count leakage in keventd Date: Mon, 9 Nov 2009 18:36:30 +0100 User-Agent: KMail/1.12.1 (Linux/2.6.31.5-tst; KDE/4.3.1; x86_64; ; ) Cc: Thomas Gleixner , Ingo Molnar , LKML , pm list , Greg KH , Linus Torvalds , Jesse Barnes References: <200911091250.31626.rjw@sisk.pl> <200911091647.54171.rjw@sisk.pl> <1257783553.6408.6.camel@marge.simson.net> In-Reply-To: <1257783553.6408.6.camel@marge.simson.net> MIME-Version: 1.0 Content-Type: Text/Plain; charset="iso-8859-2" Content-Transfer-Encoding: 7bit Message-Id: <200911091836.30349.rjw@sisk.pl> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1514 Lines: 41 On Monday 09 November 2009, Mike Galbraith wrote: > On Mon, 2009-11-09 at 16:47 +0100, Rafael J. Wysocki wrote: > > On Monday 09 November 2009, Mike Galbraith wrote: > > > > > Very likely. What did you do to fix it? > > > > > > You don't really wanna know. In 31 with newidle enabled, the below > > > fixed it. It won't fix 32, though it might cure the resume problem. > > > > OK, I'll give it a try. It doesn't help. Also, I can reproduce the issue with current -git and kernel preepmtion disabled. > I just tried to trigger badness via high speed online/offline combined > with taskset with CONFIG_PREEMPT enabled, and couldn't make it explode. I'm not able to do it this way too, so resume seems to be necessary to trigger it. I'm going try with the suspend debug in the "core" mode. > (damn, wish i could s2ram this box) That need not suffice. I have two other boxes that suspend and resume correctly with 2.6.32-rc, AFAICS. However, there seems to be a systematic error somewhere, since the failure always happens at the same place, ie. list_del_init(cwq->worklist.next); in run_workqueue(), in preemptible as well as in non-preemptible kernels. Which is kind of strange, given the !list_empty(&cwq->worklist) test right before it. Thanks, Rafael -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/