Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755988AbZKIOEJ (ORCPT ); Mon, 9 Nov 2009 09:04:09 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1755935AbZKIOEI (ORCPT ); Mon, 9 Nov 2009 09:04:08 -0500 Received: from www.tglx.de ([62.245.132.106]:34910 "EHLO www.tglx.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1755856AbZKIOEI (ORCPT ); Mon, 9 Nov 2009 09:04:08 -0500 Date: Mon, 9 Nov 2009 15:02:33 +0100 (CET) From: Thomas Gleixner To: Ingo Molnar cc: "Rafael J. Wysocki" , LKML , pm list , Greg KH , Linus Torvalds , Jesse Barnes Subject: Re: Help needed: Resume problems in 2.6.32-rc, perhaps related to preempt_count leakage in keventd In-Reply-To: <20091109124937.GA21114@elte.hu> Message-ID: References: <200911091250.31626.rjw@sisk.pl> <20091109120217.GB18399@elte.hu> <200911091324.37955.rjw@sisk.pl> <20091109124937.GA21114@elte.hu> User-Agent: Alpine 2.00 (LFD 1167 2008-08-23) MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 3080 Lines: 65 On Mon, 9 Nov 2009, Ingo Molnar wrote: > > * Rafael J. Wysocki wrote: > > > On Monday 09 November 2009, Ingo Molnar wrote: > > > > > > * Rafael J. Wysocki wrote: > > > > > > > [ 2016.865041] BUG: using smp_processor_id() in preemptible [00000000] code: events/1/29920 > > > > [ 2016.865344] caller is vmstat_update+0x13/0x48 > > > > [ 2016.865522] Pid: 29920, comm: events/1 Not tainted 2.6.31-tst #158 > > > > [ 2016.865700] Call Trace: > > > > [ 2016.865877] [] debug_smp_processor_id+0xc4/0xd4 > > > > [ 2016.866052] [] vmstat_update+0x13/0x48 > > > > [ 2016.866232] [] worker_thread+0x18b/0x22a > > > > [ 2016.866409] [] ? vmstat_update+0x0/0x48 > > > > [ 2016.866578] [] ? autoremove_wake_function+0x0/0x38 > > > > [ 2016.866749] [] ? _spin_unlock_irqrestore+0x35/0x37 > > > > [ 2016.866935] [] ? worker_thread+0x0/0x22a > > > > [ 2016.867113] [] kthread+0x69/0x71 > > > > [ 2016.867278] [] child_rip+0xa/0x20 > > > > [ 2016.867450] [] ? kthread+0x0/0x71 > > > > [ 2016.867618] [] ? child_rip+0x0/0x20 > > > > > > a bug producing similar looking messages was fixed by: > > > > > > fd21073: sched: Fix affinity logic in select_task_rq_fair() > > > > > > but that bug was introduced by: > > > > > > a1f84a3: sched: Check for an idle shared cache in select_task_rq_fair() > > > > I guess these are tip commits? > > yep, tip:sched/core ones. > > > > Which is for v2.6.33, not v2.6.32. > > > > The one I saw was in the Linus' tree, quite obviously. > > ok, then my observation should not apply. I think it _IS_ releated because the worker_thread is CPU affine and the debug_smp_processor_id() check does: if (cpumask_equal(¤t->cpus_allowed, cpumask_of(this_cpu))) which prevents that usage of smp_processor_id() in ksoftirqd and keventd in preempt enabled regions is warned on. We saw exaclty the same back trace with fd21073 (sched: Fix affinity logic in select_task_rq_fair()). Rafael, can you please add a printk to debug_smp_processor_id() so we can see on which CPU we are running ? I suspect we are on the wrong one. Thanks, tglx -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/