Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1756187AbXLJL3b (ORCPT ); Mon, 10 Dec 2007 06:29:31 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752133AbXLJL3X (ORCPT ); Mon, 10 Dec 2007 06:29:23 -0500 Received: from mx3.mail.elte.hu ([157.181.1.138]:37244 "EHLO mx3.mail.elte.hu" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751917AbXLJL3W (ORCPT ); Mon, 10 Dec 2007 06:29:22 -0500 Date: Mon, 10 Dec 2007 12:28:32 +0100 From: Ingo Molnar To: Gautham R Shenoy Cc: Jiri Slaby , Andrew Morton , linux-kernel@vger.kernel.org, "Rafael J. Wysocki" , Arjan van de Ven , Thomas Gleixner , Linux-pm mailing list , Dipankar Sarma Subject: Re: broken suspend (sched related) [Was: 2.6.24-rc4-mm1] Message-ID: <20071210112832.GA20189@elte.hu> References: <475A629C.7010408@gmail.com> <20071208152447.GA30270@elte.hu> <475B24F4.3090904@gmail.com> <20071209074647.GE22981@elte.hu> <20071210081952.GA7215@in.ibm.com> <475CFF01.2090502@gmail.com> <20071210091052.GA14487@elte.hu> <20071210101500.GB12880@in.ibm.com> <20071210102157.GB31103@elte.hu> <20071210110818.GC12880@in.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20071210110818.GC12880@in.ibm.com> User-Agent: Mutt/1.5.17 (2007-11-01) X-ELTE-VirusStatus: clean X-ELTE-SpamScore: -1.5 X-ELTE-SpamLevel: X-ELTE-SpamCheck: no X-ELTE-SpamVersion: ELTE 2.0 X-ELTE-SpamCheck-Details: score=-1.5 required=5.9 tests=BAYES_00 autolearn=no SpamAssassin version=3.2.3 -1.5 BAYES_00 BODY: Bayesian spam probability is 0 to 1% [score: 0.0000] Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 1296 Lines: 36 * Gautham R Shenoy wrote: > > say we've got 100 CPUs, so we've got 100 watchdog tasks running - > > one for each CPU. Checking for hung tasks is a global operation not > > a per-CPU operation (we iterate over the global tasklist), hence > > only one CPU should really be calling this function. That > > online-cpus logic achieves this by picking a single CPU. Perhaps it > > would be better to keep a hung_task_checker_cpu variable that is > > driven from a CPU-hotplug-down notifier? That way if a CPU is > > brought down we can update hung_task_checker_cpu to another, > > still-online CPU. (this would also be faster, because event-driven) > > Do you mean something like this? yeah, thanks - queued it up. one question: > +static int check_cpu = -1; > case CPU_ONLINE: > case CPU_ONLINE_FROZEN: > + check_cpu = any_online_cpu(cpu_online_map); > wake_up_process(per_cpu(watchdog_task, hotcpu)); > break; do we bring the boot CPU online too - i.e. will check_cpu be properly initialized on UP too? Ingo -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/