Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752470AbZIKABE (ORCPT ); Thu, 10 Sep 2009 20:01:04 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1750860AbZIKABD (ORCPT ); Thu, 10 Sep 2009 20:01:03 -0400 Received: from mga09.intel.com ([134.134.136.24]:64904 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750853AbZIKABC (ORCPT ); Thu, 10 Sep 2009 20:01:02 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.44,367,1249282800"; d="scan'208";a="549572612" Subject: Re: [-mm] warning during suspend [was: suspend race -mm regression] From: Suresh Siddha Reply-To: Suresh Siddha To: Andrew Morton Cc: Xiao Guangrong , "jirislaby@gmail.com" , "rjw@sisk.pl" , "linux-kernel@vger.kernel.org" , "npiggin@suse.de" , "hpa@zytor.com" , "xiaoguangrong@cn.fujitsu.com" , "peterz@infradead.org" , "rusty@rustcorp.com.au" , "mingo@elte.hu" , "jens.axboe@oracle.com" In-Reply-To: <20090910135745.61de2be6.akpm@linux-foundation.org> References: <1249980093-16319-1-git-send-email-jirislaby@gmail.com> <4A81E073.5080703@gmail.com> <4A9B9C1C.9020506@gmail.com> <200908312132.10904.rjw@sisk.pl> <4AA0FEBF.7040104@gmail.com> <4AA19520.3070708@gmail.com> <4AA19668.5080909@gmail.com> <4AA25C11.9050209@gmail.com> <4AA278A1.2090107@gmail.com> <20090910135745.61de2be6.akpm@linux-foundation.org> Content-Type: text/plain Organization: Intel Corp Date: Thu, 10 Sep 2009 17:00:20 -0700 Message-Id: <1252627221.2816.9.camel@sbs-t61.sc.intel.com> Mime-Version: 1.0 X-Mailer: Evolution 2.26.3 (2.26.3-1.fc11) Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2960 Lines: 73 On Thu, 2009-09-10 at 13:57 -0700, Andrew Morton wrote: > On Sat, 05 Sep 2009 22:41:37 +0800 > Xiao Guangrong wrote: > > > Jiri Slaby ______: > > > On 09/05/2009 12:36 AM, Jiri Slaby wrote: > > >> On 09/05/2009 12:30 AM, Jiri Slaby wrote: > > >>> WARNING: at kernel/smp.c:124 > > >>> __generic_smp_call_function_interrupt+0xfd/0x110() > > >>> Hardware name: To Be Filled By O.E.M. > > >>> Modules linked in: nfs lockd auth_rpcgss sunrpc ath5k ath > > >>> Pid: 3423, comm: pm-suspend Not tainted 2.6.31-rc8-mm1_64 #762 > > >>> Call Trace: > > >>> [] warn_slowpath_common+0x78/0xb0 > > >>> [] warn_slowpath_null+0xf/0x20 > > >>> [] __generic_smp_call_function_interrupt+0xfd/0x110 > > >>> [] hotplug_cfd+0x4a/0xa0 > > >>> [] notifier_call_chain+0x47/0x90 > > >>> [] raw_notifier_call_chain+0x11/0x20 > > >>> [] _cpu_down+0x150/0x2d0 > > >> It's the CPU_DEAD notifier: > > >> ffffffff8141ecd0: 48 83 ce 07 or $0x7,%rsi > > >> ffffffff8141ecd4: 48 c7 c7 08 ff 5d 81 mov > > >> $0xffffffff815dff08,%rdi > > >> ffffffff8141ecdb: e8 20 c6 c3 ff callq ffffffff8105b300 > > >> > > >> ffffffff8141ece0: 3d 02 80 00 00 cmp $0x8002,%eax > > > > > > And it's due to: > > > generic-ipi-fix-the-race-between-generic_smp_call_function_-and-hotplug_cfd.patch > > > > > > > I think it has collision between my patch and below patch: Xiao, I am not sure if the race that you are trying to fix here indeed exists. Doesn't the stop machine that we do as part of cpu down address and avoid the race that you mention? Have you seen any real crashes and hangs or is it theory? And if even the race exists (which I don't think) calling the interrupt handler from the cpu down path looks like a hack. Can you please elaborate why we need this patch? Then we can think of a cleaner solution if needed. > > > > Commit-ID: 269c861baa2fe7c114c3bc7831292758d29eb336 > > Gitweb: http://git.kernel.org/tip/269c861baa2fe7c114c3bc7831292758d29eb336 > > Author: Suresh Siddha > > AuthorDate: Wed, 19 Aug 2009 18:05:35 -0700 > > Committer: H. Peter Anvin > > CommitDate: Fri, 21 Aug 2009 16:25:43 -0700 > > > > generic-ipi: Allow cpus not yet online to call smp_call_function with irqs disabled > > > > My patch is merged at -mm tree, but this patch is base on -tip tree later, so it has this > > problem > > > > Suresh, what your opinion? > > > > Suresh appears to be hiding. Not any more. I am back from vacation :( thanks, suresh -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/