Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755575AbZIUDMG (ORCPT ); Sun, 20 Sep 2009 23:12:06 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1752756AbZIUDMD (ORCPT ); Sun, 20 Sep 2009 23:12:03 -0400 Received: from mga03.intel.com ([143.182.124.21]:62383 "EHLO mga03.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751807AbZIUDMB (ORCPT ); Sun, 20 Sep 2009 23:12:01 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.44,422,1249282800"; d="scan'208";a="189733249" Subject: Re: + generic-ipi-fix-the-race-between-generic_smp_call_function_-and-hotplug_cfd.patch added to -mm tree From: Suresh Siddha Reply-To: Suresh Siddha To: Xiao Guangrong Cc: Peter Zijlstra , "akpm@linux-foundation.org" , "mm-commits@vger.kernel.org" , "jens.axboe@oracle.com" , "mingo@elte.hu" , "nickpiggin@yahoo.com.au" , "rusty@rustcorp.com.au" , LKML In-Reply-To: <4AB6EB1C.5090106@cn.fujitsu.com> References: <200907310030.n6V0Uqgw001644@imap1.linux-foundation.org> <1252616988.7205.102.camel@laptop> <4AAA0001.2060703@cn.fujitsu.com> <1252696132.3756.21.camel@sbs-t61.sc.intel.com> <4AADEF2F.5080504@cn.fujitsu.com> <1252973802.2899.88.camel@sbs-t61.sc.intel.com> <4AAEF5DC.4070308@cn.fujitsu.com> <1253067602.2667.13.camel@sbs-t61.sc.intel.com> <4AB1A652.1010000@cn.fujitsu.com> <1253326599.3948.732.camel@sbs-t61.sc.intel.com> <4AB6EB1C.5090106@cn.fujitsu.com> Content-Type: text/plain Organization: Intel Corp Date: Sun, 20 Sep 2009 20:11:22 -0700 Message-Id: <1253502682.2528.4.camel@sbs-t61> Mime-Version: 1.0 X-Mailer: Evolution 2.26.3 (2.26.3-1.fc11) Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2142 Lines: 57 On Sun, 2009-09-20 at 19:55 -0700, Xiao Guangrong wrote: > > Suresh Siddha wrote: > > On Wed, 2009-09-16 at 20:00 -0700, Xiao Guangrong wrote: > >> How about manual check/handle pending IPI interruption in the CPU context? > >> like this: > >> --- a/kernel/cpu.c > >> +++ b/kernel/cpu.c > >> @@ -173,6 +173,9 @@ static int __ref take_cpu_down(void *_param) > >> struct take_cpu_down_param *param = _param; > >> int err; > >> > >> + generic_smp_call_function_interrupt(); > >> + generic_smp_call_function_single_interrupt(); > >> + > > > > At this place, how will you ensure that the smp_call_function initiated > > by this dying cpu has reached and got serviced at its destination? > > > > Suresh, sorry for my poor English, Do you mean that how we ensure it has > pending IPI request in the dying cpu? > > generic_smp_call_function_*() will check it, if the cpu has pending request, > then handle it, else directly return. > I am referring to the missing csd_lock_wait() here that you had in the first version of your patch. Let's say, if cpu X is going offline, we need to ensure that the smp_call_function() initiated by cpu X (i.e., smp_call_function IPI sent to some other cpu's from cpu X) got serviced before cpu X goes offline. We can't do csd_lock_wait() here, as that might deadlock (as all the other cpu's are already in stop machine with interrupts disabled). > > All the other cpu's have disabled interrupts in the stop machine state > > by the time we come here and we can't wait. > > > > Why we can't wait? It manual check/handle the pending IPI request not wait > interruption happen. > > It not has race here because all cpu's interruption is disabled, and it not > make stop machine slow because only the dying cpu can enter take_cpu_down(), > we just wait the dying cpu handle it's pending request. > > Am I misunderstand something? > > Thanks, > Xiao > -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/