Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1751883AbaFFMcM (ORCPT ); Fri, 6 Jun 2014 08:32:12 -0400 Received: from e28smtp05.in.ibm.com ([122.248.162.5]:54195 "EHLO e28smtp05.in.ibm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750993AbaFFMcL (ORCPT ); Fri, 6 Jun 2014 08:32:11 -0400 Message-ID: <5391B473.2050809@linux.vnet.ibm.com> Date: Fri, 06 Jun 2014 18:00:43 +0530 From: "Srivatsa S. Bhat" User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:15.0) Gecko/20120828 Thunderbird/15.0 MIME-Version: 1.0 To: Vivek Goyal CC: ego@linux.vnet.ibm.com, matt@ozlabs.org, mahesh@linux.vnet.ibm.com, kexec@lists.infradead.org, linux-kernel@vger.kernel.org, suzuki@in.ibm.com, ebiederm@xmission.com, paulus@samba.org, linuxppc-dev@lists.ozlabs.org, Ananth N Mavinakayanahalli , Benjamin Herrenschmidt Subject: Re: [PATCH] powerpc, kexec: Fix "Processor X is stuck" issue during kexec from ST mode References: <20140527105511.24309.74320.stgit@srivatsabhat.in.ibm.com> <20140528133143.GK14863@redhat.com> <538E2FF8.8060707@linux.vnet.ibm.com> <1401833365.3247.36.camel@pasglop> <20140604134649.GB27557@redhat.com> In-Reply-To: <20140604134649.GB27557@redhat.com> Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-TM-AS-MML: disable X-Content-Scanned: Fidelis XPS MAILER x-cbid: 14060612-8256-0000-0000-00000D981B9F Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 06/04/2014 07:16 PM, Vivek Goyal wrote: > On Wed, Jun 04, 2014 at 08:09:25AM +1000, Benjamin Herrenschmidt wrote: >> On Wed, 2014-06-04 at 01:58 +0530, Srivatsa S. Bhat wrote: >>> Yep, that makes sense. But unfortunately I don't have enough insight into >>> why exactly powerpc has to online the CPUs before doing a kexec. I just >>> know from the commit log and the comment mentioned above (and from my own >>> experiments) that the CPUs will get stuck if they were offline. Perhaps >>> somebody more knowledgeable can explain this in detail and suggest a proper >>> long-term solution. >>> >>> Matt, Ben, any thoughts on this? >> >> The problem is with our "soft offline" which we do on some platforms. When we >> offline we don't actually send the CPUs back to firmware or anything like that. >> >> We put them into a very low low power loop inside Linux. >> >> The new kernel has no way to extract them from that loop. So we must re-"online" >> them before we kexec so they can be passed to the new kernel normally (or returned >> to firmware like we do on powernv). > > Srivatsa, > > Looks like your patch has been merged. > > I don't like the following change in arch independent code. > > /* > * migrate_to_reboot_cpu() disables CPU hotplug assuming that > * no further code needs to use CPU hotplug (which is true in > * the reboot case). However, the kexec path depends on using > * CPU hotplug again; so re-enable it here. > */ > cpu_hotplug_enable(); > > As it is very powerpc specific requirement, can you enable hotplug in powerpc > arch dependent code as a short term solution. > I didn't do that because that would mean that the _disable() would be performed inside kernel/kexec.c and the corresponding _enable() would be performed in arch/powerpc/kernel/machine_kexec_64.c -- with no apparent connection between them, which would have made them hard to relate. > Ideally one needs to fix the requirement of online all cpus in powerpc > as a long term solution and then get rid of hotplug enable call. > Yes, I agree. I'm trying out a solution at the moment (see the 4 preliminary patches I sent in my reply to Ben). If that works, we won't need the enable call on powerpc. Regards, Srivatsa S. Bhat -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/