Subject: [PATCH, RT, RFC] Hacks allowing -rt to run on POWER7 / Powerpc.
From: Will Schmidt
Reply-To: will_schmidt@vnet.ibm.com
To: rt-users, Thomas Gleixner, Benjamin Herrenschmidt
Cc: LKML, will_schmidt, linuxppc-dev, Darren Hart
Organization: IBM
Date: Fri, 09 Jul 2010 13:55:01 -0500
Message-ID: <1278701701.24737.19.camel@lexx>

We've been seeing userspace processes randomly SIGSEGV while running the
-RT kernels on POWER7-based systems. After lots of debugging, head
scratching, and experimental changes to the code, the problem has been
narrowed down far enough that we can avoid it by disabling TLB batching.

After some input from Ben and further debugging, we've found that the
restoration of the batch->active value near the end of __switch_to()
seems to be the key.

(The -RT-related changes within __switch_to() in
arch/powerpc/kernel/process.c do the equivalent of an
arch_leave_lazy_mmu_mode() before calling _switch, use a hadbatch flag
to indicate whether batching was active, and then restore that
batch->active value on the way out after the call to _switch. That
particular code is in the -RT branch, and is not found in mainline.)

I'll defer to Ben (or others in the know) on whether this is the proper
solution or whether there is something deeper going on. But if the
right answer is simply to not restore batch->active, the rest of the
CONFIG_PREEMPT_RT changes in __switch_to() should be replaceable with a
single call to arch_leave_lazy_mmu_mode().

The patch below is what I am currently running, successfully, on both
POWER6 and POWER7 systems.

Signed-off-by: Will Schmidt
CC: Ben Herrenschmidt
CC: Thomas Gleixner

---

diff -aurp linux-2.6.33.5-rt23.orig/arch/powerpc/kernel/process.c linux-2.6.33.5-rt23.exp/arch/powerpc/kernel/process.c
--- linux-2.6.33.5-rt23.orig/arch/powerpc/kernel/process.c	2010-06-21 11:41:34.402513904 -0500
+++ linux-2.6.33.5-rt23.exp/arch/powerpc/kernel/process.c	2010-07-09 13:15:13.533269904 -0500
@@ -304,10 +304,6 @@ struct task_struct *__switch_to(struct t
 	struct thread_struct *new_thread, *old_thread;
 	unsigned long flags;
 	struct task_struct *last;
-#if defined(CONFIG_PPC64) && defined (CONFIG_PREEMPT_RT)
-	struct ppc64_tlb_batch *batch;
-	int hadbatch;
-#endif
 
 #ifdef CONFIG_SMP
 	/* avoid complexity of lazy save/restore of fpu
@@ -401,16 +397,6 @@ struct task_struct *__switch_to(struct t
 		new_thread->start_tb = current_tb;
 	}
 
-#ifdef CONFIG_PREEMPT_RT
-	batch = &__get_cpu_var(ppc64_tlb_batch);
-	if (batch->active) {
-		hadbatch = 1;
-		if (batch->index) {
-			__flush_tlb_pending(batch);
-		}
-		batch->active = 0;
-	}
-#endif /* #ifdef CONFIG_PREEMPT_RT */
 #endif
 
 	local_irq_save(flags);
@@ -425,16 +411,13 @@ struct task_struct *__switch_to(struct t
 	 * of sync. Hard disable here.
 	 */
 	hard_irq_disable();
-	last = _switch(old_thread, new_thread);
-
-	local_irq_restore(flags);
 
 #if defined(CONFIG_PPC64) && defined(CONFIG_PREEMPT_RT)
-	if (hadbatch) {
-		batch = &__get_cpu_var(ppc64_tlb_batch);
-		batch->active = 1;
-	}
+	arch_leave_lazy_mmu_mode();
 #endif
+	last = _switch(old_thread, new_thread);
+
+	local_irq_restore(flags);
 
 	return last;
 }
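
For anyone reading along without a tree handy: arch_leave_lazy_mmu_mode()
on ppc64 already performs the flush-and-deactivate sequence that the
removed hunks open-coded. A rough sketch of the 2.6.33-era helper (from
arch/powerpc/include/asm/tlbflush.h, paraphrased from memory, so please
check it against your own tree):

static inline void arch_leave_lazy_mmu_mode(void)
{
	struct ppc64_tlb_batch *batch = &__get_cpu_var(ppc64_tlb_batch);

	/* Flush any PTE invalidations still queued in the per-cpu batch. */
	if (batch->index)
		__flush_tlb_pending(batch);
	/* Turn lazy-MMU batching off for this cpu. */
	batch->active = 0;
}

The notable difference from the removed -RT code is that batch->active
is cleared unconditionally and, with this patch, never set back to 1 on
the way out of __switch_to(), which is exactly the behavior change being
tested here.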