Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1755368AbXFLWBJ (ORCPT ); Tue, 12 Jun 2007 18:01:09 -0400 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751713AbXFLWA5 (ORCPT ); Tue, 12 Jun 2007 18:00:57 -0400 Received: from mga09.intel.com ([134.134.136.24]:31647 "EHLO mga09.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752396AbXFLWA4 (ORCPT ); Tue, 12 Jun 2007 18:00:56 -0400 X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="4.16,413,1175497200"; d="scan'208";a="95951924" Date: Tue, 12 Jun 2007 14:56:59 -0700 From: "Siddha, Suresh B" To: "Rafael J. Wysocki" Cc: "Eric W. Biederman" , Pavel Machek , Andrew Morton , Andi Kleen , linux-kernel@vger.kernel.org, Neil Brown , Ingo Molnar , "Siddha, Suresh B" , asit.k.mallick@intel.com Subject: Re: [PATCH] x86: Document the hotplug code is incompatible with x86 irq handling Message-ID: <20070612215659.GB17143@linux-os.sc.intel.com> References: <20070607140102.GA9094@ucw.cz> <200706122252.10886.rjw@sisk.pl> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <200706122252.10886.rjw@sisk.pl> User-Agent: Mutt/1.4.1i Sender: linux-kernel-owner@vger.kernel.org X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2014 Lines: 48 On Tue, Jun 12, 2007 at 10:52:09PM +0200, Rafael J. Wysocki wrote: > On Tuesday, 12 June 2007 20:19, Eric W. Biederman wrote: > > Because you are calling unfixably broken code. That should be a decent > > incentive to do something else won't it? > > Can you please tell me _what_ else can be done? > > > IOAPICs do not support what the code is doing here. There is lots of > > practical evidence including bad experiences and practical tests that > > support this. > > Well, AFAICS, Suresh has tried to debug one failing case recently without > any consistent conclusions. I don't know of any other test cases (links, > please?). Rafael, Darrick Wong's issue looks different and hence I was motivated to look and fix if it was a SW issue. For now, I am not able to comprehend what is happening on Darrick Wong's system. Need more help from Darrick as he has the golden failing system. Meanwhile I talked to our hardware folks about the irq migration in general. One good news is that future versions of chipsets will have an Interrupt Remapping feature(for more details please refer to http://download.intel.com/technology/computing/vptech/Intel(r)_VT_for_Direct_IO.pdf) where in we can reliably migrate the irq to someother cpu in the process context. And for the existing platforms, chipset folks don't see a reason why the Eric's algorithm(specified below) should fail. Eric's algorithm for level triggered(edge triggered should be easier than level triggered): - Mask the irq in process context. - Poll RIRR until an ack of for the irq was not pending. - Migrate the irq. Eric had a problem of stuck remote IRR on E75xx chipset with this algorithm and my next step is to reproduce this issue on this platform and understand the behavior. thanks, suresh - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/