Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753866AbZBAAhn (ORCPT ); Sat, 31 Jan 2009 19:37:43 -0500 Received: (majordomo@vger.kernel.org) by vger.kernel.org id S1751947AbZBAAhd (ORCPT ); Sat, 31 Jan 2009 19:37:33 -0500 Received: from ogre.sisk.pl ([217.79.144.158]:54875 "EHLO ogre.sisk.pl" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751517AbZBAAhc (ORCPT ); Sat, 31 Jan 2009 19:37:32 -0500 From: "Rafael J. Wysocki" To: Linus Torvalds Subject: Re: What should PCI core do during suspend-resume? (was: Re: 2.6.29-rc3: tg3 dead after resume) Date: Sun, 1 Feb 2009 01:36:54 +0100 User-Agent: KMail/1.10.3 (Linux/2.6.29-rc2-tst; KDE/4.1.3; x86_64; ; ) Cc: Parag Warudkar , Matt Carlson , "netdev@vger.kernel.org" , Linux Kernel Mailing List , "David S. Miller" , Andrew Morton References: In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200902010136.55375.rjw@sisk.pl> Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2559 Lines: 50 On Sunday 01 February 2009, Linus Torvalds wrote: > > On Sat, 31 Jan 2009, Linus Torvalds wrote: > > > > But how many people test STR while doing a "ping -f" from another machine? > > > > It _should_ work. Do you guarantee that it does? > > Btw, this really only is interesting if there's a shared interrupt. > > I'm sure that there are network drivers that will crash even on their own > with _just_ the right timing (imagine having a delayed interrupt pending, > then doing the "pci_set_power_state(PCI_D3hot)" thing, and then get the > interrupt handler invoked on another CPU _just_ afterwards), but it's > probably really hard to trigger, and a bug in that specific driver anyway. > > But what's much more interesting (and not necessarily a driver bug, but a > general PM infrastructure problem) is if we have that shared interrupt > case, and the network driver gets lots of interrupts just as "driver X" is > shutting down with that interrupt shared. Then, "driver X" will get > interrupts after the PM layer has put its device to sleep, and now "driver > X" is quite understandably confused - it didn't even do the "put to sleep" > itself, but now its device is no longer responding. > > And now it's not a really unlikely race condition any more. All this leads to the conclusion that we should put devices into low power states with interrupts off and this seems to imply that we'll need to make the AML interpreter allow us to run AML with interrupts off. Still, what about the following rule: - If the device is supposed to wake up the system, the driver should prepare it and put it into a low power state using the existing PCI callbacks, in ->suspend(). In that case, the driver is also required to save the state of the device before putting it into the low power state. It is also required to make sure that its interrupt handler will not get confused in case of shared interrupts. - If the state of the device hasn't been saved by the driver, the core is required to save its state (with interrupts off, I suppose?). - If the state of the device hasn't been saved by the driver, the core will attempt to put the device into a low power state, using the native PCI PM and with interrupts off, unless PCI_DEV_FLAGS_NO_D3 is set in dev->flags. Thanks, Rafael -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/