Return-path: Received: from mail-iw0-f174.google.com ([209.85.214.174]:65321 "EHLO mail-iw0-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751159Ab0LSHs3 convert rfc822-to-8bit (ORCPT ); Sun, 19 Dec 2010 02:48:29 -0500 MIME-Version: 1.0 In-Reply-To: <201012182347.08635.rjw@sisk.pl> References: <201012181607.26628.rjw@sisk.pl> <201012182347.08635.rjw@sisk.pl> From: Ohad Ben-Cohen Date: Sun, 19 Dec 2010 09:48:07 +0200 Message-ID: Subject: Re: [linux-pm] subtle pm_runtime_put_sync race and sdio functions To: "Rafael J. Wysocki" , Alan Stern Cc: linux-mmc@vger.kernel.org, Linux-pm mailing list , Ido Yariv , linux-wireless@vger.kernel.org, Johannes Berg Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-wireless-owner@vger.kernel.org List-ID: On Sun, Dec 19, 2010 at 12:47 AM, Rafael J. Wysocki wrote: > On Saturday, December 18, 2010, Ohad Ben-Cohen wrote: >> On Sat, Dec 18, 2010 at 5:07 PM, Rafael J. Wysocki wrote: >> > On Saturday, December 18, 2010, Ohad Ben-Cohen wrote: >> >> On Sat, Dec 11, 2010 at 4:50 PM, Alan Stern wrote: >> >> >> Think of a device which you have no way to reset other than powering >> >> >> it down and up again. >> >> >> >> >> >> If that device is managed by runtime PM, the only way to do that is by >> >> >> using put_sync() followed by a get_sync(). Sure, if someone else >> >> >> increased the usage_count of that device this won't work, but then of >> >> >> course it means that the driver is not using runtime PM correctly. >> >> > >> >> > Not so. ?Even if a driver uses runtime PM correctly, there will still >> >> > be times when someone else has increased the usage_count. ?This happens >> >> > during probing and during system resume, for example. >> >> >> >> I'm aware of these two examples; normally we're good with them since >> >> during probing we're not toggling the power, and during suspend/resume >> >> the SDIO core is responsible for manipulating the power (and it does >> >> so directly). Are there (or do you think there will be) additional >> >> examples where this can happen ? >> >> >> >> But this leads me to a real problem which we have encountered. >> >> >> >> During system suspend, our driver is asked (by mac80211's suspend >> >> handler) to power off its device. When this happens, the driver has no >> >> idea that the system is suspending - regular driver code (responsible >> >> to remove the wlan interface and stop the device) is being called. >> > >> > That's where the problem is. ?If there's a difference, from the driver's >> > point of view, between suspend and some other operation, there should be a >> > way to tell the driver what case it actually is dealing with. >> >> Yes, the problem will be solved if the driver would bypass the runtime >> PM framework on system suspend. mac80211 obviously has this >> information, and technically it's very easy to let the driver know >> about it. >> >> But the difference between suspend and normal operation is really >> artificial: in both cases mac80211 just asks the driver to power its >> device down, and the end result is exactly the same (a GPIO line of >> the device is de-asserted in our case). The difference between these >> two scenarios >> exist only because runtime PM is effectively disabled during system >> suspend, and therefore the driver has to look for an alternative way >> to power down the device. > > That's not correct, sorry. > > There is a bug and the bug is that you use the runtime PM to power down > the device in every situation, although you obviously shouldn't do that > (e.g. because it may be disabled by user space or it _is_ disabled by > the PM core during system suspend). That completely makes sense, and helps to precisely define the problem and justify the solution. We will continue to use runtime PM within the SDIO core, in order to opportunistically power down the device (e.g. when a driver is not loaded). In this case, it is perfectly OK if runtime PM is disabled, since there is no functional difference if power is taken down or not - it's purely about minimizing power consumption. OTOH, within the wl12xx driver, where powering down the device is tied to functionality (e.g. error recovery or just hard reset), and must be carried out unconditionally, we will bypass runtime PM. Rafael, Alan, thank you for your time and attention, it really helped. Thanks, Ohad.