Return-path: Received: from mail-tul01m020-f174.google.com ([209.85.214.174]:63151 "EHLO mail-tul01m020-f174.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751746Ab2AZJQh convert rfc822-to-8bit (ORCPT ); Thu, 26 Jan 2012 04:16:37 -0500 MIME-Version: 1.0 In-Reply-To: <4F21173A.9060405@broadcom.com> References: <4F1FBAA9.4040405@broadcom.com> <4F1FD2AA.9020708@broadcom.com> <4F21173A.9060405@broadcom.com> Date: Thu, 26 Jan 2012 10:16:36 +0100 Message-ID: (sfid-20120126_101643_339339_DA10BF8F) Subject: Re: kernel 3.2.1: BUG: scheduling while atomic: swapper/0/0/0x10000100 From: Francis Moreau To: Arend van Spriel Cc: Josh Boyer , Linux Kernel Mailing List , "linux-wireless@vger.kernel.org" , "John W. Linville" , Greg KH Content-Type: text/plain; charset=ISO-8859-1 Sender: linux-wireless-owner@vger.kernel.org List-ID: On Thu, Jan 26, 2012 at 10:04 AM, Arend van Spriel wrote: > On 01/25/2012 05:49 PM, Josh Boyer wrote: >> On Wed, Jan 25, 2012 at 11:37 AM, Francis Moreau wrote: >>> Hello, >>> >>> On Wed, Jan 25, 2012 at 1:11 PM, Josh Boyer wrote: >>>> >>>> We've had a few similar issues reported in Fedora yesterday after we >>>> rebased to 3.2.1. >>> >>> From which kernel did you rebase to 3.2.1 ? >> >> Fedora 16 went from 3.1.9 to 3.2.1 earlier this week. ?We hadn't been getting >> reports like this until 3.2.1 showed up. ?I asked one of the reporters to try >> 3.2.0 and they say it recreates there still, so it seems the issue showed up >> with the 3.2 release. ?One user managed to capture the trace below using a >> kernel with a variety of debug options turned on. ?It isn't the same backtrace >> but it might be related. >> >> backtrace: >> :BUG: sleeping function called from invalid context at >> drivers/base/power/runtime.c:785 >> :in_atomic(): 1, irqs_disabled(): 0, pid: 0, name: swapper/0 >> :1 lock held by swapper/0/0: >> : #0: ?(&chip->rts51x_suspend_timer){+.-...}, at: [] >> run_timer_softirq+0x111/0x600 >> :Pid: 0, comm: swapper/0 Tainted: G ? ? ? ?W ? ?3.2.0-2.fc17.x86_64.debug #1 >> :Call Trace: >> : ?[] __might_sleep+0x13a/0x1f0 >> : [] ? rts51x_modi_suspend_timer+0x50/0x50 [ums_realtek] >> : [] __pm_runtime_idle+0xa3/0xb0 >> : [] ? rts51x_modi_suspend_timer+0x50/0x50 [ums_realtek] >> : [] usb_autopm_put_interface+0x30/0x40 > > This trace seems to indicate what is wrong. At least for this one user. > The usb_autopm_put_interface() call seems result in a sleep and the > rts51x_suspend_timer_fn is holding a lock. Below the code indicated in > the trace. > > runtime.c: > 768 /** > 769 ?* __pm_runtime_idle - Entry point for runtime idle operations. > 770 ?* @dev: Device to send idle notification for. > 771 ?* @rpmflags: Flag bits. > 772 ?* > 773 ?* If the RPM_GET_PUT flag is set, decrement the device's usage > count and > 774 ?* return immediately if it is larger than zero. ?Then carry out an idle > 775 ?* notification, either synchronous or asynchronous. > 776 ?* > 777 ?* This routine may be called in atomic context if the RPM_ASYNC > flag is set, > 778 ?* or if pm_runtime_irq_safe() has been called. > 779 ?*/ > 780 int __pm_runtime_idle(struct device *dev, int rpmflags) > 781 { > 782 ? ? ? ? unsigned long flags; > 783 ? ? ? ? int retval; > 784 > 785 ? ? ? ? might_sleep_if(!(rpmflags & RPM_ASYNC) && !dev->power.irq_safe); > > This issue can either be caused in ums_realtek module or the usb autopm > code. Have not looked into that. > Just a wild guess: it seems that we're affected using different HW configs with different USB devices, so it would indicate that usb autopm is the culprit. -- Francis