Return-path: Received: from mail-wm0-f67.google.com ([74.125.82.67]:34434 "EHLO mail-wm0-f67.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752328AbcHOKlk (ORCPT ); Mon, 15 Aug 2016 06:41:40 -0400 Subject: Re: [BUGFIX PATCH 1/2] brcmfmac: Check rtnl_lock is locked when removing interface To: Masami Hiramatsu , Arend van Spriel , Franky Lin , Hante Meuleman , Kalle Valo , Pieter-Paul Giesberts References: <147125403645.9434.8008546579326856373.stgit@devbox> <147125405701.9434.12911635695339175773.stgit@devbox> Cc: linux-wireless@vger.kernel.org, brcm80211-dev-list.pdl@broadcom.com, netdev@vger.kernel.org, linux-kernel@vger.kernel.org From: =?UTF-8?B?UmFmYcWCIE1pxYJlY2tp?= Message-ID: <5bb6e373-e110-b1ac-6f65-fdf2f9f059fc@gmail.com> (sfid-20160815_124205_998246_28E72B2D) Date: Mon, 15 Aug 2016 12:41:34 +0200 MIME-Version: 1.0 In-Reply-To: <147125405701.9434.12911635695339175773.stgit@devbox> Content-Type: text/plain; charset=utf-8; format=flowed Sender: linux-wireless-owner@vger.kernel.org List-ID: On 08/15/2016 11:40 AM, Masami Hiramatsu wrote: > Check rtnl_lock is locked in brcmf_p2p_ifp_removed() by passing > rtnl_locked flag. Actually the caller brcmf_del_if() checks whether > the rtnl_lock is locked, but doesn't pass it to brcmf_p2p_ifp_removed(). > > Without this fix, wpa_supplicant goes softlockup with rtnl_lock > holding (this means all other process using netlink are locked up too) > > e.g. > [ 4495.876627] INFO: task wpa_supplicant:7307 blocked for more than 10 seconds. > [ 4495.876632] Tainted: G W 4.8.0-rc1+ #8 > [ 4495.876635] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. > [ 4495.876638] wpa_supplicant D ffff974c647b39a0 0 7307 1 0x00000000 > [ 4495.876644] ffff974c647b39a0 0000000000000000 ffff974c00000000 ffff974c7dc59c58 > [ 4495.876651] ffff974c6b7417c0 ffff974c645017c0 ffff974c647b4000 ffffffff86f16c08 > [ 4495.876657] ffff974c645017c0 0000000000000246 00000000ffffffff ffff974c647b39b8 > [ 4495.876664] Call Trace: > [ 4495.876671] [] schedule+0x3c/0x90 > [ 4495.876676] [] schedule_preempt_disabled+0x15/0x20 > [ 4495.876682] [] mutex_lock_nested+0x176/0x3b0 > [ 4495.876686] [] ? rtnl_lock+0x17/0x20 > [ 4495.876690] [] rtnl_lock+0x17/0x20 > [ 4495.876720] [] brcmf_p2p_ifp_removed+0x4d/0x70 [brcmfmac] > [ 4495.876741] [] brcmf_remove_interface+0x196/0x1b0 [brcmfmac] > [ 4495.876760] [] brcmf_p2p_del_vif+0x111/0x220 [brcmfmac] > [ 4495.876777] [] brcmf_cfg80211_del_iface+0x21b/0x270 [brcmfmac] > [ 4495.876820] [] nl80211_del_interface+0xfe/0x3a0 [cfg80211] > [ 4495.876825] [] genl_family_rcv_msg+0x1b5/0x370 > [ 4495.876832] [] ? trace_hardirqs_on+0xd/0x10 > [ 4495.876836] [] genl_rcv_msg+0x7d/0xb0 > [ 4495.876839] [] ? genl_family_rcv_msg+0x370/0x370 > [ 4495.876846] [] netlink_rcv_skb+0x97/0xb0 > [ 4495.876849] [] genl_rcv+0x28/0x40 > [ 4495.876854] [] netlink_unicast+0x1d3/0x2f0 > [ 4495.876860] [] ? netlink_unicast+0x14b/0x2f0 > [ 4495.876866] [] netlink_sendmsg+0x2eb/0x3a0 > [ 4495.876870] [] sock_sendmsg+0x38/0x50 > [ 4495.876874] [] ___sys_sendmsg+0x27f/0x290 > [ 4495.876882] [] ? mntput_no_expire+0x5/0x3f0 > [ 4495.876888] [] ? mntput_no_expire+0x8e/0x3f0 > [ 4495.876894] [] ? mntput_no_expire+0x5/0x3f0 > [ 4495.876899] [] ? mntput+0x24/0x40 > [ 4495.876904] [] ? __fput+0x190/0x200 > [ 4495.876909] [] __sys_sendmsg+0x45/0x80 > [ 4495.876914] [] SyS_sendmsg+0x12/0x20 > [ 4495.876918] [] entry_SYSCALL_64_fastpath+0x23/0xc1 > [ 4495.876924] [] ? trace_hardirqs_off_caller+0x1f/0xc0 This is probably caused by my commit: a63b09872c1d ("brcmfmac: delete interface directly in code that sent fw request") https://git.kernel.org/cgit/linux/kernel/git/kvalo/wireless-drivers-next.git/commit/?id=a63b09872c1dc0ce0da3628647da67a112b484bf I changed condition for calling brcmf_remove_interface and it seems it broke P2P. Unfortunately I couldn't fully test my change due to firmware not supporting P2P. I did similar fix for error path for P2P with commit b50ddfa8530e ("brcmfmac: fix lockup when removing P2P interface after event timeout") https://git.kernel.org/cgit/linux/kernel/git/kvalo/wireless-drivers-next.git/commit/?id=b50ddfa8530e9b5f52e873fdd6ff04f327a88799 so your change looks like a proper follow-up. > Signed-off-by: Masami Hiramatsu Fixes: a63b09872c1d ("brcmfmac: delete interface directly in code that sent fw request") Acked-by: Rafał Miłecki Kalle: I'm acking this as bugfix for 4.8 release.