Return-path: Received: from sabertooth01.qualcomm.com ([65.197.215.72]:48054 "EHLO sabertooth01.qualcomm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754354AbaJHLLq (ORCPT ); Wed, 8 Oct 2014 07:11:46 -0400 Date: Wed, 8 Oct 2014 16:43:00 +0530 From: Rajkumar Manoharan To: Michal Kazior CC: linux-wireless , "ath10k@lists.infradead.org" Subject: Re: [PATCH] ath10k: fix kernel panic while shutting down AP Message-ID: <20141008111258.GD25666@qca.qualcomm.com> (sfid-20141008_131149_633633_AD6F8A66) References: <1412759786-10567-1-git-send-email-rmanohar@qti.qualcomm.com> <20141008103312.GA25666@qca.qualcomm.com> <20141008110842.GC25666@qca.qualcomm.com> MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" In-Reply-To: <20141008110842.GC25666@qca.qualcomm.com> Sender: linux-wireless-owner@vger.kernel.org List-ID: On Wed, Oct 08, 2014 at 04:38:44PM +0530, Rajkumar Manoharan wrote: > On Wed, Oct 08, 2014 at 12:50:28PM +0200, Michal Kazior wrote: > > On 8 October 2014 12:33, Rajkumar Manoharan wrote: > > > On Wed, Oct 08, 2014 at 11:45:38AM +0200, Michal Kazior wrote: > > >> On 8 October 2014 11:16, Rajkumar Manoharan wrote: > > >> > The commit "ath10k: workaround fw beaconing bug" is freeing > > >> > DMA-coherent memory in irq context which is hitting BUG ON > > >> > in ARM platforms. Fix this by moving dma_free out of spin > > >> > lock. > > >> > > >> I hardly see how moving the freeing outside the spinlock is a fix. > > >> > > >> > > >> > kernel BUG at mm/vmalloc.c:1512! > > >> > Internal error: Oops - BUG: 0 [#1] PREEMPT SMP ARM > > >> > CPU: 0 PID: 722 Comm: hostapd Not tainted 3.14.0 #3 > > >> > task: dd58b840 ti: da6a6000 task.ti: da6a6000 > > >> > PC is at vunmap+0x24/0x34 > > >> > LR is at __arm_dma_free.isra.21+0x12c/0x190 > > >> > [] (vunmap) from [] (__arm_dma_free.isra.21+0x12c/0x190) > > >> > [] (__arm_dma_free.isra.21) from [] > > >> > (ath10k_mac_vif_beacon_free+0xf4/0x100 [ath10k_core]) > > >> > [] (ath10k_mac_vif_beacon_free [ath10k_core]) from [] > > >> > (ath10k_remove_interface+0x44/0x1ec [ath10k_core]) > > >> > [] (ath10k_remove_interface [ath10k_core]) from [] > > >> > (ieee80211_add_virtual_monitor+0x9d8/0x9f0 [mac80211]) > > >> > [] (ieee80211_add_virtual_monitor [mac80211]) from [] > > >> > (ieee80211_stop+0x10/0x18 [mac80211]) > > >> > [] (ieee80211_stop [mac80211]) from [] > > >> > (__dev_close_many+0x9c/0xcc) > > >> > > >> 1. How can even ieee80211_add_virtual_monitor() call > > >> ath10k_remove_interface()? Upstream ath10k doesn't advertise > > >> IEEE80211_HW_WANT_MONITOR_VIF. This call trace is either invalid, > > >> you're not using upstream ath10k and/or have custom patches applied to > > >> ath10k. > > >> > > > This is the backtrace captured on panic and we are getting the same > > > backtrace consistently. I confirmed that add_virtual_monitor is not > > > called for ath10k as it is not advertising. ath10k_remove_interface is > > > called for master mode. > > > > > >> 2. How can ieee80211_stop() be called from an interrupt context > > >> anyway? ieee80211_stop() calls ieee80211_do_stop() which calls > > >> ieee80211_roc_purge() which tries to get a hold of local->mtx. This > > >> implies ieee80211_stop() isn't design to be run in an interrupt > > >> context to begin with so I don't see why ath10k should even care if > > >> ath10k_remove_interface() is called in an interrupt context at this > > >> point. > > >> > > > in_interrupt is counting soft and hard irqs. ieee80211_stop is not > > > called from interrupt context. In ath10k, by aquiring spin_lock in > > > ath10k_mac_vif_beacon_free is increasing soft irq count. > > > > > > In ARM arch, __arm_dma_free is calling vunmap which might sleep. So it > > > can not be called within spin_lock. > > > > Did you try using GFP_ATOMIC in the dma_alloc_coherent instead of > > moving the spinlock? > > > Nope. The problem is while freeing dma memory not during allocation. > dma_free_coherent won't take any GFP_* flags. > > > > > > > > Similar to dma_alloc_coherent, dma_free_coherent can not be called under > > > soft irq context. > > > > The call trace points to ath10k_mac_vif_beacon_free() which doesn't > > use dma_free_coherent() so why are you blaming it for the BUG_ON? > > > I agree the calltrace is a bogus. ath10k_mac_vif_beacon_cleanup is > calling dma_free_coherent. > > > If anything the offender should be dma_unmap_single() but the thing is > > beacon_buf is always allocated for AP/IBSS now which means > > dma_unmap_single() is never called. For non-AP/IBSS both arvif->beacon > > and arvif->beacon_buf are always NULL so neither > > dma_alloc/free_coherent nor dma_map/unmap_single are called. > > > Agree. We need one more check in ath10k_mac_vif_beacon_free. No additional check is needed. For non beaconing mode, arvif->beacon should be false. isn't it? -Rajkumar