Return-path: Received: from s3.sipsolutions.net ([5.9.151.49]:59615 "EHLO sipsolutions.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750826AbcDIE5D (ORCPT ); Sat, 9 Apr 2016 00:57:03 -0400 Message-ID: <1460177816.7409.4.camel@sipsolutions.net> (sfid-20160409_065706_546912_E151ECF8) Subject: Re: [PATCH] mac80211: debugfs var for the default aggregation timeout. From: Johannes Berg To: Avery Pennarun Cc: ath9k-devel , linux-wireless , Felix Fietkau Date: Sat, 09 Apr 2016 06:56:56 +0200 In-Reply-To: (sfid-20160409_032748_062530_E2AD90E8) References: <1455658091-28262-1-git-send-email-apenwarr@gmail.com> <1455658091-28262-2-git-send-email-apenwarr@gmail.com> <1456222441.2041.10.camel@sipsolutions.net> <1456257946.9910.23.camel@sipsolutions.net> <1459928436.17504.11.camel@sipsolutions.net> <1460098614.30678.1.camel@sipsolutions.net> <1460098909.30678.2.camel@sipsolutions.net> <1460099746.30678.3.camel@sipsolutions.net> (sfid-20160409_032748_062530_E2AD90E8) Content-Type: text/plain; charset="UTF-8" Mime-Version: 1.0 Sender: linux-wireless-owner@vger.kernel.org List-ID: On Fri, 2016-04-08 at 21:27 -0400, Avery Pennarun wrote: > > Just to be clear, this crash is only from *reading* the agg_status > > files.  I don't know if the crashiness reduces when disabling the > > aggregation timeouts, since that's a separate bug (in which the > > queue gets stuck and the 'pending' column of this file just keeps > > increasing). Oh, right, I was confusing the two. The reading one is even stranger though, in a way. I have no explanation for it (yet). We could suspect memory corruption, but why would it specifically hit issues here? Not very plausible. > Updated .ko file that definitely has debug symbols this time: > http://apenwarr.ca/tmp/mac80211-agg-status-crash-debugsyms.ko > Ok, that confirms what I did manually in my previous email - that it crashed on this: 141 p += scnprintf(p, sizeof(buf) + buf - p, "\t%#.2x", 142 tid_tx ? tid_tx->dialog_token : 0); (and by hand I'd already checked that it crashed dereferencing the tid_tx->dialog_token, since tid_tx was the value 0x5b35da40. If any people more familiar with ARM are reading this - does the value 0x5b35da40 ring a bell? Is that a userspace area? Or an area where the stack would be? All other points around here seem to look like 0xac0c3c58, or maybe 0x838c6958, but not 0x5b35...., how could we end up with that? johannes