Return-path: Received: from he.sipsolutions.net ([78.46.109.217]:43228 "EHLO sipsolutions.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754539Ab1BOMo3 (ORCPT ); Tue, 15 Feb 2011 07:44:29 -0500 Subject: Re: [PATCH v2] mac80211: Fix a race on enabling power save. From: Johannes Berg To: Vivek Natarajan Cc: linville@tuxdriver.com, linux-wireless@vger.kernel.org In-Reply-To: References: <1296822326-4878-1-git-send-email-vnatarajan@atheros.com> <1296824920.3671.4.camel@jlt3.sipsolutions.net> <1296825156.3671.7.camel@jlt3.sipsolutions.net> Content-Type: text/plain; charset="UTF-8" Date: Tue, 15 Feb 2011 13:44:18 +0100 Message-ID: <1297773858.3935.13.camel@jlt3.sipsolutions.net> Mime-Version: 1.0 Sender: linux-wireless-owner@vger.kernel.org List-ID: On Tue, 2011-02-08 at 15:43 +0530, Vivek Natarajan wrote: > > Maybe the subif queues should be stopped, then flush, then tx nullfunc, > > then stop all queues to configure the HW or something like that? > > I tried this sequence: > the subif queues stopped, then flush, then tx nullfunc, and wake subif > queues,(we cannot have the queues stopped till we receive tx_status > because nullfunc might have failed during tx path itself and mac80211 > will not receive tx_status) I've recently been thinking about this -- I'm thinking that maybe we should change this behaviour. Right now the tx() routine basically always returns OK (except in at76) and I suppose instead it could return whether the frame was queued up successfully... > After some time interval, once again stop queues on receiving ack for > nullfunc, configure the hw and then wake up queues. So, during the > above time interval, there is a race of queuing a frame to the hw. I > have tested this and the issue is quickly reproducible. Right. The code that sends the nullfunc -- I think it can probably sleep? If so, instead of starting the queue for the TX it could just flush TX again after sending the nullfunc -- after that either it had status for the frame, or the frame was dropped, no? johannes