Return-Path: MIME-Version: 1.0 Sender: armansito@google.com In-Reply-To: <50181A20-3313-48C7-924B-05C67B975139@holtmann.org> References: <50181A20-3313-48C7-924B-05C67B975139@holtmann.org> Date: Wed, 12 Nov 2014 20:51:53 -0800 Message-ID: Subject: Re: Handling EBUSY for LE connections. From: Arman Uguray To: Marcel Holtmann Cc: BlueZ development , Jakub Pawlowski Content-Type: text/plain; charset=UTF-8 List-ID: Hi Marcel, > On Wed, Nov 12, 2014 at 8:41 PM, Marcel Holtmann wrote: > Hi Arman, > >> We have a use case in which multiple LE connection attempts are made >> in rapid succession to a device that frequently rotates its private >> address. The connect call frequently fails with >> "org.bluez.Error.Failed: Device or resource is busy (16)". I tracked >> this down to a line in net/bluetooth/hci_conn.c:hci_connect_le that >> has the following comment before returning -EBUSY: >> >> /* Since the controller supports only one LE connection attempt at a >> * a time, we return -EBUSY if there is any connection attempt running. >> */ >> >> This is all fine and good, however the org.bluez.Error.Failed error >> name is too generic for an application to determine how to recover >> from this case and it would be nice if there was a way for an >> application to know that it should perhaps try connecting again later, >> especially without having to parse the error message string to make >> sense of things while there can be a clear error name. >> >> So, would it make sense for bluetoothd to return >> org.bluez.Error.InProgress or even org.bluez.Error.Busy here? Another >> potential solution I had in mind was to have bluetoothd actually queue >> LE connection attempts if one is already pending, or perhaps >> automatically schedule a retry if kernel returns EBUSY. >> >> What do you think is the best solution here? > > we should fix that directly in the kernel. Back in the days we had a similar issue with BR/EDR and some more "dumb" controllers. We queued the connection attempt in the kernel and issued it whenever the controller was free again. > > One other thing that we should be doing for "direct" LE connection attempts is to run them through the LE passive background scanning. That way we have the same connection logic handling the background scanning and the connection attempts. > > Mainly I am thinking that if we have a L2CAP connect() request, we just add it to the LE passive background scanning. The only difference would be that these can actually time out. Otherwise it should be exactly the same. > > Regards > > Marcel > OK, so what is the work that I would need to do here? I'm not super familiar with the kernel code, hence I'd appreciate some guidance. >From what you said, I gather two different work items: 1. Queue connection attempts and reissue when the controller becomes free. 2. Make use of background scanning. If we do #2 would we need #1 as well? I assume #2 will only be possible on later kernels, is there a solution that would easily backport to older kernels? Thanks, Arman