Date: Sun, 2 Jun 2013 23:46:33 -0400 (EDT)
From: Joe Lawrence
To: Rusty Russell
cc: Ben Greear, Linux Kernel Mailing List, stable@kenel.org, Joe Lawrence
Subject: Re: Please add to stable: module: don't unlink the module until
 we've removed all exposure.
In-Reply-To: <87ehclumhr.fsf@rustcorp.com.au>
References: <51A8E884.1080009@candelatech.com> <87ehclumhr.fsf@rustcorp.com.au>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, 2 Jun 2013, Rusty Russell wrote:

> Ben Greear writes:
>
> > It turns out, the bug I spent yesterday chasing in various 3.9 kernels is apparently
> > fixed by the commit in the title (c9c390bb5535380d40614571894ef0c00bc026ff).
>
> Apparently being the operative word.
>
> This commit avoids the entire "module insert failed due to sysfs race"
> path in the common case, it doesn't fix any actual problem.
>
> I think the real commit you want is Linus' kobject fix
> a49b7e82cab0f9b41f483359be83f44fbb6b4979 "kobject: fix kset_find_obj()
> race with concurrent last kobject_put()".
>
> Or is that already in stable?

Hi Rusty,

I had pointed Ben (offlist) to that bugzilla entry without realizing
there were other earlier related fixes in this space.
Reviewing bz-58011, it looks like it was opened against 3.8.12, while
Ben and I had encountered module loading problems in versions 3.9 and
3.9.[1-3]. I can update the bugzilla entry to add a comment noting
commit a49b7e82 "kobject: fix kset_find_obj() race with concurrent last
kobject_put()".

That said, it doesn't appear that commit 944a1fa "module: don't unlink
the module until we've removed all exposure" has made it into any stable
kernel. On my system, applying this on top of 3.9 resolved a module
unload/load race that would occasionally occur on boot (two video
adapters of the same make; the module unloads for whatever reason, and I
see "module is already loaded" and "sysfs: cannot create duplicate
filename '/module/mgag200'" messages in about 5-10% of instances).

I have logs if you are interested in these warnings/crashes.

Hope this clarifies things.

Regards,

-- Joe