Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp651492ybz; Wed, 15 Apr 2020 15:59:48 -0700 (PDT) X-Google-Smtp-Source: APiQypIGFz1f8eLpMWHmSSiPoXcPUZB1I5GuivjrGfySr5clIvyjZtrV4b0kB+UDt5843Wr6Hnfa X-Received: by 2002:a50:e043:: with SMTP id g3mr28098088edl.220.1586991588759; Wed, 15 Apr 2020 15:59:48 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1586991588; cv=none; d=google.com; s=arc-20160816; b=W9Ns9HKpoNobBHJiIol34XDcmR+EiZiE8Dhnl33TI9rXiZhc3EIi4B3SDeKbMrmSZF 1Zb3NG/SOgn1xypZakp4DztddQaMZSKYE7pyW8TqYbQeuBSlDSCuA6CxysXoXaH83c7v nWsQQ3OyLfEw5uTx+ZS9hIrAS58M/zLClucAY/Iw5LruZ69W4M8sfDJ6xQMeWZlbJrf4 EFS69aQEt7Ms7uEfS0/c2pEoFL3IoM9Y1j6k2IeXC6hYZuoN8rU4lZSBuqI91A8T6z4K S6WPOkE84PQ2QDzsF5/GJ+g30Aoc6/CopiXUl353Rr3mZyMi3YmBcyXsq+bZKNBI3Dr4 Ys+A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date :ironport-sdr:ironport-sdr; bh=11cOpD2icuG73VLitwl+KPE9SB95NFfy6WzLLWlj3MM=; b=aGFLui3ZSW0klXq55OfHlsvwOG7uxI2uv7/lFCbxyshrForAI+0/BFTNWWx3+JD36X 1I+YbPZZUVNDDlrrNAW/BJuwDJYXFGS5xuPQLv3+yzfo7y2e9JlsBrlW/cLtzqHhgmGB gc6OlC3O1DyPWqK0nhryETjF+Exw7slmHuuNKot0auYC5ABs2BqxASvi0ySJZ7uiBpmd Zyb/hU3pKFdlMDzth8Sl1QTNWnT2xqXfCjKdTNoQRALZMxvjsq1HSpPr7m2THZXiNCnJ k1JUYGuOERf+iuu8Mml7hN7UtDsH2/hBUO0y/IzhhPaGMU9EwyS85zyFQr4UPUO0Vnfr lRJQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id r21si10594191ejo.82.2020.04.15.15.59.25; Wed, 15 Apr 2020 15:59:48 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2896946AbgDOLZX (ORCPT + 99 others); Wed, 15 Apr 2020 07:25:23 -0400 Received: from mga01.intel.com ([192.55.52.88]:29224 "EHLO mga01.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2896922AbgDOLZK (ORCPT ); Wed, 15 Apr 2020 07:25:10 -0400 IronPort-SDR: eRDIGlJRQjiH0pnsIvDJ4ClQiiMJryjClAV+M7Soj5zQIeTp3mHQu9enQWRAt69CrGNDORa+5h MbqPXj46nfKA== X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga001.fm.intel.com ([10.253.24.23]) by fmsmga101.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 15 Apr 2020 04:25:08 -0700 IronPort-SDR: 4IyR2ipB0pYdd/yQryNiZ8IGlXDDKJiKdsxRxk1fS4mva48chEQCle75ccSjvdnecp+3ImSfB/ hLee6EGB5dzw== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.72,386,1580803200"; d="scan'208";a="363645932" Received: from kuha.fi.intel.com ([10.237.72.162]) by fmsmga001.fm.intel.com with SMTP; 15 Apr 2020 04:25:04 -0700 Received: by kuha.fi.intel.com (sSMTP sendmail emulation); Wed, 15 Apr 2020 14:25:03 +0300 Date: Wed, 15 Apr 2020 14:25:03 +0300 From: Heikki Krogerus To: Greg KH Cc: Brendan Higgins , rafael@kernel.org, linux-kernel@vger.kernel.org, naresh.kamboju@linaro.org, sakari.ailus@linux.intel.com, andy.shevchenko@gmail.com, hdegoede@redhat.com, rafael.j.wysocki@intel.com, linux-kselftest@vger.kernel.org, rostedt@goodmis.org, sergey.senozhatsky@gmail.com, andriy.shevchenko@linux.intel.com, shuah@kernel.org, anders.roxell@linaro.org, lkft-triage@lists.linaro.org, linux@rasmusvillemoes.dk Subject: Re: [PATCH v1] kobject: make sure parent is not released before children Message-ID: <20200415112503.GN2828150@kuha.fi.intel.com> References: <20200414204240.186377-1-brendanhiggins@google.com> <20200415061154.GA2496263@kroah.com> <20200415084653.GM2828150@kuha.fi.intel.com> <20200415092115.GA2565850@kroah.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20200415092115.GA2565850@kroah.com> Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Apr 15, 2020 at 11:21:15AM +0200, Greg KH wrote: > On Wed, Apr 15, 2020 at 11:46:53AM +0300, Heikki Krogerus wrote: > > Hi Greg, > > > > On Wed, Apr 15, 2020 at 08:11:54AM +0200, Greg KH wrote: > > > > diff --git a/lib/kobject.c b/lib/kobject.c > > > > index 83198cb37d8d..5921e2470b46 100644 > > > > --- a/lib/kobject.c > > > > +++ b/lib/kobject.c > > > > @@ -663,6 +663,7 @@ EXPORT_SYMBOL(kobject_get_unless_zero); > > > > */ > > > > static void kobject_cleanup(struct kobject *kobj) > > > > { > > > > + struct kobject *parent = kobj->parent; > > > > struct kobj_type *t = get_ktype(kobj); > > > > const char *name = kobj->name; > > > > > > > > @@ -680,6 +681,9 @@ static void kobject_cleanup(struct kobject *kobj) > > > > kobject_uevent(kobj, KOBJ_REMOVE); > > > > } > > > > > > > > + /* make sure the parent is not released before the (last) child */ > > > > + kobject_get(parent); > > > > + > > > > /* remove from sysfs if the caller did not do it */ > > > > if (kobj->state_in_sysfs) { > > > > pr_debug("kobject: '%s' (%p): auto cleanup kobject_del\n", > > > > @@ -693,6 +697,8 @@ static void kobject_cleanup(struct kobject *kobj) > > > > t->release(kobj); > > > > } > > > > > > > > + kobject_put(parent); > > > > + > > > > > > No, please don't do this. > > > > > > A child device should have always incremented the parent already if it > > > was correctly registered. We have had this patch been proposed multiple > > > times over the years, and every time it was, we said no and went and > > > fixed the real issue which was with the user of the interface. > > > > The parent ref count is incremented by the child, that is not the > > problem. The problem is that when that child is released, if it's the > > last child of the parent, and there are no other users for the parent, > > then the parent is actually released _before_ the child. And that > > happens in the above function kobject_cleanup(). > > > > We can work around the problem by taking a reference to the parent > > separately, but we have to do that everywhere separately (which I > > guess is exactly what has been done so far). That workaroud still does > > not really fix the core problem. The core problem is still that > > lib/kboject.c is allowing the parent kobject to be released before the > > child kobject, and that quite simply should not be allowed to happen. > > > > I don't have a problem if you want to have a better solution for this, > > but the solution really can't anymore be that we are always expected > > to separately increment the parent's ref count with every type of > > kobject. > > Why is this suddenly showing up as a issue and it hasn't ever before? > Because of that I would argue that the problem is not in the kobject > core, but the use of it. The problem has showed up before, and you pointed that out yourself in your previous mail. The thing is that we are going to continue to have this issue over and over again until the core issue is fixed. > When a child object is created, it should have already incremented the > parent reference count before it is registered, to ensure that the > parent does not go away before the child is finished being registered. > Then, when the child is removed, it decrements the reference count of > the parent as it "knows" it is done with that pointer. Please note that when you add a kobject, its parent ref count gets incremented (line 240 in lib/kobject.c). Are you saying here that that is actually not necessary, as the code that registers the new (child) kobject should _always_ be responsible of holding a reference to the parent? Because if that is the case, then we should actually cleanup lib/kboject.c and remove all parent reference handling. We obviously need to fix the documentation as well in this case. > So perhaps this new swnode code is just wrong? Dealing with "raw" > kobjects is hard, and not generally recommended due to issues like this. > When you use the driver core, all of this logic is already handled for > you... > > And I just looked at the code swnode_register() needs to increment the > parent kobject's reference, as it is saving a pointer to it away to be > used later. That's just basic reference-counted-pointer logic here, > please fix the issue there. That is fair, however, it still does not fix the core problem. Even if we did not need to save the pointer to the parent, we would still need to increment the parents reference count, because we have to prevent the parent from being released before us. There are two ways we can fix the situation: 1) We can make it clear that lib/kboject.c does not take any responsibility over the reference counting of the parents, and remove all the parent reference handling from it. 2) We fix this by guaranteeing that the parent kboject can not be released before its children. thanks, -- heikki