Date: Tue, 2 Jun 2020 22:07:56 +0200
From: Greg Kroah-Hartman
To: James Bottomley
Cc: Matthew Wilcox, Wang Hai, cl@linux.com, penberg@kernel.org,
 rientjes@google.com, iamjoonsoo.kim@lge.com, akpm@linux-foundation.org,
 khlebnikov@yandex-team.ru, linux-mm@kvack.org,
 linux-kernel@vger.kernel.org
Subject: Re: kobject_init_and_add is easy to misuse
Message-ID: <20200602200756.GA3933938@kroah.com>
References: <20200602115033.1054-1-wanghai38@huawei.com>
 <20200602121035.GL19604@bombadil.infradead.org>
 <1591111514.4253.32.camel@HansenPartnership.com>
 <20200602173603.GB3579519@kroah.com>
 <1591127656.16819.7.camel@HansenPartnership.com>
In-Reply-To: <1591127656.16819.7.camel@HansenPartnership.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Jun 02, 2020 at 12:54:16PM -0700, James Bottomley wrote:
> On Tue, 2020-06-02 at 19:36 +0200, Greg Kroah-Hartman wrote:
> > On Tue, Jun 02, 2020 at 08:25:14AM -0700, James Bottomley wrote:
> > > On Tue, 2020-06-02 at 05:10 -0700, Matthew Wilcox wrote:
> > > > On Tue, Jun 02, 2020 at 07:50:33PM +0800, Wang Hai wrote:
> > > > > syzkaller reports for memory leak when kobject_init_and_add()
> > > > > returns an error in the function sysfs_slab_add() [1]
> > > > >
> > > > > When this happened, the function kobject_put() is not called
> > > > > for the corresponding kobject, which potentially leads to
> > > > > memory leak.
> > > > >
> > > > > This patch fixes the issue by calling kobject_put() even if
> > > > > kobject_init_and_add() fails.
> > > >
> > > > I think this speaks to a deeper problem with kobject_init_and_add()
> > > > -- the need to call kobject_put() if it fails is not readily
> > > > apparent to most users. This same bug appears in the first three
> > > > users of kobject_init_and_add() that I checked --
> > > > arch/ia64/kernel/topology.c
> > > > drivers/firmware/dmi-sysfs.c
> > > > drivers/firmware/efi/esrt.c
> > > > drivers/scsi/iscsi_boot_sysfs.c
> > > >
> > > > Some do get it right --
> > > > arch/powerpc/kernel/cacheinfo.c
> > > > drivers/gpu/drm/ttm/ttm_bo.c
> > > > drivers/gpu/drm/ttm/ttm_memory.c
> > > > drivers/infiniband/hw/mlx4/sysfs.c
> > > >
> > > > I'd argue that the current behaviour is wrong,
> > >
> > > Absolutely agree with this.
> > > We have a big meta pattern here where
> > > we introduce functions with tortuous semantics then someone creates
> > > a checker for the semantics and misuses come crawling out of the
> > > woodwork leading to floods of patches, usually for little or never
> > > used error paths, which really don't buy anything apart from
> > > theoretical correctness. Just insisting on simple semantics would
> > > have avoided this.
> >
> > I "introduced" this way back at the end of 2007. It's not exactly a
> > new function.
>
> Heh, well, if it never fails, how you handle the failure become
> unimportant semantics ...
>
> > > > that kobject_init_and_add() should call kobject_put() if the add
> > > > fails. This would need a tree-wide audit. But somebody needs to
> > > > do that anyway because based on my random sampling, half of the
> > > > users currently get it wrong.
> > >
> > > Well, the semantics of kobject_init() are free on fail, so these
> > > are the ones everyone seems to be using. The semantics of
> > > kobject_add are put on fail. The problem is that put on fail isn't
> > > necessarily correct in the kobject_init() case: the release
> > > function may make assumptions about the object hierarchy which
> > > aren't satisfied in the kobject_init() failure case. This argues
> > > that kobject_init_and_add() can't ever have correct semantics and
> > > we should eliminate it.
> >
> > At the time, it did reduce common functionality and error handling
> > all into a simpler function. And, given it's history, it must have
> > somehow worked for the past 12 years or so :)
>
> Well, like I said, as long as it never fails, no problem.
>
> It was just Matthew saying "couldn't we make it do kobject_put()
> itself?" that got me thinking that perhaps that wouldn't work with all
> cases. So now we're discussing failure handling, we're into the
> esoteric rabbit hole case that never happens.
>
> > Odds are, lots of the callers shouldn't be messing around with
> > kobjects in the first place. Originally it was only assumed that
> > there would be very few users. But it has spread to filesystems and
> > firmware subsystems. Drivers should never use it though, so it's a
> > good hint something is wrong there...
> >
> > Anyway, patches to fix this up to make a "sane" api for kobjects is
> > always appreciated. Personally I don't have the time at the moment.
>
> I think the only way we can make the failure semantics consistent is to
> have the kobject_init() ones (so kfree on failure). That means for the
> add part, the function would have to unwind everything it did from init
> on so kfree() is still an option. If people agree, then I can produce
> the patch ... it's just the current drive to transform everyone who's
> doing kfree() into kobject_put() would become wrong ...

Everyone should be putting their kfree into the kobject release anyway,
right?

Anyway, let's see your patch before I start to object further :)

thanks,

greg k-h
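
For reference, the rule discussed in this thread (the caller must call
kobject_put() when kobject_init_and_add() fails, and the free belongs in
the release function) can be illustrated with a small userspace model.
This is only a sketch and not kernel code: toy_kobject,
toy_kobject_init_and_add(), toy_kobject_put(), toy_slab_add() and the
live_objects counter are hypothetical stand-ins invented for
illustration, and the model assumes that the release function frees the
containing object.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical userspace stand-in for struct kobject: a refcount and a release hook. */
struct toy_kobject {
    int refcount;
    void (*release)(struct toy_kobject *kobj);
};

/* Hypothetical container; kobj is the first member so a cast recovers it. */
struct toy_slab {
    struct toy_kobject kobj;
};

/* Number of toy_slab objects allocated and not yet released. */
static int live_objects;

/* The free lives in the release function, as the thread recommends. */
static void toy_slab_release(struct toy_kobject *kobj)
{
    free((struct toy_slab *)kobj);
    live_objects--;
}

/* Drop one reference; the last put runs the release function. */
static void toy_kobject_put(struct toy_kobject *kobj)
{
    assert(kobj->refcount > 0);
    if (--kobj->refcount == 0)
        kobj->release(kobj);
}

/*
 * Models init followed by add. The init step always takes the initial
 * reference, so the caller still owns it when the add step fails.
 */
static int toy_kobject_init_and_add(struct toy_kobject *kobj,
                                    void (*release)(struct toy_kobject *),
                                    int fail_add)
{
    kobj->refcount = 1;
    kobj->release = release;
    return fail_add ? -1 : 0;
}

/*
 * Returns 0 on success and -1 on failure. use_put selects whether the
 * error path drops the reference (correct) or just returns (leaks).
 */
static int toy_slab_add(int fail_add, int use_put)
{
    struct toy_slab *s = calloc(1, sizeof(*s));
    int err;

    if (!s)
        return -1;
    live_objects++;

    err = toy_kobject_init_and_add(&s->kobj, toy_slab_release, fail_add);
    if (err) {
        if (use_put)
            toy_kobject_put(&s->kobj); /* release frees the object */
        return err;
    }

    toy_kobject_put(&s->kobj); /* immediate teardown, only for this demo */
    return 0;
}
```

Calling toy_slab_add(1, 0) leaves live_objects incremented, which
corresponds to the leak syzkaller reported, while toy_slab_add(1, 1)
returns the same error with the count unchanged. The model does not
capture James's objection that a release function may assume state that
only exists after a successful add.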