Date: Sat, 17 Jul 2021 20:26:45 +0800 (GMT+08:00)
From: "Xiyu Yang"
To: "Jeff Layton"
Cc: "Ilya Dryomov", ceph-devel@vger.kernel.org, linux-kernel@vger.kernel.org, yuanxzhang@fudan.edu.cn, "Xin Tan", "Yejune Deng"
Subject: Re: Re: [PATCH] ceph: Convert from atomic_t to refcount_t on ceph_snap_realm->nref
In-Reply-To: <2e5088eb8df4db5ea4d9c9f2862fc19c3bf186d4.camel@kernel.org>
References: <1626516381-40440-1-git-send-email-xiyuyang19@fudan.edu.cn> <2e5088eb8df4db5ea4d9c9f2862fc19c3bf186d4.camel@kernel.org>
Message-ID: <702aa2de.980b.17ab46ee99b.Coremail.xiyuyang19@fudan.edu.cn>
X-Mailing-List: linux-kernel@vger.kernel.org

Thank you for pointing out the problem in the patch. I could not find a
single refcount API that works like atomic_inc_return, so I used two APIs
to play a similar role and overlooked the potential race. Do you have a
better suggestion for this refcount type conversion?

> -----Original Messages-----
> From: "Jeff Layton"
> Sent Time: 2021-07-17 19:21:40 (Saturday)
> To: "Xiyu Yang", "Ilya Dryomov", ceph-devel@vger.kernel.org, linux-kernel@vger.kernel.org
> Cc: yuanxzhang@fudan.edu.cn, "Xin Tan", "Yejune Deng"
> Subject: Re: [PATCH] ceph: Convert from atomic_t to refcount_t on ceph_snap_realm->nref
>
> On Sat, 2021-07-17 at 18:06 +0800, Xiyu Yang wrote:
> > refcount_t type and corresponding API can protect refcounters from
> > accidental underflow and overflow and further use-after-free situations.
> >
> > Signed-off-by: Xiyu Yang
> > Signed-off-by: Xin Tan
> > ---
> >  fs/ceph/snap.c  | 15 ++++++++-------
> >  fs/ceph/super.h |  3 ++-
> >  2 files changed, 10 insertions(+), 8 deletions(-)
> >
> > diff --git a/fs/ceph/snap.c b/fs/ceph/snap.c
> > index 4ac0606dcbd4..d4ec9c5118bd 100644
> > --- a/fs/ceph/snap.c
> > +++ b/fs/ceph/snap.c
> > @@ -68,14 +68,15 @@ void ceph_get_snap_realm(struct ceph_mds_client *mdsc,
> >          lockdep_assert_held(&mdsc->snap_rwsem);
> >
> >          dout("get_realm %p %d -> %d\n", realm,
> > -             atomic_read(&realm->nref), atomic_read(&realm->nref)+1);
> > +             refcount_read(&realm->nref), refcount_read(&realm->nref)+1);
> >          /*
> >           * since we _only_ increment realm refs or empty the empty
> >           * list with snap_rwsem held, adjusting the empty list here is
> >           * safe. we do need to protect against concurrent empty list
> >           * additions, however.
> >           */
> > -        if (atomic_inc_return(&realm->nref) == 1) {
> > +        refcount_inc(&realm->nref);
> > +        if (refcount_read(&realm->nref) == 1) {
>
> The above is potentially racy as you've turned a single atomic operation
> into two. Another task could come in and increment or decrement
> realm->nref just after your refcount_inc but before the refcount_read,
> and then the read would show the wrong result.
>
> FWIW, Yejune Deng (cc'ed) proposed a very similar patch a few months ago
> that caused this regression:
>
> https://tracker.ceph.com/issues/50281
>
> >                  spin_lock(&mdsc->snap_empty_lock);
> >                  list_del_init(&realm->empty_item);
> >                  spin_unlock(&mdsc->snap_empty_lock);
> > @@ -121,7 +122,7 @@ static struct ceph_snap_realm *ceph_create_snap_realm(
> >          if (!realm)
> >                  return ERR_PTR(-ENOMEM);
> >
> > -        atomic_set(&realm->nref, 1);    /* for caller */
> > +        refcount_set(&realm->nref, 1);    /* for caller */
> >          realm->ino = ino;
> >          INIT_LIST_HEAD(&realm->children);
> >          INIT_LIST_HEAD(&realm->child_item);
> > @@ -209,8 +210,8 @@ static void __put_snap_realm(struct ceph_mds_client *mdsc,
> >          lockdep_assert_held_write(&mdsc->snap_rwsem);
> >
> >          dout("__put_snap_realm %llx %p %d -> %d\n", realm->ino, realm,
> > -             atomic_read(&realm->nref), atomic_read(&realm->nref)-1);
> > -        if (atomic_dec_and_test(&realm->nref))
> > +             refcount_read(&realm->nref), refcount_read(&realm->nref)-1);
> > +        if (refcount_dec_and_test(&realm->nref))
> >                  __destroy_snap_realm(mdsc, realm);
> >  }
> >
> > @@ -221,8 +222,8 @@ void ceph_put_snap_realm(struct ceph_mds_client *mdsc,
> >                           struct ceph_snap_realm *realm)
> >  {
> >          dout("put_snap_realm %llx %p %d -> %d\n", realm->ino, realm,
> > -             atomic_read(&realm->nref), atomic_read(&realm->nref)-1);
> > -        if (!atomic_dec_and_test(&realm->nref))
> > +             refcount_read(&realm->nref), refcount_read(&realm->nref)-1);
> > +        if (!refcount_dec_and_test(&realm->nref))
> >                  return;
> >
> >          if (down_write_trylock(&mdsc->snap_rwsem)) {
> > diff --git a/fs/ceph/super.h b/fs/ceph/super.h
> > index 6b6332a5c113..3abb00d7a0eb 100644
> > --- a/fs/ceph/super.h
> > +++ b/fs/ceph/super.h
> > @@ -2,6 +2,7 @@
> >  #ifndef _FS_CEPH_SUPER_H
> >  #define _FS_CEPH_SUPER_H
> >
> > +#include
> >  #include
> >
> >  #include
> > @@ -859,7 +860,7 @@ struct ceph_readdir_cache_control {
> >  struct ceph_snap_realm {
> >          u64 ino;
> >          struct inode *inode;
> > -        atomic_t nref;
> > +        refcount_t nref;
> >          struct rb_node node;
> >
> >          u64 created, seq;
>
> --
> Jeff Layton
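
The race described in the review can be replayed deterministically in
userspace. What follows is only a minimal sketch, assuming C11 atomics as a
stand-in for the kernel's atomic_t and refcount_t; it is not kernel code.
The names get_ref_single_op, get_ref_two_ops, other_task_get and demo_race
are hypothetical, and the callback merely marks the window in which another
task could run between the increment and the read.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-in for another task touching the same counter. */
typedef void (*interloper_fn)(atomic_int *nref);

/*
 * One atomic read-modify-write, in the spirit of atomic_inc_return():
 * the returned value is exactly the value this call produced.
 */
static int get_ref_single_op(atomic_int *nref)
{
	return atomic_fetch_add(nref, 1) + 1;
}

/*
 * Two separate operations, in the spirit of refcount_inc() followed by
 * refcount_read(). "between" runs in the gap where another task may
 * change the counter.
 */
static int get_ref_two_ops(atomic_int *nref, interloper_fn between)
{
	atomic_fetch_add(nref, 1);
	if (between)
		between(nref);
	return atomic_load(nref);
}

/* The simulated concurrent task takes one more reference. */
static void other_task_get(atomic_int *nref)
{
	atomic_fetch_add(nref, 1);
}

/* Replays the interleaving with a counter that starts at 0. */
static void demo_race(void)
{
	atomic_int nref = 0;

	/* Single operation: the caller sees its own 0 -> 1 transition. */
	assert(get_ref_single_op(&nref) == 1);

	/* Two operations: the 0 -> 1 transition happened, but the read
	 * observes 2, so an "== 1" check would be skipped. */
	atomic_store(&nref, 0);
	assert(get_ref_two_ops(&nref, other_task_get) == 2);
}
```

Calling demo_race() from a test program exercises both paths. In the
two-step variant the caller did move the counter from 0 to 1, yet a check
against 1 fails, which in the patched function would mean the empty-list
cleanup is skipped.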