Received: by 2002:a25:8b91:0:0:0:0:0 with SMTP id j17csp1070837ybl; Fri, 6 Dec 2019 10:46:28 -0800 (PST) X-Google-Smtp-Source: APXvYqwu0Q/kYeu/xd8uI+WDElCZEQAlYflxK/e6G+cJ+ichjLunA9aYwGNWsHp9do9+XhMG6xZr X-Received: by 2002:aca:4712:: with SMTP id u18mr11413247oia.93.1575657988134; Fri, 06 Dec 2019 10:46:28 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1575657988; cv=none; d=google.com; s=arc-20160816; b=yHpQAAIVVMr6RGQvmeOJTzsDFUIqEiabOueWVQNGxGvJP4vP309AzA+ILao4jUjlWX M7Kpp3t8Wsg4zYyiq8RP/PayIFrXlu+VZx2qlS2pEbp4JQ2FcmD2Pc0Ki/j++msOBt59 oGYDXxGNDg/Sz+O0vpI7opuBd/coT1N0l42uZ/uLVajEMA2vmzibgYdFyT3wVz9qn5mi ES/K/gwDUw2rxvUzNdgvdvo/Ys1rJkasYZtcJ44HzMx7rxL0R82oB+QdNUytJ3ltfiED EKuEoYx6JqTQzEkqiY/hxEjDGjlymbzHUmxCIBAnuvRJcRdMmxpy1IjC4D0RAhdDtD2G zgkg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:reply-to:message-id :subject:cc:to:from:date:dkim-signature; bh=ZL/3/F10aa3R8OkFJhNyHlCPBMLtplnO91iXEWxyolQ=; b=DbBXqCGlnCtv64HgisQ4VZuoW+qqfBD8Joj5CUbYSmEW8Arcm5XZbEa/GbW7Z17P9A Z12d497gMa67eBIqLP1Dq/6Q2acER+LPtICpUpBk1zv2fhUlmiLxjWHupy7+Qo4FUS6i w1S0+XV2xx4XkpPNDZQgkwNlbZvckEhOi4o6uS4fQgx+3FexXU1A9J58YX0HfAUKThGC 3Q/CVJL7mpo8mJYVaPMX19OGsWnMCX3idVh8NQZ3NuFWcCm0aP3OCV2Ww+LYaazYW3Fp K/a3pWdmI4OiC4LQWqLvF7NFVNnb0WC+kJQADeTiIVKGxkW1K3pv/7l2P/zk0nBrQVX2 flSA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=1wiQHNi3; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id y3si7649345ote.235.2019.12.06.10.46.09; Fri, 06 Dec 2019 10:46:28 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=1wiQHNi3; spf=pass (google.com: best guess record for domain of linux-nfs-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726370AbfLFSpz (ORCPT + 99 others); Fri, 6 Dec 2019 13:45:55 -0500 Received: from mail.kernel.org ([198.145.29.99]:35764 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726317AbfLFSpy (ORCPT ); Fri, 6 Dec 2019 13:45:54 -0500 Received: from paulmck-ThinkPad-P72.home (50-39-105-78.bvtn.or.frontiernet.net [50.39.105.78]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 3622024670; Fri, 6 Dec 2019 18:45:53 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1575657953; bh=uI9EKANieQp4/2Z6MB2ZAs2MMN7vumj93kZAcBBUEt0=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=1wiQHNi35PaEkG8c+sJQrPmvIkrOI/hWxm7imrIqmIQHlt1NZuKXhss6iRwGFU39D bpjt5/DslssYZVyMiM8JtFzAxn6dtHXeIMXZ0Ir9xeC9GRGBAfnSA1UijLPVHmAkBm g3CNDUixy30L8My/oZEi+y4wch3I/jeCaNGlu36g= Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 0E73635206AB; Fri, 6 Dec 2019 10:45:53 -0800 (PST) Date: Fri, 6 Dec 2019 10:45:53 -0800 From: "Paul E. McKenney" To: Trond Myklebust Cc: "linux-kernel-mentees@lists.linuxfoundation.org" , "madhuparnabhowmik04@gmail.com" , "linux-nfs@vger.kernel.org" , "rcu@vger.kernel.org" , "anna.schumaker@netapp.com" , "linux-kernel@vger.kernel.org" , "joel@joelfernandes.org" Subject: Re: [PATCH 2/2] fs: nfs: dir.c: Fix sparse error Message-ID: <20191206184553.GI2889@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20191206151640.10966-1-madhuparnabhowmik04@gmail.com> <20191206160238.GE2889@paulmck-ThinkPad-P72> <2ec21ec537144bb3c0d5fbdaf88ea022d07b7ff8.camel@hammerspace.com> <20191206182414.GH2889@paulmck-ThinkPad-P72> <127792d6811173921733542052f061a18991f441.camel@hammerspace.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <127792d6811173921733542052f061a18991f441.camel@hammerspace.com> User-Agent: Mutt/1.9.4 (2018-02-28) Sender: linux-nfs-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org On Fri, Dec 06, 2019 at 06:28:14PM +0000, Trond Myklebust wrote: > On Fri, 2019-12-06 at 10:24 -0800, Paul E. McKenney wrote: > > On Fri, Dec 06, 2019 at 05:52:10PM +0000, Trond Myklebust wrote: > > > Hi Paul, > > > > > > On Fri, 2019-12-06 at 08:02 -0800, Paul E. McKenney wrote: > > > > On Fri, Dec 06, 2019 at 08:46:40PM +0530, > > > > madhuparnabhowmik04@gmail.com wrote: > > > > > From: Madhuparna Bhowmik > > > > > > > > > > This patch fixes the following errors: > > > > > fs/nfs/dir.c:2353:14: error: incompatible types in comparison > > > > > expression (different address spaces): > > > > > fs/nfs/dir.c:2353:14: struct list_head [noderef] * > > > > > fs/nfs/dir.c:2353:14: struct list_head * > > > > > > > > > > caused due to directly accessing the prev pointer of > > > > > a RCU protected list. > > > > > Accessing the pointer using the macro list_prev_rcu() fixes > > > > > this > > > > > error. > > > > > > > > > > Signed-off-by: Madhuparna Bhowmik < > > > > > madhuparnabhowmik04@gmail.com> > > > > > --- > > > > > fs/nfs/dir.c | 2 +- > > > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > > > > > diff --git a/fs/nfs/dir.c b/fs/nfs/dir.c > > > > > index e180033e35cf..2035254cc283 100644 > > > > > --- a/fs/nfs/dir.c > > > > > +++ b/fs/nfs/dir.c > > > > > @@ -2350,7 +2350,7 @@ static int > > > > > nfs_access_get_cached_rcu(struct > > > > > inode *inode, const struct cred *cre > > > > > rcu_read_lock(); > > > > > if (nfsi->cache_validity & NFS_INO_INVALID_ACCESS) > > > > > goto out; > > > > > - lh = rcu_dereference(nfsi- > > > > > >access_cache_entry_lru.prev); > > > > > + lh = rcu_dereference(list_prev_rcu(&nfsi- > > > > > > access_cache_entry_lru)); > > > > > > > > And as noted in the earlier email, what is preventing concurrent > > > > insertions into and deletions from this list? > > > > > > > > o This use of list_move_tail() is OK because it does not poison. > > > > Though it isn't being all that friendly to lockless access to > > > > ->prev -- no WRITE_ONCE() in list_move_tail(). > > > > > > > > o The use of list_add_tail() is not safe with RCU readers, though > > > > they do at least partially compensate via use of smp_wmb() > > > > in nfs_access_add_cache() before calling > > > > nfs_access_add_rbtree(). > > > > > > > > o The list_del() near the end of nfs_access_add_rbtree() will > > > > poison the ->prev pointer. I don't see how this is safe given > > > > the > > > > possibility of a concurrent call to > > > > nfs_access_get_cached_rcu(). > > > > > > The pointer nfsi->access_cache_entry_lru is the head of the list, > > > so it > > > won't get poisoned. Furthermore, the objects it points to are freed > > > using kfree_rcu(), so they will survive as long as we hold the rcu > > > read > > > lock. The object's cred pointers also points to something that is > > > freed > > > in an rcu-safe manner. > > > > > > The problem here is rather that a racing list_del() can cause nfsi- > > > > access_cache_entry_lru to be empty, which is presumably why Neil > > > > added > > > that check plus the empty cred pointer check in the following line. > > > > > > The barrier semantics may be suspect, although the spin unlock > > > after > > > list_del() should presumably guarantee release semantics? > > > > Ah, OK, so you are only ever using ->prev only from the head of the > > list, > > and presumably never do list_del() on the head itself. (Don't laugh, > > this does really happen as a way to remove the entire list, though > > perhaps with list_del_init() rather than list_del().) > > Correct. > > > Maybe we should have a list_tail_rcu() that is only expected to work > > on the head of the list? > > That might be the best way to resolve this, yes. Madhuparna, would you be willing to do a patch series along these lines? Thanx, Paul