Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp747840pxf; Thu, 1 Apr 2021 12:29:19 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw1KBfmy69Lhm2Fv2cOE8+wfLqr/bvWTxZUrajVgkls5j4PzgfZReTg4uZDWZBEOG+LOhfk X-Received: by 2002:a05:6e02:df2:: with SMTP id m18mr7646444ilj.123.1617305359270; Thu, 01 Apr 2021 12:29:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1617305359; cv=none; d=google.com; s=arc-20160816; b=EPYKiat98JyvZh0/Eg5PKkfuLKMgkN4ISx4alMDyKUAfjljzb/SO3VW0MIhiicpE3I wWAOqKn4wJAPh0pQYGSAlwxcKhb4rqMze2XFkgmIfE0qGI/xtfI3zlrnqQ7AnoNE51Ie RrIRtozrjcNBZEPwyTQCjUldzzFeMpsvAbQkaTBOjrVZPGM/6aCeCG6MdbEgCfBHNkvh Y5hVjNakmCRdRuBCL0J0fYlEeJd2mFe6jYaWIoOrIVUS9kKSFlF8UXik/pX3j9auxH0A HJUa12SmYKiWD+swYYZf8eGekWSAJv0xq7/06G1jqNh1o2HDteIPrnTT8PGEb9S6P8xL OEpQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=tEi2V0Ou1gNpLL+kWNJ4yQdRkOmCHQhrJ/c+95y/Exs=; b=CY3Gzmnd3OvRPbpg21hWtyAgLAKIPxRRT6G0SKeVM2uPv50/qWxl05wSdkWgmFa6mJ fppmMmR2mW/axTYpecWMMFWJuQkC023FU1Ha3FRmyoA2aSbxR/nRLn426vlDit2w37lc PQVNgsRN/N/Tt6BEx7ttW7HDFCP/dPYMijnFPRptwK+I4lFPZ7t+p5E7wON33kiLkn+2 9cqqobOMdqvORkRInOs8nLM3qd2+Fg35fUuynWAovietzSitRw4SnVq+OEHi5uvqab/p 2ckkT4wOHSIamcXe+MqfNxiCNdG51x7Z0fTKmMmpKRu9P2kV/PG6OHBuiwrjg0CyZS6A 86bw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id e12si5450682ilq.94.2021.04.01.12.29.05; Thu, 01 Apr 2021 12:29:19 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235220AbhDAT2Q (ORCPT + 99 others); Thu, 1 Apr 2021 15:28:16 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52366 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S235032AbhDAT2K (ORCPT ); Thu, 1 Apr 2021 15:28:10 -0400 Received: from zeniv-ca.linux.org.uk (zeniv-ca.linux.org.uk [IPv6:2607:5300:60:148a::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6BB39C05BD24; Thu, 1 Apr 2021 12:11:17 -0700 (PDT) Received: from viro by zeniv-ca.linux.org.uk with local (Exim 4.94 #2 (Red Hat Linux)) id 1lS2im-001iJP-Kk; Thu, 01 Apr 2021 19:11:12 +0000 Date: Thu, 1 Apr 2021 19:11:12 +0000 From: Al Viro To: Christian Brauner Cc: Jens Axboe , syzbot , linux-fsdevel@vger.kernel.org, linux-kernel@vger.kernel.org, syzkaller-bugs@googlegroups.com, io-uring@vger.kernel.org Subject: Re: [syzbot] WARNING in mntput_no_expire (2) Message-ID: References: <0000000000003a565e05bee596f2@google.com> <20210401154515.k24qdd2lzhtneu47@wittgenstein> <90e7e339-eaec-adb2-cfed-6dc058a117a3@kernel.dk> <20210401174613.vymhhrfsemypougv@wittgenstein> <20210401175919.jpiylhfrlb4xb67u@wittgenstein> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20210401175919.jpiylhfrlb4xb67u@wittgenstein> Sender: Al Viro Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Apr 01, 2021 at 07:59:19PM +0200, Christian Brauner wrote: > I _think_ I see what the issue is. It seems that an assumption made in > this commit might be wrong and we're missing a mnt_add_count() bump that > we would otherwise have gotten if we've moved the failure handling into > the unlazy helpers themselves. > > Al, does that sound plausible? mnt_add_count() on _what_? Failure in legitimize_links() ends up with nd->path.mnt zeroed, in both callers. So which vfsmount would be affected? Rules: in RCU mode: no mounts pinned out of RCU mode: nd->path.mnt and all nd->stack[i].link.mnt for i below nd->depth are either NULL or pinned Transition from RCU to non-RCU mode happens in try_to_unlazy() and try_to_unlazy_next(). References (if any) are dropped by eventual terminate_walk() (after that the contents of nameidata is junk). __legitimize_mnt() is the primitive for pinning. Return values: 0 -- successfully pinned (or given NULL as an argument) 1 -- failed, refcount not affected -1 -- failed, refcount bumped. It stays in RCU mode in all cases. One user is __legitimize_path(); it also stays in RCU mode. If it fails to legitimize path->mnt, it will zero it *IF* __legitimize_mnt() reports that refcount hadn't been taken. In all other cases, path->mnt is pinned. IOW, the caller is responsible for path_put() regardless of the outcome. Another user is legitimize_mnt(). _That_ will make sure that refcount is unaffected in case of failure (IOW, if __legitimize_mnt() reports failure with refcount bumped, we drop out of RCU mode, do mntput() and go back). On failure in legitimize_links() we either leave nd->depth equal to zero (in which case all nd->stack[...].link.mnt are to be ignored) or we set it one higher than the last attempted legitimize_path() in there. In the latter case, all entries in nd->stack below the value we put into nd->depth had legitimize_path() called (and thus have ->mnt either NULL or pinned) and everything starting from nd->depth is to be ignored. nd->path handling: 1) Callers of legitimize_links() are responsible for zeroing nd->path.mnt on legitimize_links() failure. Both do that, AFAICS. 2) in try_to_unlazy() we proceed to call legitimize_path() on nd->path. Once that call is done, we have nd->path.mnt pinned or NULL, so nothing further is needed with it. 3) in try_to_unlazy_next() we use legitimize_mnt() instead. Failure of that is handled by zeroing nd->path.mnt; success means that nd->path.mnt is pinned and should be left alone. We could use __legitimize_mnt() in try_to_unlazy_next() (basically, substitute the body of legitimize_mnt() there and massage it a bit), but that ends up being harder to follow: res = __legitimize_mnt(nd->path.mnt, nd->m_seq); if (unlikely(res)) { if (res < 0) // pinned, leave it there goto out1; else // not pinned, zero it goto out2; } instead of if (unlikely(!legitimize_mnt(nd->path.mnt, nd->m_seq))) goto out2; we have now.