Received: by 2002:a25:1506:0:0:0:0:0 with SMTP id 6csp6429687ybv; Wed, 12 Feb 2020 12:04:10 -0800 (PST) X-Google-Smtp-Source: APXvYqxUX04xBVSlZY/NksuXViTrxEsAfo6YmFyof2OI/8y24uDxF4nGwydGQCpSI5p7b7qk8Lfb X-Received: by 2002:aca:f20b:: with SMTP id q11mr525699oih.78.1581537850124; Wed, 12 Feb 2020 12:04:10 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1581537850; cv=none; d=google.com; s=arc-20160816; b=rBXttftVpv7X6oq9KsSJW8Yf0x1tt08/Qp48l2+NOlPG0Td1ohZ6Xo1wQxPYe9x61m FQaxrBtySK3Yh1lBjbYgmNosPAgMAFbqY9kVmbVz4EJZBqkVV6S8O+ZV21B37saICEA9 E4e0ThO0k3+acuh0kq/6k4T1kNddUBrG4oH+mFYWOlU00ySGo7ku2Ay9tjlN2DplP+7Z diGTt6FAiU5+TVt36yL0cspgGAguZIHv3a6E5oiL65o3b/psaXXNyI2gjqHUL6nmC1ym uNXuvAC5ZjvnNy2jUNwoZ7R1NhUhdeDhO2cRwjoAiwhYanTGFn0tI8vN1KetUDeg2dDb Nv3g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:in-reply-to:content-disposition :mime-version:references:message-id:subject:cc:to:from:date; bh=N2O+MxoBbwQEfdL7Qasvem5f1mHDjIgAhkhR10rPyy0=; b=X8LBbuXALayP3CAXSYv/YZWTve5zaTQ5guc5cbkYC8Jh6wS33frxcGeqiKlfeI8aJ+ xRmukohFBsW+Mh+5na/cRQud2jdbwtG7W7OE2vtJdO1On95nmjlVWfxnpxIL5s1IrY8D nBCG9tz6hfuFPF+E43N4BX53/fklPltPhSrrH57ju0QBqFl/B/7N67b6J+JtibPO1rbp RzJerYA0O5dI+vjsDBIftGC0nlQQgvQJudtEdrqAWOpLSP5JhM/KaIlsi3+6Npqos+AN m4wpPfimxhMIQjjl9+LD1fWdfzPXJabegsmvRqk2EdGKfaZlMYHqW3cwo/eVgWK7bfOm L62Q== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f4si704959oto.169.2020.02.12.12.03.49; Wed, 12 Feb 2020 12:04:10 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1729039AbgBLUDl (ORCPT + 99 others); Wed, 12 Feb 2020 15:03:41 -0500 Received: from zeniv.linux.org.uk ([195.92.253.2]:43182 "EHLO ZenIV.linux.org.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727600AbgBLUDl (ORCPT ); Wed, 12 Feb 2020 15:03:41 -0500 Received: from viro by ZenIV.linux.org.uk with local (Exim 4.92.3 #3 (Red Hat Linux)) id 1j1yER-00BZg9-Au; Wed, 12 Feb 2020 20:03:35 +0000 Date: Wed, 12 Feb 2020 20:03:35 +0000 From: Al Viro To: Linus Torvalds Cc: "Eric W. Biederman" , LKML , Kernel Hardening , Linux API , Linux FS Devel , Linux Security Module , Akinobu Mita , Alexey Dobriyan , Andrew Morton , Andy Lutomirski , Daniel Micay , Djalal Harouni , "Dmitry V . Levin" , Greg Kroah-Hartman , Ingo Molnar , "J . Bruce Fields" , Jeff Layton , Jonathan Corbet , Kees Cook , Oleg Nesterov , Solar Designer Subject: Re: [PATCH v8 07/11] proc: flush task dcache entries from all procfs instances Message-ID: <20200212200335.GO23230@ZenIV.linux.org.uk> References: <20200210150519.538333-1-gladkov.alexey@gmail.com> <20200210150519.538333-8-gladkov.alexey@gmail.com> <87v9odlxbr.fsf@x220.int.ebiederm.org> <20200212144921.sykucj4mekcziicz@comp-core-i7-2640m-0182e6> <87tv3vkg1a.fsf@x220.int.ebiederm.org> <87v9obipk9.fsf@x220.int.ebiederm.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Feb 12, 2020 at 11:49:58AM -0800, Linus Torvalds wrote: > I wonder if we could split up d_invalidate(). It already ends up being > two phases: first the unhashing under the d_lock, and then the > recursive shrinking of parents and children. > > The recursive shrinking of the parent isn't actually interesting for > the proc shrinking case: we just looked up one child, after all. So we > only care about the d_walk of the children. > > So if we only did the first part under the RCU lock, and just > collected the dentries (can we perhaps then re-use the hash list to > collect them to another list?) and then did the child d_walk > afterwards? What's to prevent racing with fs shutdown while you are doing the second part? We could, after all, just have them[*] on procfs-private list (anchored in task_struct) from the very beginning; evict on ->d_prune(), walk the list on exit... How do you make sure the fs instance won't go away right under you while you are doing the real work? Suppose you are looking at one of those dentries and you've found something blocking to do. You can't pin that dentry; you can pin ->s_active on its superblock (if it's already zero, you can skip it - fs shutdown already in progress will take care of the damn thing), but that will lead to quite a bit of cacheline pingpong... [*] only /proc/ and /proc/*/task/ dentries, obviously.