Received: by 2002:a05:6a10:9afc:0:0:0:0 with SMTP id t28csp693649pxm; Fri, 25 Feb 2022 17:27:30 -0800 (PST) X-Google-Smtp-Source: ABdhPJzvEjbN//2py1qlcpBJ1EJvp1ZATN9Z1yNYXT0IKsvc8uEXl86w3RJDB6GDuaULm8P41Sph X-Received: by 2002:a17:90b:618:b0:1bc:2e76:6385 with SMTP id gb24-20020a17090b061800b001bc2e766385mr5824392pjb.1.1645838850188; Fri, 25 Feb 2022 17:27:30 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645838850; cv=none; d=google.com; s=arc-20160816; b=uHzqX3ClPZ8jSKNCfh4w2RMa3w6wTPLUdGaUTkXqeN2CoYtWb7NjSeQcT8QV6H927R cHYbEG9lrgobTWmhyCczJXwbDn4As0a4V5CfYxg0pc3kCvtz0qymMSn2+0G8o9FZ6vH5 /pHkRoVUiZR75NW9juYluGDS/6atGhWSmnv1rCLtjm0XUAuyhaFZn9j3VfILmZPopQjx tZLMA7XoFD5zQyzhhPWIlB6vAetrX5lzpHmq3d0GuLNrLZGcV8agk3GcwQFfNfl2O+2N CZglW05rXaHsexiagZApcctUgncvKIn8BGQx06xhnCzJkOGMo3dtQWPSujdxN1s7Bzbi RgoQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:to:from:dkim-signature; bh=uOdD4frxUAvUDJFGzbcmJZedKAkfXvjQTd9WYy25Azk=; b=afgxdzk1/OLsgbGsg8+yx6XJYYioQhz1cM0gBY/Vp9naDiElAEpolSQ9f0CwQ1Cie6 qXuzLQsurEu6nMy4Q/yUeRH0nggeim+Wwpqs+LFnskKcHNdT786plbSgpb76+dIi6+Kg fYypNP2W7vQQ+ehcqJ3TPjPXWqIAUkTU/ti/htHOMADBmRL9GXJLQQRzX7ag8hLCkSQ/ Cnr324rVJz9rPCuLqIIki6+0FdR7dc6AHio4S2mWCP4jJPrT1q1lQ/fqBZe8hey8UHOJ ZicDF+8AJnCshni1Ke5D7f1JWEL4B4US69ySinxG9ExVp159MPpzRJkwsy7AO4tfGoDW f4gg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=L6YrEWIk; spf=softfail (google.com: domain of transitioning linux-nfs-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id s3-20020a056a00178300b004e15a894614si3808511pfg.71.2022.02.25.17.27.29 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 25 Feb 2022 17:27:30 -0800 (PST) Received-SPF: softfail (google.com: domain of transitioning linux-nfs-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=L6YrEWIk; spf=softfail (google.com: domain of transitioning linux-nfs-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-nfs-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id D9DAE23B4E7; Fri, 25 Feb 2022 17:25:55 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233346AbiBYSfQ (ORCPT + 99 others); Fri, 25 Feb 2022 13:35:16 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:46822 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S232563AbiBYSfL (ORCPT ); Fri, 25 Feb 2022 13:35:11 -0500 Received: from ams.source.kernel.org (ams.source.kernel.org [IPv6:2604:1380:4601:e00::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 5DE6B1A8CAE for ; Fri, 25 Feb 2022 10:34:38 -0800 (PST) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 1CB7FB83309 for ; Fri, 25 Feb 2022 18:34:37 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 89CEDC340E7 for ; Fri, 25 Feb 2022 18:34:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1645814075; bh=su0PeaSdz3w9X2ycxq4oIIY+BQJ8E4SO1hQzXys1vrM=; h=From:To:Subject:Date:From; b=L6YrEWIkcAtD1mWgusaPU2xKQFjYVvBH5opFhICUDq0EFlFtEznf7Oi5AAhk9pC+v nQqgYqU9WplLvQNM8bRLcnqxfjLq8sZrWKcViwwYy9Sy9KZCuFV9kSxoZjl5rKCnyD iAjIROztfGO8IBBR2FoVR4aFsRdmHHvyBbbVQWIz7XySlMR0H3p5yEDJdVz0XFiSf7 tVgu+fHI4NjI21tCD4F2TM+AX4FLGAmaDVssRxJ3uQDlQ+Xt1QcOKgJAE6zeezYo3c vwU0yRxKGHIjXQVvuzIGXu+nvKiUIPJfJ69klcMnqAC8cGjnPnp3xtyNYNY1GJ0t2T ncLS9Xm1CTkUg== From: trondmy@kernel.org To: linux-nfs@vger.kernel.org Subject: [PATCH v8 00/24] Readdir improvements Date: Fri, 25 Feb 2022 13:28:05 -0500 Message-Id: <20220225182829.1236093-1-trondmy@kernel.org> X-Mailer: git-send-email 2.35.1 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-nfs@vger.kernel.org From: Trond Myklebust The current NFS readdir code will always try to maximise the amount of readahead it performs on the assumption that we can cache anything that isn't immediately read by the process. There are several cases where this assumption breaks down, including when the 'ls -l' heuristic kicks in to try to force use of readdirplus as a batch replacement for lookup/getattr. This series also implement Ben's page cache filter to ensure that we can improve the ability to share cached data between processes that are reading the same directory at the same time, and to avoid live-locks when the directory is simultaneously changing. -- v2: Remove reset of dtsize when NFS_INO_FORCE_READDIR is set v3: Avoid excessive window shrinking in uncached_readdir case v4: Track 'ls -l' cache hit/miss statistics Improved algorithm for falling back to uncached readdir Skip readdirplus when files are being written to v5: bugfixes Skip readdirplus when the acdirmax/acregmax values are low Request a full XDR buffer when doing READDIRPLUS v6: Add tracing Don't have lookup request readdirplus when it won't help v7: Implement Ben's page cache filter Reduce the use of uncached readdir Change indexing of the page cache to improve seekdir() performance. v8: Reduce the page cache overhead by shrinking the cookie hashvalue size Incorporate other feedback from Anna, Ben and Neil Fix nfs2/3_decode_dirent return values Fix the change attribute value set in nfs_readdir_page_get_next() Trond Myklebust (24): NFS: Return valid errors from nfs2/3_decode_dirent() NFS: constify nfs_server_capable() and nfs_have_writebacks() NFS: Trace lookup revalidation failure NFS: Initialise the readdir verifier as best we can in nfs_opendir() NFS: Use kzalloc() to avoid initialising the nfs_open_dir_context NFS: Calculate page offsets algorithmically NFS: Store the change attribute in the directory page cache NFS: If the cookie verifier changes, we must invalidate the page cache NFS: Don't re-read the entire page cache to find the next cookie NFS: Adjust the amount of readahead performed by NFS readdir NFS: Simplify nfs_readdir_xdr_to_array() NFS: Reduce use of uncached readdir NFS: Improve heuristic for readdirplus NFS: Don't ask for readdirplus unless it can help nfs_getattr() NFSv4: Ask for a full XDR buffer of readdir goodness NFS: Readdirplus can't help lookup for case insensitive filesystems NFS: Don't request readdirplus when revalidation was forced NFS: Add basic readdir tracing NFS: Trace effects of readdirplus on the dcache NFS: Trace effects of the readdirplus heuristic NFS: Convert readdir page cache to use a cookie based index NFS: Fix up forced readdirplus NFS: Remove unnecessary cache invalidations for directories NFS: Cache all entries in the readdirplus reply fs/nfs/Kconfig | 4 + fs/nfs/dir.c | 513 ++++++++++++++++++++++++----------------- fs/nfs/inode.c | 46 ++-- fs/nfs/internal.h | 4 +- fs/nfs/nfs2xdr.c | 2 +- fs/nfs/nfs3xdr.c | 28 +-- fs/nfs/nfs4proc.c | 2 - fs/nfs/nfs4xdr.c | 6 +- fs/nfs/nfstrace.h | 123 +++++++++- include/linux/nfs_fs.h | 19 +- 10 files changed, 479 insertions(+), 268 deletions(-) -- 2.35.1