2007-07-04 22:23:47

by Andreas Dilger

[permalink] [raw]
Subject: [PATCH] speed up statfs

Hi,
this is a patch we are currently using that speeds up statfs.
It is very simple - the "overhead" calculation, which takes a
huge amount of time for large filesystems, never changes unless
the size of the filesystem itself changes. That means we can
store it in memory and only recalculate if the filesystem has
been resized (almost never).

It is based on ext3 but is trivally updated to ext4. It also
fixes a minor problem that we never update the on-disk superblock
free blocks/inodes counts until the filesystem is unmounted.
While not fatal, we may as well update that on disk when we have
the information, and it makes things like debugfs and dumpe2fs
report a bit more accurate info.

I'd be happy if someone could update this to the latest kernel and
for ext2 and ext4 also.


======================= ext3-statfs-2.6.20.diff ==========================
Index: linux-stage/include/linux/ext3_fs_sb.h
===================================================================
--- linux-stage.orig/include/linux/ext3_fs_sb.h 2007-03-22 17:29:30.000000000 -0600
+++ linux-stage/fs/include/ext3_fs_sb.h 2007-03-23 01:48:41.000000000 -0600
@@ -38,6 +38,8 @@ struct ext3_sb_info {
unsigned long s_gdb_count; /* Number of group descriptor blocks */
unsigned long s_desc_per_block; /* Number of group descriptors per block */
unsigned long s_groups_count; /* Number of groups in the fs */
+ unsigned long s_overhead_last; /* Last calculated overhead */
+ unsigned long s_blocks_last; /* Last seen block count */
struct buffer_head * s_sbh; /* Buffer containing the super block */
struct ext3_super_block * s_es; /* Pointer to the super block in the buffer */
struct buffer_head ** s_group_desc;
Index: linux-stage/fs/ext3/super.c
===================================================================
--- linux-stage.orig/fs/ext3/super.c 2007-03-22 17:29:30.000000000 -0600
+++ linux-stage/fs/ext3/super.c 2007-03-23 01:48:41.000000000 -0600
@@ -2389,19 +2389,19 @@ restore_opts:
struct super_block *sb = dentry->d_sb;
struct ext3_sb_info *sbi = EXT3_SB(sb);
struct ext3_super_block *es = sbi->s_es;
- ext3_fsblk_t overhead;
- int i;
u64 fsid;

- if (test_opt (sb, MINIX_DF))
- overhead = 0;
- else {
- unsigned long ngroups;
- ngroups = EXT3_SB(sb)->s_groups_count;
+ if (test_opt(sb, MINIX_DF)) {
+ sbi->s_overhead_last = 0;
+ } else if (sbi->s_blocks_last != le32_to_cpu(es->s_blocks_count)) {
+ unsigned long ngroups = sbi->s_groups_count, i;
+ ext3_fsblk_t overhead = 0;
smp_rmb();

/*
- * Compute the overhead (FS structures)
+ * Compute the overhead (FS structures). This is constant
+ * for a given filesystem unless the number of block groups
+ * changes so we cache the previous value until it does.
*/

/*
@@ -2419,18 +2422,23 @@ static int ext3_statfs (struct super_blo
* Every block group has an inode bitmap, a block
* bitmap, and an inode table.
*/
- overhead += (ngroups * (2 + EXT3_SB(sb)->s_itb_per_group));
+ overhead += ngroups * (2 + sbi->s_itb_per_group);
+ sbi->s_overhead_last = overhead;
+ smp_wmb();
+ sbi->s_blocks_last = le32_to_cpu(es->s_blocks_count);
}

buf->f_type = EXT3_SUPER_MAGIC;
buf->f_bsize = sb->s_blocksize;
- buf->f_blocks = le32_to_cpu(es->s_blocks_count) - overhead;
+ buf->f_blocks = le32_to_cpu(es->s_blocks_count) - sbi->s_overhead_last;
buf->f_bfree = percpu_counter_sum(&sbi->s_freeblocks_counter);
+ es->s_free_blocks_count = cpu_to_le32(buf->f_bfree);
buf->f_bavail = buf->f_bfree - le32_to_cpu(es->s_r_blocks_count);
if (buf->f_bfree < le32_to_cpu(es->s_r_blocks_count))
buf->f_bavail = 0;
buf->f_files = le32_to_cpu(es->s_inodes_count);
buf->f_ffree = percpu_counter_sum(&sbi->s_freeinodes_counter);
+ es->s_free_inodes_count = cpu_to_le32(buf->f_ffree);
buf->f_namelen = EXT3_NAME_LEN;
fsid = le64_to_cpup((void *)es->s_uuid) ^
le64_to_cpup((void *)es->s_uuid + sizeof(u64));
===========================================================================

Cheers, Andreas
--
Andreas Dilger
Principal Software Engineer
Cluster File Systems, Inc.


2007-07-05 03:49:32

by Badari Pulavarty

[permalink] [raw]
Subject: Re: [PATCH] speed up statfs

On Wed, 2007-07-04 at 16:23 -0600, Andreas Dilger wrote:
> Hi,
> this is a patch we are currently using that speeds up statfs.
> It is very simple - the "overhead" calculation, which takes a
> huge amount of time for large filesystems, never changes unless
> the size of the filesystem itself changes. That means we can
> store it in memory and only recalculate if the filesystem has
> been resized (almost never).
>
> It is based on ext3 but is trivally updated to ext4. It also
> fixes a minor problem that we never update the on-disk superblock
> free blocks/inodes counts until the filesystem is unmounted.
> While not fatal, we may as well update that on disk when we have
> the information, and it makes things like debugfs and dumpe2fs
> report a bit more accurate info.
>
> I'd be happy if someone could update this to the latest kernel and
> for ext2 and ext4 also.
>

I forward ported it to latest + did the same for ext2 and ext4 also.
Compiles fine. I will post them tomorrow after little testing.

Thanks,
Badaru