Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp3556853pxu; Tue, 15 Dec 2020 09:39:33 -0800 (PST) X-Google-Smtp-Source: ABdhPJwkdrDmb16HfsKI14PeGakodwqvSd1w9vf9Tvq9rBTfK+v5i0d7ah/S6EK5CDIDF1UiBGt0 X-Received: by 2002:a17:906:cec3:: with SMTP id si3mr6678297ejb.277.1608053972801; Tue, 15 Dec 2020 09:39:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1608053972; cv=none; d=google.com; s=arc-20160816; b=etMLOYCZca3dP66WBWL2SOKIex3M1LXZcZwuUJIJ0YWk0x5vXr1M87ty02UCTnkv9Q JNulNRaSflXtdB279QLsiWUF2FevA0Mz34c6mtFE6WK8U3nUwd47o5ZPfb8+2HPS2x54 iFP2+DOyM8/h+1d2kzYCzC9wgoLHp8kpojvb3Wzq2lbrZhKdRuqPLQpWoWxzw1PzK8OQ zSa6O8YBsaEXvU1oT+m0XGcOMJojAt5RCj3jtw3X6Mgb8MIfOgrUDzUB04I+5KV902Xf 9Vd1R/wXX2yddCXpD/34FNbqgJxpTEBclZCOcaGIn6rh4Wl9q4KgpGeNmSmqpuCUgIvC +NIQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :user-agent:message-id:in-reply-to:date:references:organization :subject:cc:to:from; bh=Jm1ra1wyLeS8JJc3M4Bhi057G4qyuSTdCD7gXqILdJ4=; b=SHyDBkEanhk+T3SKIC8Ss/BCnp5V2CeGP5dDD7I2AJTwaCZifNGLA9GEpj6e7aOF0p 0x2UAP2RiCbZ3pP81Q1Pi7zHtMEMZZPHY3iq+CyuNa/3ZqCl+psq118g5k++3P6SSrvo gyd7Ry5IDW4S9fSFA43XLxJH+ZI1XcGi/d0TJLTztEn+NE1+WmiM4O9m4ijz2IJKNvnN rTNXCuEuEo+1I3ccBGZfPN3K1RltEvo7Y4Ho3DrVEpWXlYogFxMn5pG556ZCu0rHaB5O ouqKXK9vUGsbnNWRFhu5IkeUFpB7+aGOGToZxgI2Oz9gk3j1inoqmnhfh1IQWYZ6hbG0 q/EA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id t6si1200815edq.353.2020.12.15.09.39.00; Tue, 15 Dec 2020 09:39:32 -0800 (PST) Received-SPF: pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-ext4-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=collabora.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1730578AbgLORfm convert rfc822-to-8bit (ORCPT + 99 others); Tue, 15 Dec 2020 12:35:42 -0500 Received: from bhuna.collabora.co.uk ([46.235.227.227]:55202 "EHLO bhuna.collabora.co.uk" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1730097AbgLORfd (ORCPT ); Tue, 15 Dec 2020 12:35:33 -0500 Received: from localhost (unknown [IPv6:2804:14c:132:242d::1000]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) (Authenticated sender: krisman) by bhuna.collabora.co.uk (Postfix) with ESMTPSA id 86C571F455DF; Tue, 15 Dec 2020 17:34:50 +0000 (GMT) From: Gabriel Krisman Bertazi To: Arnaud Ferraris Cc: linux-ext4@vger.kernel.org, drosen@google.com, ebiggers@kernel.org, tytso@mit.edu Subject: Re: [PATCH RESEND v2 07/12] e2fsck: Support casefold directories when rehashing Organization: Collabora References: <20201210150353.91843-1-arnaud.ferraris@collabora.com> <20201210150353.91843-8-arnaud.ferraris@collabora.com> <87y2i51ixm.fsf@collabora.com> <40566e74-abd8-13df-45b9-2cf26f89ad54@collabora.com> Date: Tue, 15 Dec 2020 14:34:45 -0300 In-Reply-To: <40566e74-abd8-13df-45b9-2cf26f89ad54@collabora.com> (Arnaud Ferraris's message of "Tue, 15 Dec 2020 18:17:19 +0100") Message-ID: <87r1nrt1l6.fsf@collabora.com> User-Agent: Gnus/5.13 (Gnus v5.13) Emacs/27.1 (gnu/linux) MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8BIT Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org Arnaud Ferraris writes: > Le 10/12/2020 à 21:53, Gabriel Krisman Bertazi a écrit : >> Arnaud Ferraris writes: >> >>> From: Gabriel Krisman Bertazi >>> >>> @@ -403,11 +451,12 @@ static int duplicate_search_and_fix(e2fsck_t ctx, ext2_filsys fs, >>> ent = fd->harray + i; >>> prev = ent - 1; >>> if (!ent->dir->inode || >>> - (ext2fs_dirent_name_len(ent->dir) != >>> - ext2fs_dirent_name_len(prev->dir)) || >>> - memcmp(ent->dir->name, prev->dir->name, >>> - ext2fs_dirent_name_len(ent->dir))) >>> + !same_name(cmp_ctx, ent->dir->name, >>> + ext2fs_dirent_name_len(ent->dir), >>> + prev->dir->name, >>> + ext2fs_dirent_name_len(prev->dir))) >>> continue; >>> + ^^^^^^^ >> >> noise. > > Could you please be more specific? the patch is adding an empty line for no reason. > > Arnaud > >> >> Other than that, I think this is still good. >> >>> pctx.dirent = ent->dir; >>> if ((ent->dir->inode == prev->dir->inode) && >>> fix_problem(ctx, PR_2_DUPLICATE_DIRENT, &pctx)) { >>> @@ -426,10 +475,11 @@ static int duplicate_search_and_fix(e2fsck_t ctx, ext2_filsys fs, >>> mutate_name(new_name, &new_len); >>> for (j=0; j < fd->num_array; j++) { >>> if ((i==j) || >>> - (new_len != >>> - (unsigned) ext2fs_dirent_name_len(fd->harray[j].dir)) || >>> - memcmp(new_name, fd->harray[j].dir->name, new_len)) >>> + !same_name(cmp_ctx, new_name, new_len, >>> + fd->harray[j].dir->name, >>> + ext2fs_dirent_name_len(fd->harray[j].dir))) { >>> continue; >>> + } >>> mutate_name(new_name, &new_len); >>> >>> j = -1; >>> @@ -894,6 +944,7 @@ errcode_t e2fsck_rehash_dir(e2fsck_t ctx, ext2_ino_t ino, >>> struct fill_dir_struct fd = { NULL, NULL, 0, 0, 0, NULL, >>> 0, 0, 0, 0, 0, 0 }; >>> struct out_dir outdir = { 0, 0, 0, 0 }; >>> + struct name_cmp_ctx name_cmp_ctx = {0, NULL}; >>> >>> e2fsck_read_inode(ctx, ino, &inode, "rehash_dir"); >>> >>> @@ -921,6 +972,11 @@ errcode_t e2fsck_rehash_dir(e2fsck_t ctx, ext2_ino_t ino, >>> fd.compress = 1; >>> fd.parent = 0; >>> >>> + if (fs->encoding && (inode.i_flags & EXT4_CASEFOLD_FL)) { >>> + name_cmp_ctx.casefold = 1; >>> + name_cmp_ctx.tbl = fs->encoding; >>> + } >>> + >>> retry_nohash: >>> /* Read in the entire directory into memory */ >>> retval = ext2fs_block_iterate3(fs, ino, 0, 0, >>> @@ -949,16 +1005,16 @@ retry_nohash: >>> /* Sort the list */ >>> resort: >>> if (fd.compress && fd.num_array > 1) >>> - qsort(fd.harray+2, fd.num_array-2, sizeof(struct hash_entry), >>> - hash_cmp); >>> + qsort_r(fd.harray+2, fd.num_array-2, sizeof(struct hash_entry), >>> + hash_cmp, &name_cmp_ctx); >>> else >>> - qsort(fd.harray, fd.num_array, sizeof(struct hash_entry), >>> - hash_cmp); >>> + qsort_r(fd.harray, fd.num_array, sizeof(struct hash_entry), >>> + hash_cmp, &name_cmp_ctx); >>> >>> /* >>> * Look for duplicates >>> */ >>> - if (duplicate_search_and_fix(ctx, fs, ino, &fd)) >>> + if (duplicate_search_and_fix(ctx, fs, ino, &fd, &name_cmp_ctx)) >>> goto resort; >>> >>> if (ctx->options & E2F_OPT_NO) { >> -- Gabriel Krisman Bertazi