Received: by 2002:ac0:bc90:0:0:0:0:0 with SMTP id a16csp621971img; Fri, 22 Mar 2019 05:20:19 -0700 (PDT) X-Google-Smtp-Source: APXvYqz7IhWKwdiATjv16bBL3f45l3ryownyCNroHniQT56UQ9eQ+lpzXo9PQlygXP2ow24UxuFW X-Received: by 2002:a63:2987:: with SMTP id p129mr8579087pgp.390.1553257219346; Fri, 22 Mar 2019 05:20:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1553257219; cv=none; d=google.com; s=arc-20160816; b=lvaVY2epYATr5uuJNCnOgWPl2LtLli/ekSy5e4TGCdfV5dFbpPraZuTspZNHaELAPX sfVskjUMq8nGVEc2PsRzL80D7XUcmAjUI6MECmoHAeVibF4/7RNQYQvp8aa/8MX1FsWX 4zoyae+ZByQysvh8C/3mbVjj3ZABQ6cixnLyRN6vWnclb9BjNjJsO63RhleuypyTCcS0 WeO9AGlLe6ElZ1UxLDcYt0SKB6yTyVqTVnziDUg4uDBB/MpJvnrHf5pD2p33Z8xWrN0p JxC8GgaxBJ1USFNg35NDxtcv66gfSJZHfilvqQTGYrPGOi1vEddlsHaaY8rtkgZFzpp4 xz0Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=LWx2+DDPmZPtLyoen+WynPC6/tfEDF9nIFlyXWyEzPI=; b=eRv3TDANF5VNNquq13TBs+AntX2DO8uyS0sKtU3D7gSRW0CwDDBh2YG53TPG7IB6hA pmLsH+/QFIJMQUKAUgHD/VjX7TliFNAgdGZVxF084UU/dV5+h/z1banX+Jfa1qubMNUb vG8b/xNZw1MTiPSoJrHYVqQLmcT6osD5xZhld57BSK6fv2kh34Y+COrAYMOGfTnm3efM pAu+tGWaRgZ1mE/6zobK+Oe2KdOdnheantsfdHhh8Xnt74v7xwtWs8cJECiOMPSvln5P YxZkEuk8b/4yUJ9gIOixSbyNwhSLNEoP9RbPF2E/iNGYLOX53DNs45S/gBDenjTmmIdm v48Q== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=HxqxmZng; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i69si6941753plb.75.2019.03.22.05.20.04; Fri, 22 Mar 2019 05:20:19 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=HxqxmZng; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2390470AbfCVMSS (ORCPT + 99 others); Fri, 22 Mar 2019 08:18:18 -0400 Received: from mail.kernel.org ([198.145.29.99]:57036 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390028AbfCVMSK (ORCPT ); Fri, 22 Mar 2019 08:18:10 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 2B2502083D; Fri, 22 Mar 2019 12:18:09 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1553257089; bh=0ZDgibt9+HDt38zKOSEMdV9wgvgxAaphMXKQb4Y+IIs=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=HxqxmZngRgZiYc3DNAww3nQ3/M8l+hDJvpUIYVgFDP8VV5Qla8SnVz5kP3GxzSeaX zJKqqyqjRya63ugovgZC77XweErWJbjEeyV6wWnF2IhA0Hdd3t+DOThal8rGo26u8S iV11pZs78QC4lBaRoUc/j97bM32c5ep9O50zwito= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Zygo Blaxell , Filipe Manana , David Sterba Subject: [PATCH 5.0 097/238] Btrfs: fix corruption reading shared and compressed extents after hole punching Date: Fri, 22 Mar 2019 12:15:16 +0100 Message-Id: <20190322111304.289411292@linuxfoundation.org> X-Mailer: git-send-email 2.21.0 In-Reply-To: <20190322111258.383569278@linuxfoundation.org> References: <20190322111258.383569278@linuxfoundation.org> User-Agent: quilt/0.65 X-stable: review X-Patchwork-Hint: ignore MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 5.0-stable review patch. If anyone has any objections, please let me know. ------------------ From: Filipe Manana commit 8e928218780e2f1cf2f5891c7575e8f0b284fcce upstream. In the past we had data corruption when reading compressed extents that are shared within the same file and they are consecutive, this got fixed by commit 005efedf2c7d0 ("Btrfs: fix read corruption of compressed and shared extents") and by commit 808f80b46790f ("Btrfs: update fix for read corruption of compressed and shared extents"). However there was a case that was missing in those fixes, which is when the shared and compressed extents are referenced with a non-zero offset. The following shell script creates a reproducer for this issue: #!/bin/bash mkfs.btrfs -f /dev/sdc &> /dev/null mount -o compress /dev/sdc /mnt/sdc # Create a file with 3 consecutive compressed extents, each has an # uncompressed size of 128Kb and a compressed size of 4Kb. for ((i = 1; i <= 3; i++)); do head -c 4096 /dev/zero for ((j = 1; j <= 31; j++)); do head -c 4096 /dev/zero | tr '\0' "\377" done done > /mnt/sdc/foobar sync echo "Digest after file creation: $(md5sum /mnt/sdc/foobar)" # Clone the first extent into offsets 128K and 256K. xfs_io -c "reflink /mnt/sdc/foobar 0 128K 128K" /mnt/sdc/foobar xfs_io -c "reflink /mnt/sdc/foobar 0 256K 128K" /mnt/sdc/foobar sync echo "Digest after cloning: $(md5sum /mnt/sdc/foobar)" # Punch holes into the regions that are already full of zeroes. xfs_io -c "fpunch 0 4K" /mnt/sdc/foobar xfs_io -c "fpunch 128K 4K" /mnt/sdc/foobar xfs_io -c "fpunch 256K 4K" /mnt/sdc/foobar sync echo "Digest after hole punching: $(md5sum /mnt/sdc/foobar)" echo "Dropping page cache..." sysctl -q vm.drop_caches=1 echo "Digest after hole punching: $(md5sum /mnt/sdc/foobar)" umount /dev/sdc When running the script we get the following output: Digest after file creation: 5a0888d80d7ab1fd31c229f83a3bbcc8 /mnt/sdc/foobar linked 131072/131072 bytes at offset 131072 128 KiB, 1 ops; 0.0033 sec (36.960 MiB/sec and 295.6830 ops/sec) linked 131072/131072 bytes at offset 262144 128 KiB, 1 ops; 0.0015 sec (78.567 MiB/sec and 628.5355 ops/sec) Digest after cloning: 5a0888d80d7ab1fd31c229f83a3bbcc8 /mnt/sdc/foobar Digest after hole punching: 5a0888d80d7ab1fd31c229f83a3bbcc8 /mnt/sdc/foobar Dropping page cache... Digest after hole punching: fba694ae8664ed0c2e9ff8937e7f1484 /mnt/sdc/foobar This happens because after reading all the pages of the extent in the range from 128K to 256K for example, we read the hole at offset 256K and then when reading the page at offset 260K we don't submit the existing bio, which is responsible for filling all the page in the range 128K to 256K only, therefore adding the pages from range 260K to 384K to the existing bio and submitting it after iterating over the entire range. Once the bio completes, the uncompressed data fills only the pages in the range 128K to 256K because there's no more data read from disk, leaving the pages in the range 260K to 384K unfilled. It is just a slightly different variant of what was solved by commit 005efedf2c7d0 ("Btrfs: fix read corruption of compressed and shared extents"). Fix this by forcing a bio submit, during readpages(), whenever we find a compressed extent map for a page that is different from the extent map for the previous page or has a different starting offset (in case it's the same compressed extent), instead of the extent map's original start offset. A test case for fstests follows soon. Reported-by: Zygo Blaxell Fixes: 808f80b46790f ("Btrfs: update fix for read corruption of compressed and shared extents") Fixes: 005efedf2c7d0 ("Btrfs: fix read corruption of compressed and shared extents") Cc: stable@vger.kernel.org # 4.3+ Tested-by: Zygo Blaxell Signed-off-by: Filipe Manana Signed-off-by: David Sterba Signed-off-by: Greg Kroah-Hartman --- fs/btrfs/extent_io.c | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) --- a/fs/btrfs/extent_io.c +++ b/fs/btrfs/extent_io.c @@ -2985,11 +2985,11 @@ static int __do_readpage(struct extent_i */ if (test_bit(EXTENT_FLAG_COMPRESSED, &em->flags) && prev_em_start && *prev_em_start != (u64)-1 && - *prev_em_start != em->orig_start) + *prev_em_start != em->start) force_bio_submit = true; if (prev_em_start) - *prev_em_start = em->orig_start; + *prev_em_start = em->start; free_extent_map(em); em = NULL;