From: Greg Kroah-Hartman
To: linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman, stable@vger.kernel.org, Jia Guo, Yiwen Jiang, Mark Fasheh, Joel Becker, Junxiao Bi, Joseph Qi, Andrew Morton, Linus Torvalds, Sasha Levin
Subject: [PATCH 4.19 085/149] ocfs2: clear zero in unaligned direct IO
Date: Mon, 4 Nov 2019 22:44:38 +0100
Message-Id: <20191104212142.519464337@linuxfoundation.org>
In-Reply-To: <20191104212126.090054740@linuxfoundation.org>
References: <20191104212126.090054740@linuxfoundation.org>

From: Jia Guo

[ Upstream commit 7a243c82ea527cd1da47381ad9cd646844f3b693 ]

The unused portion of a partially written fs-block-sized block is not
zeroed in an unaligned append direct write. This can lead to serious
data inconsistencies.

Ocfs2 manages disk space in units of the cluster size (for example, 1M).
A partial write within a cluster changes the cluster state from
UN-WRITTEN to WRITTEN. The VFS (function dio_zero_block) then skips the
zeroing, because ocfs2_dio_wr_get_block does not set the bh's state to
NEW when we write to a WRITTEN cluster. For example, if the cluster size
is 1M, the file size is 8k and we direct write from 14k to 15k, then
12k~14k and 15k~16k will contain dirty data.

We have to deal with two cases:
 1. The starting position of the direct write is outside the file.
 2. The starting position of the direct write is located in the file.

In the first case, we need to set the bh's state to NEW. In the second
case, we need to map twice, because the bh's state must be set to NEW
for the area beyond the file size but not for the area inside it.
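
The arithmetic behind the 14k to 15k example can be sketched in
userspace C. This is only an illustration, not code from the patch: the
4k fs block size is an assumption inferred from the 12k~14k and 15k~16k
figures above, and struct byte_range and unwritten_edges() are
hypothetical names made up for this sketch.

```c
#include <assert.h>
#include <stdint.h>

/* Half-open byte range [start, end). Hypothetical helper type. */
struct byte_range {
	uint64_t start;
	uint64_t end;
};

/*
 * For a write covering [pos, pos + count), compute the parts of the
 * first and last touched fs blocks that the write itself does not
 * cover. blksize must be a power of two (an assumption of this sketch).
 * These are the ranges that keep stale data unless they are zeroed.
 */
static void unwritten_edges(uint64_t pos, uint64_t count, uint64_t blksize,
			    struct byte_range *head, struct byte_range *tail)
{
	uint64_t end = pos + count;

	assert(count > 0);
	assert(blksize != 0 && (blksize & (blksize - 1)) == 0);

	/* Head: from the start of the first block up to the write. */
	head->start = pos & ~(blksize - 1);
	head->end = pos;

	/* Tail: from the end of the write up to the end of the last block. */
	tail->start = end;
	tail->end = (end + blksize - 1) & ~(blksize - 1);
}
```

Calling unwritten_edges(14 * 1024, 1024, 4096, &head, &tail) yields the
head range 12k~14k and the tail range 15k~16k, matching the ranges the
commit message says will contain dirty data.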

[akpm@linux-foundation.org: coding style fixes]
Link: http://lkml.kernel.org/r/5292e287-8f1a-fd4a-1a14-661e555e0bed@huawei.com
Signed-off-by: Jia Guo
Reviewed-by: Yiwen Jiang
Cc: Mark Fasheh
Cc: Joel Becker
Cc: Junxiao Bi
Cc: Joseph Qi
Signed-off-by: Andrew Morton
Signed-off-by: Linus Torvalds
Signed-off-by: Sasha Levin
---
 fs/ocfs2/aops.c | 22 +++++++++++++++++++++-
 1 file changed, 21 insertions(+), 1 deletion(-)

diff --git a/fs/ocfs2/aops.c b/fs/ocfs2/aops.c
index 7578bd507c70b..dc773e163132c 100644
--- a/fs/ocfs2/aops.c
+++ b/fs/ocfs2/aops.c
@@ -2153,13 +2153,30 @@ static int ocfs2_dio_wr_get_block(struct inode *inode, sector_t iblock,
 	struct ocfs2_dio_write_ctxt *dwc = NULL;
 	struct buffer_head *di_bh = NULL;
 	u64 p_blkno;
-	loff_t pos = iblock << inode->i_sb->s_blocksize_bits;
+	unsigned int i_blkbits = inode->i_sb->s_blocksize_bits;
+	loff_t pos = iblock << i_blkbits;
+	sector_t endblk = (i_size_read(inode) - 1) >> i_blkbits;
 	unsigned len, total_len = bh_result->b_size;
 	int ret = 0, first_get_block = 0;
 
 	len = osb->s_clustersize - (pos & (osb->s_clustersize - 1));
 	len = min(total_len, len);
 
+	/*
+	 * bh_result->b_size is count in get_more_blocks according to write
+	 * "pos" and "end", we need map twice to return different buffer state:
+	 * 1. area in file size, not set NEW;
+	 * 2. area out file size, set NEW.
+	 *
+	 *		   iblock    endblk
+	 * |--------|---------|---------|---------
+	 * |<-------area in file------->|
+	 */
+
+	if ((iblock <= endblk) &&
+	    ((iblock + ((len - 1) >> i_blkbits)) > endblk))
+		len = (endblk - iblock + 1) << i_blkbits;
+
 	mlog(0, "get block of %lu at %llu:%u req %u\n",
 			inode->i_ino, pos, len, total_len);
 
@@ -2243,6 +2260,9 @@ static int ocfs2_dio_wr_get_block(struct inode *inode, sector_t iblock,
 	if (desc->c_needs_zero)
 		set_buffer_new(bh_result);
 
+	if (iblock > endblk)
+		set_buffer_new(bh_result);
+
 	/* May sleep in end_io. It should not happen in a irq context. So defer
 	 * it to dio work queue. */
 	set_buffer_defer_completion(bh_result);
-- 
2.20.1
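
For readers who want to trace the "map twice" behaviour outside the
kernel, the following is a rough userspace model of the two checks this
patch adds. It is a sketch under stated assumptions, not the kernel
code: struct map_result and model_get_block() are hypothetical names,
the cluster-size clamp and the c_needs_zero path are left out, and
i_size is assumed to be non-zero.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for what one get_block call reports back. */
struct map_result {
	uint64_t len;	/* bytes mapped by this call */
	bool is_new;	/* would the buffer be flagged NEW? */
};

/*
 * Model of the added logic: a request that starts inside the file but
 * runs past its last block is clamped to end at that block, so the
 * remainder is mapped by a second call that starts beyond the file and
 * is flagged NEW.
 */
static struct map_result model_get_block(uint64_t iblock, uint64_t len,
					 uint64_t i_size, unsigned int blkbits)
{
	struct map_result res;
	uint64_t endblk;

	assert(i_size > 0);	/* simplification of this sketch */
	assert(len > 0);

	endblk = (i_size - 1) >> blkbits;	/* last block inside the file */

	/* Starts in the file, ends beyond it: stop at endblk. */
	if (iblock <= endblk && iblock + ((len - 1) >> blkbits) > endblk)
		len = (endblk - iblock + 1) << blkbits;

	res.len = len;
	res.is_new = iblock > endblk;	/* only the area beyond the file */
	return res;
}
```

With 4k blocks (blkbits = 12) and an 8k file, endblk is 1. A 12k request
at iblock 1 is clamped to 4k and not flagged NEW; the follow-up request
at iblock 2 keeps its length and is flagged NEW, which is what lets the
generic direct IO code zero the unwritten edges.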