Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp9047211ybc; Fri, 29 Nov 2019 23:29:04 -0800 (PST) X-Google-Smtp-Source: APXvYqzEbCJMgP+3t19WUDM0HV9iCFcMl33U0e4t6NDdxpZw94kajron6W0ikAc1HG6VzTGPPkGm X-Received: by 2002:a17:906:82d0:: with SMTP id a16mr38091277ejy.206.1575098944739; Fri, 29 Nov 2019 23:29:04 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1575098944; cv=none; d=google.com; s=arc-20160816; b=vBf4p4iQw+Y7w+ojRTay0gtR/8B6vcHYnzOlK7yPfMnQTeXEOCrZVGyUfufNFmtRGM 7vI228vMgif2oF11LHkR+42C53UTHenP/g5MIrge+AlxehiW7Qm/vfbdEqkCLTdENZg4 jAAjciYXlB0iPoXEytu1dPzSZKOZJm75WDAwFHMMqlxnacMHMXSvqdTwCJ/WL50q/Gca /UZd/iq5rTHy9smDaGUt4enIHIMJ3UaNtq/P43V57ejFLG8BX+mrIbdgKuwZLzr0xMzY QzljaJ1+CukGlHFATaz/9AgVsjxTIu810ohen5x6XsYVtCYj3Q79to20ZyO982rehfBN n54g== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=+I+QoANEQOGPuT0wmkP3fOlpNljp5Q1rz19/BqUttYQ=; b=INIb4Xk4saDsCgGPK3nG+81yuveux8r56hnKUeZX6DB1J4035+S5Oi1zROxY+tsAY6 iPKBQf2REtIhoD5oxhsDair5+IKXSVsqc3dHW8tccgbFTbz26J8803K3w4JDAx5fZh0y 7yxFY+0zg5FBYFgqOvkI1nhbYy2BiHNTDlB8NO2T+pZTIib6pu6BCQhoVO0mhVvK0qIN MuNuTopczYsmE/nJveSu0Vgurma1D903uH+ud2V+yefbbBmlD5v/vSBMtlHyLYx1u3s2 3Jj5WmelR1ExTkbL2EQMz6mIMtF7YsObt3aBFZG03UZMqY4UsAOJEdMhDqKpl2sStCgX /+eA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i22si1287306ejh.274.2019.11.29.23.28.40; Fri, 29 Nov 2019 23:29:04 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726188AbfK3H1e (ORCPT + 99 others); Sat, 30 Nov 2019 02:27:34 -0500 Received: from szxga06-in.huawei.com ([45.249.212.32]:49068 "EHLO huawei.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1725298AbfK3H1e (ORCPT ); Sat, 30 Nov 2019 02:27:34 -0500 Received: from DGGEMS405-HUB.china.huawei.com (unknown [172.30.72.59]) by Forcepoint Email with ESMTP id 83080E6B1FE2ADD81028; Sat, 30 Nov 2019 15:27:32 +0800 (CST) Received: from [10.134.22.195] (10.134.22.195) by smtp.huawei.com (10.3.19.205) with Microsoft SMTP Server (TLS) id 14.3.439.0; Sat, 30 Nov 2019 15:27:29 +0800 Subject: Re: [PATCH] f2fs: Fix direct IO handling To: Ritesh Harjani , Damien Le Moal , "linux-f2fs-devel@lists.sourceforge.net" , "linux-kernel@vger.kernel.org" , Jaegeuk Kim CC: Javier Gonzalez , Shinichiro Kawasaki References: <20191126075719.1046485-1-damien.lemoal@wdc.com> <20191126083443.F1FD5A405B@b06wcsmtp001.portsmouth.uk.ibm.com> <20191128102033.6085952057@d06av21.portsmouth.uk.ibm.com> From: Chao Yu Message-ID: Date: Sat, 30 Nov 2019 15:27:29 +0800 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <20191128102033.6085952057@d06av21.portsmouth.uk.ibm.com> Content-Type: text/plain; charset="utf-8" Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.134.22.195] X-CFilter-Loop: Reflected Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2019/11/28 18:20, Ritesh Harjani wrote: > > > On 11/28/19 7:40 AM, Damien Le Moal wrote: >> On 2019/11/26 17:34, Ritesh Harjani wrote: >>> Hello Damien, >>> >>> IIUC, you are trying to fix a stale data read by DIO read for the case >>> you explained in your patch w.r.t. DIO-write forced to write as buffIO. >>> >>> Coincidentally I was just looking at the same code path just now. >>> So I do have a query to you/f2fs group. Below could be silly one, as I >>> don't understand F2FS in great detail. >>> >>> How is the stale data by DIO read, is protected against a mmap >>> writes via f2fs_vm_page_mkwrite? >>> >>> f2fs_vm_page_mkwrite() f2fs_direct_IO (read) >>> filemap_write_and_wait_range() >>> -> f2fs_get_blocks() >>> -> submit_bio() >>> >>> -> set_page_dirty() >>> >>> Is above race possible with current f2fs code? >>> i.e. f2fs_direct_IO could read the stale data from the blocks >>> which were allocated due to mmap fault? >> >> The faulted page is locked until the fault is fully processed so direct >> IO has to wait for that to complete first. > > How about below parallelism? > > f2fs_vm_page_mkwrite() f2fs_direct_IO (read) > filemap_write_and_wait_range() > -> down_read(->i_mmap_sem); > -> lock_page() > -> f2fs_get_blocks() > -> submit_bio() > > -> set_page_dirty() > > Can above DIO read not expose the stale data from block which was > allocated in f2fs_vm_page_mkwrite path? The race can happen, however I doubt the race condition is more complicated as I described in previous reply of mine, could you check that? Thanks, > > >> >>> >>> Am I missing something here? >>> >>> -ritesh >>> >>> On 11/26/19 1:27 PM, Damien Le Moal wrote: >>>> f2fs_preallocate_blocks() identifies direct IOs using the IOCB_DIRECT >>>> flag for a kiocb structure. However, the file system direct IO handler >>>> function f2fs_direct_IO() may have decided that a direct IO has to be >>>> exececuted as a buffered IO using the function f2fs_force_buffered_io(). >>>> This is the case for instance for volumes including zoned block device >>>> and for unaligned write IOs with LFS mode enabled. >>>> >>>> These 2 different methods of identifying direct IOs can result in >>>> inconsistencies generating stale data access for direct reads after a >>>> direct IO write that is treated as a buffered write. Fix this >>>> inconsistency by combining the IOCB_DIRECT flag test with the result >>>> of f2fs_force_buffered_io(). >>>> >>>> Reported-by: Javier Gonzalez >>>> Signed-off-by: Damien Le Moal >>>> --- >>>> fs/f2fs/data.c | 4 +++- >>>> 1 file changed, 3 insertions(+), 1 deletion(-) >>>> >>>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>>> index 5755e897a5f0..8ac2d3b70022 100644 >>>> --- a/fs/f2fs/data.c >>>> +++ b/fs/f2fs/data.c >>>> @@ -1073,6 +1073,8 @@ int f2fs_preallocate_blocks(struct kiocb *iocb, struct iov_iter *from) >>>> int flag; >>>> int err = 0; >>>> bool direct_io = iocb->ki_flags & IOCB_DIRECT; >>>> + bool do_direct_io = direct_io && >>>> + !f2fs_force_buffered_io(inode, iocb, from); >>>> >>>> /* convert inline data for Direct I/O*/ >>>> if (direct_io) { >>>> @@ -1081,7 +1083,7 @@ int f2fs_preallocate_blocks(struct kiocb *iocb, struct iov_iter *from) >>>> return err; >>>> } >>>> >>>> - if (direct_io && allow_outplace_dio(inode, iocb, from)) >>>> + if (do_direct_io && allow_outplace_dio(inode, iocb, from)) >>>> return 0; >>>> >>>> if (is_inode_flag_set(inode, FI_NO_PREALLOC)) >>>> >>> >>> >> >> > > . >