Received: by 2002:a25:7ec1:0:0:0:0:0 with SMTP id z184csp6609486ybc; Thu, 28 Nov 2019 02:23:33 -0800 (PST) X-Google-Smtp-Source: APXvYqxtvEpRAicUXzYcECbkPxhW+BtmoEIfTjTyMvkdOHnAhVDG0mGkAyY+5c3p/xDrj4M0wjdm X-Received: by 2002:a17:906:ece4:: with SMTP id qt4mr152083ejb.244.1574936613445; Thu, 28 Nov 2019 02:23:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1574936613; cv=none; d=google.com; s=arc-20160816; b=NI0qtSzGMqJWclphmu1Ykuy9rToXDm+aC2S9p806Z/xOWeg2XKg6MhX+3DXo9iBZ4O nISw10CPQar3bqwOR2SGBka9PElkbsUSQI/+0VFWAWAYLSwyNu7MaiNuORGS3B5Lw3ZV RS2KC6BXT/w9ebuVsNfhNcfUFJnJROWmFe9IkDm6jNBNkykRG6PDIkOx8T18v+fvDDOi ZzHYHB63GgPLb3rRhdvJATl9xIKoItr7FBr7foW77KqnHLOvGRDg919PmhN8Wrr95Pm4 jQ4/M/p43QcMSmYlaZH5vSYsaKxIsy6cdFMRJBknvMGUhb4IqxvMxdtKXPCZUQE+JOKr Gvdw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date:from :references:cc:to:subject; bh=58EPqqdzDlTz5CfyzeYrRk1ao14pF6GhCoA9lG64FH0=; b=utjUUYf4/cesh+7Y2LuZ6V9hDvHoZbR0xWQPjvpzFGwiCwB2tjxpUg6P+c3g2S9VS8 MyWKtPjK/9i8PolYbaQN3xwRqwqE+K5BJE2xCwSWBE+3eN7Fi9nSWmkMDwJBXdOeWaoS cIpCYNY3yzmDcEto90KPuVcYd8S/+wcj4APsOJz2VW54IANqJ4iMx5zPvddp+uOEtSw8 apfkMZutTGuSyQ7Js078fpmYE8MwPL7zJ1va+ioK4OnghCg5GE0ZXypyhIVrZvXLexyc kqX+/+suksc81SJajfX2AEnqCT+8ObL878sd2gliuvyPzyhGWwKb8DFgIaTBYv/65nbL 1eVA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id r5si5097970edq.383.2019.11.28.02.23.09; Thu, 28 Nov 2019 02:23:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726655AbfK1KUn (ORCPT + 99 others); Thu, 28 Nov 2019 05:20:43 -0500 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:38354 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726133AbfK1KUn (ORCPT ); Thu, 28 Nov 2019 05:20:43 -0500 Received: from pps.filterd (m0098414.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id xASAHYnT067053 for ; Thu, 28 Nov 2019 05:20:41 -0500 Received: from e06smtp01.uk.ibm.com (e06smtp01.uk.ibm.com [195.75.94.97]) by mx0b-001b2d01.pphosted.com with ESMTP id 2whcy8t3d0-1 (version=TLSv1.2 cipher=AES256-GCM-SHA384 bits=256 verify=NOT) for ; Thu, 28 Nov 2019 05:20:41 -0500 Received: from localhost by e06smtp01.uk.ibm.com with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted for from ; Thu, 28 Nov 2019 10:20:39 -0000 Received: from b06cxnps3074.portsmouth.uk.ibm.com (9.149.109.194) by e06smtp01.uk.ibm.com (192.168.101.131) with IBM ESMTP SMTP Gateway: Authorized Use Only! Violators will be prosecuted; (version=TLSv1/SSLv3 cipher=AES256-GCM-SHA384 bits=256/256) Thu, 28 Nov 2019 10:20:36 -0000 Received: from d06av21.portsmouth.uk.ibm.com (d06av21.portsmouth.uk.ibm.com [9.149.105.232]) by b06cxnps3074.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id xASAKZFA66322460 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Thu, 28 Nov 2019 10:20:35 GMT Received: from d06av21.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 6525152054; Thu, 28 Nov 2019 10:20:35 +0000 (GMT) Received: from localhost.localdomain (unknown [9.85.87.233]) by d06av21.portsmouth.uk.ibm.com (Postfix) with ESMTP id 6085952057; Thu, 28 Nov 2019 10:20:32 +0000 (GMT) Subject: Re: [PATCH] f2fs: Fix direct IO handling To: Damien Le Moal , "linux-f2fs-devel@lists.sourceforge.net" , "linux-kernel@vger.kernel.org" , Jaegeuk Kim , Chao Yu Cc: "linux-fsdevel@vger.kernel.org" , Javier Gonzalez , Shinichiro Kawasaki References: <20191126075719.1046485-1-damien.lemoal@wdc.com> <20191126083443.F1FD5A405B@b06wcsmtp001.portsmouth.uk.ibm.com> From: Ritesh Harjani Date: Thu, 28 Nov 2019 15:50:30 +0530 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.7.2 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-TM-AS-GCONF: 00 x-cbid: 19112810-4275-0000-0000-0000038770F9 X-IBM-AV-DETECTION: SAVI=unused REMOTE=unused XFE=unused x-cbparentid: 19112810-4276-0000-0000-0000389B0254 Message-Id: <20191128102033.6085952057@d06av21.portsmouth.uk.ibm.com> X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.95,18.0.572 definitions=2019-11-28_01:2019-11-28,2019-11-28 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 mlxscore=0 bulkscore=0 spamscore=0 mlxlogscore=999 impostorscore=0 malwarescore=0 adultscore=0 priorityscore=1501 phishscore=0 suspectscore=0 lowpriorityscore=0 clxscore=1015 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-1910280000 definitions=main-1911280089 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11/28/19 7:40 AM, Damien Le Moal wrote: > On 2019/11/26 17:34, Ritesh Harjani wrote: >> Hello Damien, >> >> IIUC, you are trying to fix a stale data read by DIO read for the case >> you explained in your patch w.r.t. DIO-write forced to write as buffIO. >> >> Coincidentally I was just looking at the same code path just now. >> So I do have a query to you/f2fs group. Below could be silly one, as I >> don't understand F2FS in great detail. >> >> How is the stale data by DIO read, is protected against a mmap >> writes via f2fs_vm_page_mkwrite? >> >> f2fs_vm_page_mkwrite() f2fs_direct_IO (read) >> filemap_write_and_wait_range() >> -> f2fs_get_blocks() >> -> submit_bio() >> >> -> set_page_dirty() >> >> Is above race possible with current f2fs code? >> i.e. f2fs_direct_IO could read the stale data from the blocks >> which were allocated due to mmap fault? > > The faulted page is locked until the fault is fully processed so direct > IO has to wait for that to complete first. How about below parallelism? f2fs_vm_page_mkwrite() f2fs_direct_IO (read) filemap_write_and_wait_range() -> down_read(->i_mmap_sem); -> lock_page() -> f2fs_get_blocks() -> submit_bio() -> set_page_dirty() Can above DIO read not expose the stale data from block which was allocated in f2fs_vm_page_mkwrite path? > >> >> Am I missing something here? >> >> -ritesh >> >> On 11/26/19 1:27 PM, Damien Le Moal wrote: >>> f2fs_preallocate_blocks() identifies direct IOs using the IOCB_DIRECT >>> flag for a kiocb structure. However, the file system direct IO handler >>> function f2fs_direct_IO() may have decided that a direct IO has to be >>> exececuted as a buffered IO using the function f2fs_force_buffered_io(). >>> This is the case for instance for volumes including zoned block device >>> and for unaligned write IOs with LFS mode enabled. >>> >>> These 2 different methods of identifying direct IOs can result in >>> inconsistencies generating stale data access for direct reads after a >>> direct IO write that is treated as a buffered write. Fix this >>> inconsistency by combining the IOCB_DIRECT flag test with the result >>> of f2fs_force_buffered_io(). >>> >>> Reported-by: Javier Gonzalez >>> Signed-off-by: Damien Le Moal >>> --- >>> fs/f2fs/data.c | 4 +++- >>> 1 file changed, 3 insertions(+), 1 deletion(-) >>> >>> diff --git a/fs/f2fs/data.c b/fs/f2fs/data.c >>> index 5755e897a5f0..8ac2d3b70022 100644 >>> --- a/fs/f2fs/data.c >>> +++ b/fs/f2fs/data.c >>> @@ -1073,6 +1073,8 @@ int f2fs_preallocate_blocks(struct kiocb *iocb, struct iov_iter *from) >>> int flag; >>> int err = 0; >>> bool direct_io = iocb->ki_flags & IOCB_DIRECT; >>> + bool do_direct_io = direct_io && >>> + !f2fs_force_buffered_io(inode, iocb, from); >>> >>> /* convert inline data for Direct I/O*/ >>> if (direct_io) { >>> @@ -1081,7 +1083,7 @@ int f2fs_preallocate_blocks(struct kiocb *iocb, struct iov_iter *from) >>> return err; >>> } >>> >>> - if (direct_io && allow_outplace_dio(inode, iocb, from)) >>> + if (do_direct_io && allow_outplace_dio(inode, iocb, from)) >>> return 0; >>> >>> if (is_inode_flag_set(inode, FI_NO_PREALLOC)) >>> >> >> > >