Received: by 2002:a05:6a10:1287:0:0:0:0 with SMTP id d7csp5284767pxv; Wed, 21 Jul 2021 01:36:02 -0700 (PDT) X-Google-Smtp-Source: ABdhPJw4TAisvsF5K0Zg9WXqAs365Iz7yvjz3RaJQFf6c/a+WTq8siAyg4o+rm8Y8wQYH2kZBlQd X-Received: by 2002:a50:ee15:: with SMTP id g21mr46240297eds.334.1626856562338; Wed, 21 Jul 2021 01:36:02 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1626856562; cv=none; d=google.com; s=arc-20160816; b=XTbEt7GVjWVpopsfFQvW/bV8tcDtfKd6UikzajGD44QQwWwTbp7Iod8gsPepr+6sN7 jTYihAYAmB+oXhXRBFxxsUe4KKiCANkNVf8qOiJfqZnxJwpEQ+OICiYx/zvO/kXOpzYG EtKVvff1Jk8B9Nl6kWwU1GoDupTX7rrFOfU86fcONx/RFGQ3kBF1J/dUWtmKgY5Wfa4B 4PHiyLn1ae8GViQIkLkOU3lZaGLEGqZrrs5cvMftxZmuCZpQaNTspzDaOApvnN5eQJ/e 3UDKK0DbXuf1UjR4c8mnsAJ+dy0sWJLpisJzmAYJhRZDJa7wry9mfTFM0i7aOd7Q7DL3 b7HA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :message-id:date:subject:cc:to:from; bh=gzzKf47kfkQjhKuYrCS2R0yfUj1M3m939vSVSoBe2Ig=; b=WHP16QoMZ+KgMEtoHaBEMGkgUYZ90Ty2OOKrLsQtK4lS6Zhw+OjLM64A7ROGLAiBaE 3gxvFI11KZwhPqH0auVK69bB2i9Zl7hUPHysQhJ8eezyc8ZWF/bH/XVHGqiKRtpHAndP g6Q4ELNR2tn7b7hJl6+Cik8ERCEll6ezx10Sydm2bHoCGJfpia12k3IxNCoBeRsnKu3/ +Foz13SpxOylj0zjnXy0ynQLcbgNOcLKmtJxyAn7L+XgeOBn+D41O5UJZ1F9+JVE3SfD o1KLCZJwho4wIj9hlquzWaXnFnXiUYQLndVgxGZut6OcfAorFDV/3jlpQZ3I6MjL33te fLHA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id bd21si20727761edb.593.2021.07.21.01.35.38; Wed, 21 Jul 2021 01:36:02 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=alibaba.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S235919AbhGUHvt (ORCPT + 99 others); Wed, 21 Jul 2021 03:51:49 -0400 Received: from out4436.biz.mail.alibaba.com ([47.88.44.36]:42239 "EHLO out4436.biz.mail.alibaba.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236422AbhGUHnE (ORCPT ); Wed, 21 Jul 2021 03:43:04 -0400 X-Alimail-AntiSpam: AC=PASS;BC=-1|-1;BR=01201311R131e4;CH=green;DM=||false|;DS=||;FP=0|-1|-1|-1|0|-1|-1|-1;HT=e01e04426;MF=hsiangkao@linux.alibaba.com;NM=1;PH=DS;RN=8;SR=0;TI=SMTPD_---0UgVIMrR_1626855804; Received: from e18g09479.et15sqa.tbsite.net(mailfrom:hsiangkao@linux.alibaba.com fp:SMTPD_---0UgVIMrR_1626855804) by smtp.aliyun-inc.com(127.0.0.1); Wed, 21 Jul 2021 16:23:37 +0800 From: Gao Xiang To: linux-erofs@lists.ozlabs.org, linux-fsdevel@vger.kernel.org Cc: LKML , Gao Xiang , Christoph Hellwig , "Darrick J . Wong" , Matthew Wilcox , Andreas Gruenbacher Subject: [PATCH v5] iomap: support tail packing inline read Date: Wed, 21 Jul 2021 16:23:23 +0800 Message-Id: <20210721082323.41933-1-hsiangkao@linux.alibaba.com> X-Mailer: git-send-email 2.24.4 MIME-Version: 1.0 Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org This tries to add tail packing inline read to iomap, which can support several inline tail blocks. Similar to the previous approach, it cleans post-EOF in one iteration. The write path remains untouched since EROFS cannot be used for testing. It'd be better to be implemented if upcoming real users care rather than leave untested dead code around. Cc: Christoph Hellwig Cc: Darrick J. Wong Cc: Matthew Wilcox Cc: Andreas Gruenbacher Signed-off-by: Gao Xiang --- v4: https://lore.kernel.org/r/20210720133554.44058-1-hsiangkao@linux.alibaba.com changes since v4: - turn to WARN_ON_ONCE() suggested by Darrick; - fix size to "min(iomap->length + iomap->offset - pos, PAGE_SIZE - poff)" fs/iomap/buffered-io.c | 58 +++++++++++++++++++++++++++--------------- fs/iomap/direct-io.c | 13 +++++++--- 2 files changed, 47 insertions(+), 24 deletions(-) diff --git a/fs/iomap/buffered-io.c b/fs/iomap/buffered-io.c index 87ccb3438bec..d8436d34a159 100644 --- a/fs/iomap/buffered-io.c +++ b/fs/iomap/buffered-io.c @@ -205,25 +205,27 @@ struct iomap_readpage_ctx { struct readahead_control *rac; }; -static void +static int iomap_read_inline_data(struct inode *inode, struct page *page, - struct iomap *iomap) + struct iomap *iomap, loff_t pos) { - size_t size = i_size_read(inode); + unsigned int size, poff = offset_in_page(pos); void *addr; - if (PageUptodate(page)) - return; - - BUG_ON(page_has_private(page)); - BUG_ON(page->index); - BUG_ON(size > PAGE_SIZE - offset_in_page(iomap->inline_data)); + /* inline source data must be inside a single page */ + if (WARN_ON_ONCE(iomap->length > PAGE_SIZE - + offset_in_page(iomap->inline_data))) + return -EIO; + /* handle tail-packing blocks cross the current page into the next */ + size = min_t(unsigned int, iomap->length + iomap->offset - pos, + PAGE_SIZE - poff); addr = kmap_atomic(page); - memcpy(addr, iomap->inline_data, size); - memset(addr + size, 0, PAGE_SIZE - size); + memcpy(addr + poff, iomap->inline_data - iomap->offset + pos, size); + memset(addr + poff + size, 0, PAGE_SIZE - poff - size); kunmap_atomic(addr); - SetPageUptodate(page); + iomap_set_range_uptodate(page, poff, PAGE_SIZE - poff); + return PAGE_SIZE - poff; } static inline bool iomap_block_needs_zeroing(struct inode *inode, @@ -245,19 +247,23 @@ iomap_readpage_actor(struct inode *inode, loff_t pos, loff_t length, void *data, loff_t orig_pos = pos; unsigned poff, plen; sector_t sector; + int ret; - if (iomap->type == IOMAP_INLINE) { - WARN_ON_ONCE(pos); - iomap_read_inline_data(inode, page, iomap); - return PAGE_SIZE; - } - - /* zero post-eof blocks as the page may be mapped */ iop = iomap_page_create(inode, page); + /* needs to skip some leading uptodate blocks */ iomap_adjust_read_range(inode, iop, &pos, length, &poff, &plen); if (plen == 0) goto done; + if (iomap->type == IOMAP_INLINE) { + ret = iomap_read_inline_data(inode, page, iomap, pos); + if (ret < 0) + return ret; + plen = ret; + goto done; + } + + /* zero post-eof blocks as the page may be mapped */ if (iomap_block_needs_zeroing(inode, iomap, pos)) { zero_user(page, poff, plen); iomap_set_range_uptodate(page, poff, plen); @@ -589,6 +595,18 @@ __iomap_write_begin(struct inode *inode, loff_t pos, unsigned len, int flags, return 0; } +static int iomap_write_begin_inline(struct inode *inode, loff_t pos, + struct page *page, struct iomap *srcmap) +{ + /* needs more work for the tailpacking case, disable for now */ + if (WARN_ON_ONCE(srcmap->offset != 0)) + return -EIO; + if (PageUptodate(page)) + return 0; + iomap_read_inline_data(inode, page, srcmap, 0); + return 0; +} + static int iomap_write_begin(struct inode *inode, loff_t pos, unsigned len, unsigned flags, struct page **pagep, struct iomap *iomap, struct iomap *srcmap) @@ -618,7 +636,7 @@ iomap_write_begin(struct inode *inode, loff_t pos, unsigned len, unsigned flags, } if (srcmap->type == IOMAP_INLINE) - iomap_read_inline_data(inode, page, srcmap); + status = iomap_write_begin_inline(inode, pos, page, srcmap); else if (iomap->flags & IOMAP_F_BUFFER_HEAD) status = __block_write_begin_int(page, pos, len, NULL, srcmap); else diff --git a/fs/iomap/direct-io.c b/fs/iomap/direct-io.c index 9398b8c31323..cbadb99fb88c 100644 --- a/fs/iomap/direct-io.c +++ b/fs/iomap/direct-io.c @@ -379,22 +379,27 @@ iomap_dio_inline_actor(struct inode *inode, loff_t pos, loff_t length, { struct iov_iter *iter = dio->submit.iter; size_t copied; + void *dst = iomap->inline_data + pos - iomap->offset; - BUG_ON(pos + length > PAGE_SIZE - offset_in_page(iomap->inline_data)); + /* inline data must be inside a single page */ + if (WARN_ON_ONCE(length > PAGE_SIZE - + offset_in_page(iomap->inline_data))) + return -EIO; if (dio->flags & IOMAP_DIO_WRITE) { loff_t size = inode->i_size; if (pos > size) - memset(iomap->inline_data + size, 0, pos - size); - copied = copy_from_iter(iomap->inline_data + pos, length, iter); + memset(iomap->inline_data + size - iomap->offset, + 0, pos - size); + copied = copy_from_iter(dst, length, iter); if (copied) { if (pos + copied > size) i_size_write(inode, pos + copied); mark_inode_dirty(inode); } } else { - copied = copy_to_iter(iomap->inline_data + pos, length, iter); + copied = copy_to_iter(dst, length, iter); } dio->size += copied; return copied; -- 2.24.4