Received: by 2002:a25:c205:0:0:0:0:0 with SMTP id s5csp1184387ybf; Fri, 28 Feb 2020 16:15:03 -0800 (PST) X-Google-Smtp-Source: APXvYqz2phmMBnSI1NTk+YnI9ztETq7JDuvs+3tZvK3kenXemF79pvyKE+18QgvF6S2Vt7r5vKrS X-Received: by 2002:aca:db56:: with SMTP id s83mr4984915oig.171.1582935302846; Fri, 28 Feb 2020 16:15:02 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1582935302; cv=none; d=google.com; s=arc-20160816; b=yv+LeKwhs5A6Rrjw7PPjisDP5Y0q9So3cjSxE4t6WqDtTZwsIO+zCJgL9vRGBwN6Rq mVB7N1TLWb6GZGamgYNxpSoXzAFRVQrLz5rZBgc3GBONMa0NJJMR28qrud90nt+LSyy0 Yr8CvtZqkXMln2C4kiAXvaj5c8PIS5bFahhXGLQLT1U8FC11w7HHufpjrig26RX0Mmc1 Pty4pTbjJJo24wzcgqZ9lUmelzAtRYpwLzVNpO5z3wTpDYIG3m7NEETaG+PsFDVtKj1L Fm08wqIrMYgXAlquhmuhDklDi+F1o4B7qUyRnwNcN4Dfo5zgELNv7tEJZhLaG5FT9Rk2 2//A== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:message-id:date:subject :smtp-origin-cluster:cc:to:smtp-origin-hostname:from :smtp-origin-hostprefix:dkim-signature; bh=54qrPCjj2xg1JSMPnr0yRA36FAicW6ZVn6PlvT5gTU0=; b=IbfPQoakrsuhzBxoVZFDbTQXsBmfnQcL9MNwSWnS0aRm4SBLi3k3cWZIoYw310v58K IMZOFDYHwN7gqBnghOA4DC12r7eQFSuaZlka7oHT5WvjuHUdcUZ41C61eweB7FVqbDZo JD5e7F0t+lWmYzplT+vp6xR6AU3+w0h9oKVfqAojL457m/jcCCdGW3TY84iPbfRCsi6J 6UoW01MiPE4siZB2rni9wlVearDzXGUzgcnwsGQPSDlpGf4orNLrKNu0mPEafGE/u0vL Lq7iVnWuCePkalyjK19vdsY+2y+P2Swofk2YRUToIEWVK+p4Bhk/r3ew4jNUikUYMqb9 S3eQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@fb.com header.s=facebook header.b=IcA9q2sZ; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=fb.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id t7si2382743otl.133.2020.02.28.16.14.43; Fri, 28 Feb 2020 16:15:02 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@fb.com header.s=facebook header.b=IcA9q2sZ; spf=pass (google.com: best guess record for domain of linux-ext4-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-ext4-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=fb.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726490AbgB2AO3 (ORCPT + 99 others); Fri, 28 Feb 2020 19:14:29 -0500 Received: from mx0b-00082601.pphosted.com ([67.231.153.30]:64278 "EHLO mx0b-00082601.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726359AbgB2AO3 (ORCPT ); Fri, 28 Feb 2020 19:14:29 -0500 Received: from pps.filterd (m0109331.ppops.net [127.0.0.1]) by mx0a-00082601.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 01T058fT009269 for ; Fri, 28 Feb 2020 16:14:28 -0800 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=fb.com; h=from : to : cc : subject : date : message-id : mime-version : content-type; s=facebook; bh=54qrPCjj2xg1JSMPnr0yRA36FAicW6ZVn6PlvT5gTU0=; b=IcA9q2sZ0zV84z6i/Ul3t/w4uXwDZzbVtSIsRp+DmuXwht+h/ezymNUFI1fWfUOf6T9J dk+sD8eMnhtKvAL699qJD0rScSNYXgWYzzht9PqEJYDugRZQJ/XFO3lIP27iBpNdncZl CUOxKnqW3p0DqqCWmqwbtlYqzXBIeNDhl9k= Received: from mail.thefacebook.com ([163.114.132.120]) by mx0a-00082601.pphosted.com with ESMTP id 2yeputxbua-3 (version=TLSv1.2 cipher=ECDHE-RSA-AES128-GCM-SHA256 bits=128 verify=NOT) for ; Fri, 28 Feb 2020 16:14:28 -0800 Received: from intmgw004.06.prn3.facebook.com (2620:10d:c085:108::8) by mail.thefacebook.com (2620:10d:c085:11d::6) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256) id 15.1.1779.2; Fri, 28 Feb 2020 16:14:26 -0800 Received: by devvm4439.prn2.facebook.com (Postfix, from userid 111017) id 33081739F065; Fri, 28 Feb 2020 16:14:19 -0800 (PST) Smtp-Origin-Hostprefix: devvm From: Roman Gushchin Smtp-Origin-Hostname: devvm4439.prn2.facebook.com To: , , CC: Alexander Viro , Andreas Dilger , Roman Gushchin , Andrew Perepechko , Theodore Ts'o , Gioh Kim , Jan Kara Smtp-Origin-Cluster: prn2c23 Subject: [PATCH v2] ext4: use non-movable memory for superblock readahead Date: Fri, 28 Feb 2020 16:14:11 -0800 Message-ID: <20200229001411.128010-1-guro@fb.com> X-Mailer: git-send-email 2.17.1 X-FB-Internal: Safe MIME-Version: 1.0 Content-Type: text/plain X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138,18.0.572 definitions=2020-02-28_09:2020-02-28,2020-02-28 signatures=0 X-Proofpoint-Spam-Details: rule=fb_default_notspam policy=fb_default score=0 clxscore=1015 mlxlogscore=871 malwarescore=0 adultscore=0 suspectscore=0 mlxscore=0 bulkscore=0 lowpriorityscore=0 spamscore=0 priorityscore=1501 phishscore=0 impostorscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2001150001 definitions=main-2002280176 X-FB-Internal: deliver Sender: linux-ext4-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-ext4@vger.kernel.org Since commit a8ac900b8163 ("ext4: use non-movable memory for the superblock") buffers for ext4 superblock were allocated using the sb_bread_unmovable() helper which allocated buffer heads out of non-movable memory blocks. It was necessarily to not block page migrations and do not cause cma allocation failures. However commit 85c8f176a611 ("ext4: preload block group descriptors") broke this by introducing pre-reading of the ext4 superblock. The problem is that __breadahead() is using __getblk() underneath, which allocates buffer heads out of movable memory. It resulted in page migration failures I've seen on a machine with an ext4 partition and a preallocated cma area. Fix this by introducing sb_breadahead_unmovable() and __breadahead_gfp() helpers which use non-movable memory for buffer head allocations and use them for the ext4 superblock readahead. v2: found a similar issue in __ext4_get_inode_loc() Fixes: 85c8f176a611 ("ext4: preload block group descriptors") Signed-off-by: Roman Gushchin Cc: Andrew Perepechko Cc: Theodore Ts'o Cc: Gioh Kim Cc: Jan Kara --- fs/buffer.c | 11 +++++++++++ fs/ext4/inode.c | 2 +- fs/ext4/super.c | 2 +- include/linux/buffer_head.h | 8 ++++++++ 4 files changed, 21 insertions(+), 2 deletions(-) diff --git a/fs/buffer.c b/fs/buffer.c index 4299e100a05b..25462edd920e 100644 --- a/fs/buffer.c +++ b/fs/buffer.c @@ -1414,6 +1414,17 @@ void __breadahead(struct block_device *bdev, sector_t block, unsigned size) } EXPORT_SYMBOL(__breadahead); +void __breadahead_gfp(struct block_device *bdev, sector_t block, unsigned size, + gfp_t gfp) +{ + struct buffer_head *bh = __getblk_gfp(bdev, block, size, gfp); + if (likely(bh)) { + ll_rw_block(REQ_OP_READ, REQ_RAHEAD, 1, &bh); + brelse(bh); + } +} +EXPORT_SYMBOL(__breadahead_gfp); + /** * __bread_gfp() - reads a specified block and returns the bh * @bdev: the block_device to read from diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c index fa0ff78dc033..b131fedc6b77 100644 --- a/fs/ext4/inode.c +++ b/fs/ext4/inode.c @@ -4348,7 +4348,7 @@ static int __ext4_get_inode_loc(struct inode *inode, if (end > table) end = table; while (b <= end) - sb_breadahead(sb, b++); + sb_breadahead_unmovable(sb, b++); } /* diff --git a/fs/ext4/super.c b/fs/ext4/super.c index ff1b764b0c0e..fb2338a5220e 100644 --- a/fs/ext4/super.c +++ b/fs/ext4/super.c @@ -4331,7 +4331,7 @@ static int ext4_fill_super(struct super_block *sb, void *data, int silent) /* Pre-read the descriptors into the buffer cache */ for (i = 0; i < db_count; i++) { block = descriptor_loc(sb, logical_sb_block, i); - sb_breadahead(sb, block); + sb_breadahead_unmovable(sb, block); } for (i = 0; i < db_count; i++) { diff --git a/include/linux/buffer_head.h b/include/linux/buffer_head.h index 7b73ef7f902d..b56cc825f64d 100644 --- a/include/linux/buffer_head.h +++ b/include/linux/buffer_head.h @@ -189,6 +189,8 @@ struct buffer_head *__getblk_gfp(struct block_device *bdev, sector_t block, void __brelse(struct buffer_head *); void __bforget(struct buffer_head *); void __breadahead(struct block_device *, sector_t block, unsigned int size); +void __breadahead_gfp(struct block_device *, sector_t block, unsigned int size, + gfp_t gfp); struct buffer_head *__bread_gfp(struct block_device *, sector_t block, unsigned size, gfp_t gfp); void invalidate_bh_lrus(void); @@ -319,6 +321,12 @@ sb_breadahead(struct super_block *sb, sector_t block) __breadahead(sb->s_bdev, block, sb->s_blocksize); } +static inline void +sb_breadahead_unmovable(struct super_block *sb, sector_t block) +{ + __breadahead_gfp(sb->s_bdev, block, sb->s_blocksize, 0); +} + static inline struct buffer_head * sb_getblk(struct super_block *sb, sector_t block) { -- 2.24.1