Received: by 2002:a5d:9c59:0:0:0:0:0 with SMTP id 25csp63501iof; Sun, 5 Jun 2022 21:12:40 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxR4JdJdDYcpjote/HJZjl5voMjSxeFfpa4ibrFTVMb/z6XqlE7eMETgNSstC1LWGT9iG6P X-Received: by 2002:a63:ea06:0:b0:3fd:a385:a856 with SMTP id c6-20020a63ea06000000b003fda385a856mr4689250pgi.393.1654488760238; Sun, 05 Jun 2022 21:12:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654488760; cv=none; d=google.com; s=arc-20160816; b=lPDOm4oTsmck3JzkdEqqcnDBd+K4Y+xygJYvlfoORzFV7XR8G9JlFsnHOM/LC39Sib 3m1of1T4kszQgCcXSc1qLuCDFQ914lwoERJCI0CRnIkLiXDJ81NDH9Pw3cnZdUhXdujw XxXs7/p+z1XdayvRGmfKIxc+0Y6mDXsFUIwIq4Wkd08ssWUH4Bk/dfECT+GtO6R6iSee 1OQLSshKBSdH0AFmM1KFGA2WHKsV2I4nSDY7d/dK9d6o6xBqnn2HLP5GiE7veEcQO6N/ VeLiCOijdJTqBRy7zdOkbb0Jx/PQtgymGb5Xp3PT7bm+5s6xSKZ5CH5rXKzY39UnN+ri mPbw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=4njyTH7nXQHuz+gsWkYSQBekmqjtGJtk5RZMu1hQzfk=; b=OlwOkQQ0P4ShhICu4wsSLRVfqXV3iUY3QCblUMlbIpTi50TGE2wyydWosijFIVjCn5 ceQLkRx4ulFLajxqqsUv09auIlmjAca73Cix+plP9zT8LzJ/B2noXlnE8WZ7D03KqlLP gD/QZXIbhuZSQqQBeWrCe7G62z+WTloqz3H48HhzW4qaOy7UWpfi8Cc1Npky8rOLRyoZ lrIweb+vzS6muuq7Po2RZnrPGgmLKfrypZmo4hX6+UKzEvq3o9cL5c2ZDgWl4g5FA6qq AuGcDQJAmW+jyK0LMid8J40tCKX9XYHo0GUWxVMLeh6ZgkMfb1QzsBqwEBXkJyI7OMZ2 u/cg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=T5BuIgPu; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id q10-20020aa7842a000000b0051bc3d323e5si15366223pfn.341.2022.06.05.21.12.39 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 05 Jun 2022 21:12:40 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=T5BuIgPu; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 9B778737BD; Sun, 5 Jun 2022 20:50:57 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S243337AbiFEN17 (ORCPT + 99 others); Sun, 5 Jun 2022 09:27:59 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38112 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S242526AbiFEN16 (ORCPT ); Sun, 5 Jun 2022 09:27:58 -0400 Received: from ams.source.kernel.org (ams.source.kernel.org [145.40.68.75]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id AEA5226F; Sun, 5 Jun 2022 06:27:56 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by ams.source.kernel.org (Postfix) with ESMTPS id 72FFCB80735; Sun, 5 Jun 2022 13:27:55 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 039BAC385A5; Sun, 5 Jun 2022 13:27:54 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1654435674; bh=8gbfFJeISrWscV+WZ8yaxKVwfzLWnFEB2Q++sxohQbY=; h=Date:From:To:Cc:Subject:References:In-Reply-To:From; b=T5BuIgPufM1YuXVp40+UHF0b236SpBsf6Xcf1Y+xPQFvaf2UTJPBUy5+aY7pSnVJm pDId2gsNXT6O0quNekDMfb4qsCGA807KZUYbwI42RGPq6v7DfXQptd3a7sF/nEJbGS D/UKUaYyyjKKYaT90Yd/uU9S+RIifyYydgelOMX+sKr7Jc9l01QQmRAM4JOEvpfCYl 1N2CMxKhHNDfwhtY9t1wYOuX2GBMP4gIvpc4aSQOvAQKI8Hf6YAQ9s8g6UkaFL52Ix hik3xXJ8hiJwSBtuhzlcv0IkIClgOdJucQnvcNViXYLt2Fej90HtAf6Ns+4bQ/+k0O axABqUwzMG4LA== Date: Sun, 5 Jun 2022 09:27:52 -0400 From: Sasha Levin To: John Stoffel Cc: linux-kernel@vger.kernel.org, stable@vger.kernel.org, Heming Zhao , kernel test robot , Dan Carpenter , Guoqing Jiang , Song Liu , linux-raid@vger.kernel.org Subject: Re: [PATCH AUTOSEL 5.4 18/55] md/bitmap: don't set sb values if can't pass sanity check Message-ID: References: <20220530134701.1935933-1-sashal@kernel.org> <20220530134701.1935933-18-sashal@kernel.org> <25239.56271.848372.965726@quad.stoffel.home> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Disposition: inline In-Reply-To: <25239.56271.848372.965726@quad.stoffel.home> X-Spam-Status: No, score=-3.5 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, RDNS_NONE,SPF_HELO_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org I'm sorry, I couldn't parse the mail below. On Wed, Jun 01, 2022 at 05:36:15PM -0400, John Stoffel wrote: >>>>>> "Sasha" == Sasha Levin writes: > >Sasha> From: Heming Zhao >Sasha> [ Upstream commit e68cb83a57a458b01c9739e2ad9cb70b04d1e6d2 ] > >Sasha> If bitmap area contains invalid data, kernel will crash then mdadm >Sasha> triggers "Segmentation fault". >Sasha> This is cluster-md speical bug. In non-clustered env, mdadm will > >special > >All the commit messages need to be fixed from what I see. > >Sasha> handle broken metadata case. In clustered array, only kernel space >Sasha> handles bitmap slot info. But even this bug only happened in clustered >Sasha> env, current sanity check is wrong, the code should be changed. > >Sasha> How to trigger: (faulty injection) > >Sasha> dd if=/dev/zero bs=1M count=1 oflag=direct of=/dev/sda >Sasha> dd if=/dev/zero bs=1M count=1 oflag=direct of=/dev/sdb >Sasha> mdadm -C /dev/md0 -b clustered -e 1.2 -n 2 -l mirror /dev/sda /dev/sdb >Sasha> mdadm -Ss >Sasha> echo aaa > magic.txt >Sasha> == below modifying slot 2 bitmap data == >Sasha> dd if=magic.txt of=/dev/sda seek=16384 bs=1 count=3 <== destroy magic >Sasha> dd if=/dev/zero of=/dev/sda seek=16436 bs=1 count=4 <== ZERO chunksize >Sasha> mdadm -A /dev/md0 /dev/sda /dev/sdb >Sasha> == kernel crashes. mdadm outputs "Segmentation fault" == > >Sasha> Reason of kernel crash: > >Sasha> In md_bitmap_read_sb (called by md_bitmap_create), bad bitmap magic didn't >Sasha> block chunksize assignment, and zero value made DIV_ROUND_UP_SECTOR_T() >Sasha> trigger "divide error". > >Sasha> Crash log: > >Sasha> kernel: md: md0 stopped. >Sasha> kernel: md/raid1:md0: not clean -- starting background reconstruction >Sasha> kernel: md/raid1:md0: active with 2 out of 2 mirrors >Sasha> kernel: dlm: ... ... >Sasha> kernel: md-cluster: Joined cluster 44810aba-38bb-e6b8-daca-bc97a0b254aa slot 1 >Sasha> kernel: md0: invalid bitmap file superblock: bad magic >Sasha> kernel: md_bitmap_copy_from_slot can't get bitmap from slot 2 >Sasha> kernel: md-cluster: Could not gather bitmaps from slot 2 >Sasha> kernel: divide error: 0000 [#1] SMP NOPTI >Sasha> kernel: CPU: 0 PID: 1603 Comm: mdadm Not tainted 5.14.6-1-default >Sasha> kernel: Hardware name: QEMU Standard PC (i440FX + PIIX, 1996) >Sasha> kernel: RIP: 0010:md_bitmap_create+0x1d1/0x850 [md_mod] >Sasha> kernel: RSP: 0018:ffffc22ac0843ba0 EFLAGS: 00010246 >Sasha> kernel: ... ... >Sasha> kernel: Call Trace: >Sasha> kernel: ? dlm_lock_sync+0xd0/0xd0 [md_cluster 77fe..7a0] >Sasha> kernel: md_bitmap_copy_from_slot+0x2c/0x290 [md_mod 24ea..d3a] >Sasha> kernel: load_bitmaps+0xec/0x210 [md_cluster 77fe..7a0] >Sasha> kernel: md_bitmap_load+0x81/0x1e0 [md_mod 24ea..d3a] >Sasha> kernel: do_md_run+0x30/0x100 [md_mod 24ea..d3a] >Sasha> kernel: md_ioctl+0x1290/0x15a0 [md_mod 24ea....d3a] >Sasha> kernel: ? mddev_unlock+0xaa/0x130 [md_mod 24ea..d3a] >Sasha> kernel: ? blkdev_ioctl+0xb1/0x2b0 >Sasha> kernel: block_ioctl+0x3b/0x40 >Sasha> kernel: __x64_sys_ioctl+0x7f/0xb0 >Sasha> kernel: do_syscall_64+0x59/0x80 >Sasha> kernel: ? exit_to_user_mode_prepare+0x1ab/0x230 >Sasha> kernel: ? syscall_exit_to_user_mode+0x18/0x40 >Sasha> kernel: ? do_syscall_64+0x69/0x80 >Sasha> kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae >Sasha> kernel: RIP: 0033:0x7f4a15fa722b >Sasha> kernel: ... ... >Sasha> kernel: ---[ end trace 8afa7612f559c868 ]--- >Sasha> kernel: RIP: 0010:md_bitmap_create+0x1d1/0x850 [md_mod] > >Sasha> Reported-by: kernel test robot >Sasha> Reported-by: Dan Carpenter >Sasha> Acked-by: Guoqing Jiang >Sasha> Signed-off-by: Heming Zhao >Sasha> Signed-off-by: Song Liu >Sasha> Signed-off-by: Sasha Levin >Sasha> --- >Sasha> drivers/md/md-bitmap.c | 44 ++++++++++++++++++++++-------------------- >Sasha> 1 file changed, 23 insertions(+), 21 deletions(-) > >Sasha> diff --git a/drivers/md/md-bitmap.c b/drivers/md/md-bitmap.c >Sasha> index d7eef5292ae2..a95e20c3d0d4 100644 >Sasha> --- a/drivers/md/md-bitmap.c >Sasha> +++ b/drivers/md/md-bitmap.c >Sasha> @@ -642,14 +642,6 @@ static int md_bitmap_read_sb(struct bitmap *bitmap) >Sasha> daemon_sleep = le32_to_cpu(sb->daemon_sleep) * HZ; >Sasha> write_behind = le32_to_cpu(sb->write_behind); >Sasha> sectors_reserved = le32_to_cpu(sb->sectors_reserved); >Sasha> - /* Setup nodes/clustername only if bitmap version is >Sasha> - * cluster-compatible >Sasha> - */ >Sasha> - if (sb->version == cpu_to_le32(BITMAP_MAJOR_CLUSTERED)) { >Sasha> - nodes = le32_to_cpu(sb->nodes); >Sasha> - strlcpy(bitmap->mddev->bitmap_info.cluster_name, >Sasha> - sb->cluster_name, 64); >Sasha> - } > >Sasha> /* verify that the bitmap-specific fields are valid */ >Sasha> if (sb->magic != cpu_to_le32(BITMAP_MAGIC)) >Sasha> @@ -671,6 +663,16 @@ static int md_bitmap_read_sb(struct bitmap *bitmap) >Sasha> goto out; >Sasha> } > >Sasha> + /* >Sasha> + * Setup nodes/clustername only if bitmap version is >Sasha> + * cluster-compatible >Sasha> + */ >Sasha> + if (sb->version == cpu_to_le32(BITMAP_MAJOR_CLUSTERED)) { >Sasha> + nodes = le32_to_cpu(sb->nodes); >Sasha> + strlcpy(bitmap->mddev->bitmap_info.cluster_name, >Sasha> + sb->cluster_name, 64); >Sasha> + } >Sasha> + >Sasha> /* keep the array size field of the bitmap superblock up to date */ >sb-> sync_size = cpu_to_le64(bitmap->mddev->resync_max_sectors); > >Sasha> @@ -703,9 +705,9 @@ static int md_bitmap_read_sb(struct bitmap *bitmap) > >Sasha> out: >Sasha> kunmap_atomic(sb); >Sasha> - /* Assigning chunksize is required for "re_read" */ >Sasha> - bitmap->mddev->bitmap_info.chunksize = chunksize; >Sasha> if (err == 0 && nodes && (bitmap->cluster_slot < 0)) { >Sasha> + /* Assigning chunksize is required for "re_read" */ >Sasha> + bitmap->mddev->bitmap_info.chunksize = chunksize; >Sasha> err = md_setup_cluster(bitmap->mddev, nodes); >Sasha> if (err) { >Sasha> pr_warn("%s: Could not setup cluster service (%d)\n", >Sasha> @@ -716,18 +718,18 @@ static int md_bitmap_read_sb(struct bitmap *bitmap) >Sasha> goto re_read; >Sasha> } > >Sasha> - >Sasha> out_no_sb: >Sasha> - if (test_bit(BITMAP_STALE, &bitmap->flags)) >Sasha> - bitmap->events_cleared = bitmap->mddev->events; >Sasha> - bitmap->mddev->bitmap_info.chunksize = chunksize; >Sasha> - bitmap->mddev->bitmap_info.daemon_sleep = daemon_sleep; >Sasha> - bitmap->mddev->bitmap_info.max_write_behind = write_behind; >Sasha> - bitmap->mddev->bitmap_info.nodes = nodes; >Sasha> - if (bitmap->mddev->bitmap_info.space == 0 || >Sasha> - bitmap->mddev->bitmap_info.space > sectors_reserved) >Sasha> - bitmap->mddev->bitmap_info.space = sectors_reserved; >Sasha> - if (err) { >Sasha> + if (err == 0) { >Sasha> + if (test_bit(BITMAP_STALE, &bitmap->flags)) >Sasha> + bitmap->events_cleared = bitmap->mddev->events; >Sasha> + bitmap->mddev->bitmap_info.chunksize = chunksize; >Sasha> + bitmap->mddev->bitmap_info.daemon_sleep = daemon_sleep; >Sasha> + bitmap->mddev->bitmap_info.max_write_behind = write_behind; >Sasha> + bitmap->mddev->bitmap_info.nodes = nodes; >Sasha> + if (bitmap->mddev->bitmap_info.space == 0 || >Sasha> + bitmap->mddev->bitmap_info.space > sectors_reserved) >Sasha> + bitmap->mddev->bitmap_info.space = sectors_reserved; >Sasha> + } else { >Sasha> md_bitmap_print_sb(bitmap); >Sasha> if (bitmap->cluster_slot < 0) >Sasha> md_cluster_stop(bitmap->mddev); >Sasha> -- >Sasha> 2.35.1 > -- Thanks, Sasha