Received: by 10.223.176.5 with SMTP id f5csp1287784wra; Fri, 2 Feb 2018 14:46:32 -0800 (PST) X-Google-Smtp-Source: AH8x226eTkFia3xSpAheLbcs69m8DWCDyjhM9A0Jr2PeUOrUQEn/qQm6Jvf+f8A0n+Dvp/6W5/us X-Received: by 2002:a17:902:6f08:: with SMTP id w8-v6mr36147663plk.155.1517611592511; Fri, 02 Feb 2018 14:46:32 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517611592; cv=none; d=google.com; s=arc-20160816; b=eHCMFK2zvLh6vp3emlrfFjwZV2FK57KsuhGKJS4Ycw/951hY6yA4FSABFOdKc+vnXY EB05fRXAX0FuNXQ5jtgz0U915Nw3tsB+0Auokah735Gmg4iVL27D28soQJoawMEuP/+G ZjYlyahStP4aCeb1yg2rrOH+euIeTCaQvcCwaW93vzjlxkCx/2P+x7RPT4OCMq3U55wi DGA6GVA9t+Wv6GXq56QSSfyOrbABFwRlIHFfAQiOwlvgK4LFRq9nrzuaUz+zb8uYWmr1 UIHXyStJMjSnyuENa6sKA/LSuSwlCZ3hKG2DxGtJwjDyUN4vXzmKIoIB/gmeZQz/l6k5 4mpw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:message-id:date:subject:to:from :arc-authentication-results; bh=Dy8eRz04y1qVH+Wsl7/ZoLwb7pbQ6IbJHFZoy0uZbUU=; b=B4sle4NJlZi94jd3CgOxuHnslPoflXlfZig2KD7DFrM+Ud5j0ux9nwNQC6zXZZnpPD XJyyhQgmXNAwBzG1fW9A6ZuEShLcscUODo0Dhi/+k77LWfl5Er1jK38VT+jXU2SGYB/b bkxOJOVYqeRCzE8LanScv46cHZQF+Bp/m0ZP6zAshn3wSJCWukY4etXdGI44B/b3VuKT uSFOaeuoYZTAgeDyLEN7lzfTH6wS42+v93NrsJQ9LWBtTS48Y19v6VHZHnsdQF4o09tj Em08qhek0XEbTpLQBW9kXjBXz16gBsXSndaYZZOPNLF5K22cHIVb4AX+U1zPtRvcMg5S e1FQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c41-v6si2565971plj.682.2018.02.02.14.46.18; Fri, 02 Feb 2018 14:46:32 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1752712AbeBBWNo (ORCPT + 99 others); Fri, 2 Feb 2018 17:13:44 -0500 Received: from mx1.redhat.com ([209.132.183.28]:51132 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752215AbeBBWNg (ORCPT ); Fri, 2 Feb 2018 17:13:36 -0500 Received: from smtp.corp.redhat.com (int-mx03.intmail.prod.int.phx2.redhat.com [10.5.11.13]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id B5EAC87624; Fri, 2 Feb 2018 22:13:36 +0000 (UTC) Received: from redhat.com.com (ovpn-117-145.ams2.redhat.com [10.36.117.145]) by smtp.corp.redhat.com (Postfix) with ESMTP id 961536766F; Fri, 2 Feb 2018 22:13:33 +0000 (UTC) From: Heinz Mauelshagen To: heinzm@redhat.com, dm-devel@redhat.com, linux-kernel@vger.kernel.org, linux-raid@vger.kernel.org, shli@kernel.org Subject: [PATCH] md: fix md_write_start() deadlock w/o metadata devices Date: Fri, 2 Feb 2018 23:13:19 +0100 Message-Id: <054eff295e0cd2df2b11a3f9ba3b3d66e89beb47.1517609290.git.heinzm@redhat.com> X-Scanned-By: MIMEDefang 2.79 on 10.5.11.13 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.26]); Fri, 02 Feb 2018 22:13:36 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org If no metadata devices are configured on raid1/4/5/6/10 (e.g. via dm-raid), md_write_start() unconditionally waits for superblocks to be written thus deadlocking. Fix introduces mddev->has_superblocks bool, defines it in md_run() and checks for it in md_write_start() to conditionally avoid waiting. Once on it, check for non-existing superblocks in md_super_write(). Link: https://bugzilla.kernel.org/show_bug.cgi?id=198647 Fixes: cc27b0c78c796 ("md: fix deadlock between mddev_suspend() and md_write_start()") Signed-off-by: Heinz Mauelshagen --- drivers/md/md.c | 10 ++++++++++ drivers/md/md.h | 2 ++ 2 files changed, 12 insertions(+) diff --git a/drivers/md/md.c b/drivers/md/md.c index 0081ace39a64..8a7e7034962c 100644 --- a/drivers/md/md.c +++ b/drivers/md/md.c @@ -801,6 +801,9 @@ void md_super_write(struct mddev *mddev, struct md_rdev *rdev, struct bio *bio; int ff = 0; + if (!page) + return; + if (test_bit(Faulty, &rdev->flags)) return; @@ -5452,6 +5455,7 @@ int md_run(struct mddev *mddev) * the only valid external interface is through the md * device. */ + mddev->has_superblocks = false; rdev_for_each(rdev, mddev) { if (test_bit(Faulty, &rdev->flags)) continue; @@ -5465,6 +5469,9 @@ int md_run(struct mddev *mddev) set_disk_ro(mddev->gendisk, 1); } + if (rdev->sb_page) + mddev->has_superblocks = true; + /* perform some consistency tests on the device. * We don't want the data to overlap the metadata, * Internal Bitmap issues have been handled elsewhere. @@ -8049,6 +8056,7 @@ EXPORT_SYMBOL(md_done_sync); bool md_write_start(struct mddev *mddev, struct bio *bi) { int did_change = 0; + if (bio_data_dir(bi) != WRITE) return true; @@ -8081,6 +8089,8 @@ bool md_write_start(struct mddev *mddev, struct bio *bi) rcu_read_unlock(); if (did_change) sysfs_notify_dirent_safe(mddev->sysfs_state); + if (!mddev->has_superblocks) + return true; wait_event(mddev->sb_wait, !test_bit(MD_SB_CHANGE_PENDING, &mddev->sb_flags) || mddev->suspended); diff --git a/drivers/md/md.h b/drivers/md/md.h index 58cd20a5e85e..fbc925cce810 100644 --- a/drivers/md/md.h +++ b/drivers/md/md.h @@ -468,6 +468,8 @@ struct mddev { void (*sync_super)(struct mddev *mddev, struct md_rdev *rdev); struct md_cluster_info *cluster_info; unsigned int good_device_nr; /* good device num within cluster raid */ + + bool has_superblocks:1; }; enum recovery_flags { -- 2.14.3