Received: by 2002:a5d:9c59:0:0:0:0:0 with SMTP id 25csp79882iof; Wed, 8 Jun 2022 15:46:24 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzMnMpmyDYBnK/K+cWZuE8RM4V0vCk3Fj312GWDzVFcNfDF52mqdCBT6ZBvl6BSN3ie3hDi X-Received: by 2002:aa7:c508:0:b0:42d:cc6b:df80 with SMTP id o8-20020aa7c508000000b0042dcc6bdf80mr41439879edq.393.1654728384212; Wed, 08 Jun 2022 15:46:24 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654728384; cv=none; d=google.com; s=arc-20160816; b=v6ehU5IEuuxeXoMUM1CROWRG6ij2arjZPCSVony6KQ77iTHWhXdI2KaYpBAmq0yoV8 5lf9OIqRMuAs0KSaxUruEsuNE1LSJvPJmMrGycoBfYoBdX0nUKA7Ywhsx/jw66HWZlfr kaXkYnareZ5VAHR0Z+6BSpOwvwJ/5vmPzXa9ABDLgDN1j1zGJlJEY+QIN95lxTKMlAVa ZnO2gy/ixmwEru0v0Ggp4lOpwohF05ph19YAWg3fPLUIuOo3zieYuRo3o2BmGTGU1JK8 n+UohCo9lZpRJ6NFZUMjPptrKioENkbPoidparcPWVMqP/DgNGHjxJs/iKqN7PQVxrmR nuMA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:subject:content-transfer-encoding:in-reply-to :from:references:cc:to:content-language:user-agent:mime-version:date :message-id:dkim-signature; bh=LufXSXEBmZFLcq4AsIpzM6cI1Bf+LCdEeFyP6I+AHkw=; b=O1w9Di8NCd79kcDB8c4tBWmTjK7gznUrkgiZINzjSkD+bhCeEp8EcJLOtAgDAwVCx9 Usft0CiQNfhhfT4tti4OJxv4FaesAJSb1A7oKmNu/QnTJnaXlhGM1WtiyJArnWYbbHdm k8e8RDi+I9Tsn57T8j3VSSethPbvOxj/cXAZStpeOYb1Vr5lX/0vo8QGAadeu+3FFpvw Td1WLdvNmwnskmaQiHwv9dhAo0oCMdeS2xPVVWZQuh9DJu76bOLrm0TaXklvQqU5/uCz B5kJEKcQNcZ1+bfcyJruWKtnu5YqnY9qyVbrBdnhleHvJrMu2gGZjx19yprcd/+oeidn uxpA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@deltatee.com header.s=20200525 header.b=ICdf9Du9; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=deltatee.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l23-20020a056402345700b0042ddc8cda2csi22395710edc.48.2022.06.08.15.45.57; Wed, 08 Jun 2022 15:46:24 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@deltatee.com header.s=20200525 header.b=ICdf9Du9; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=deltatee.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S233337AbiFHWHZ (ORCPT + 99 others); Wed, 8 Jun 2022 18:07:25 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37634 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230197AbiFHWHY (ORCPT ); Wed, 8 Jun 2022 18:07:24 -0400 Received: from ale.deltatee.com (ale.deltatee.com [204.191.154.188]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 3834837F04A; Wed, 8 Jun 2022 15:07:20 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; q=dns/txt; c=relaxed/relaxed; d=deltatee.com; s=20200525; h=Subject:In-Reply-To:From:References:Cc:To: MIME-Version:Date:Message-ID:content-disposition; bh=LufXSXEBmZFLcq4AsIpzM6cI1Bf+LCdEeFyP6I+AHkw=; b=ICdf9Du9tUtQzPKwgSsmL8vHLG YFRVGamfWS+YrCigjB+QuA8nU+dc7GXAz+sfR0wNDc6xRGbVbMFpnZyEpfxIcd83UOXSQ1b894DFH 31vgzKNwGlHeo4+lfedPT5QXoNQ9iviOXsREFHgtCc0QnvPjkyd794LTvwfnlmUzOclYeCawKo2pb /j+D8KbXZn1cLIWi/5VGnXRQCjnmvbXev1AgIy8FS3/pXOs8bf+HO3jM7bUvGa3+aB8+2vJ/gnHQ7 WW9DylHS/P/EtaWgkxJtksC0aCQAj4SA6FxvQU5PxQtbAoisUImUtnSdKjphaO0ZaMQ4xCK/D1RI/ 1BgWSGUg==; Received: from s0106ac1f6bb1ecac.cg.shawcable.net ([70.73.163.230] helo=[192.168.11.155]) by ale.deltatee.com with esmtpsa (TLS1.3) tls TLS_ECDHE_RSA_WITH_AES_128_GCM_SHA256 (Exim 4.94.2) (envelope-from ) id 1nz3pd-002CqB-CJ; Wed, 08 Jun 2022 16:07:19 -0600 Message-ID: <992ba80d-cc26-08bb-51eb-14f6cc6b126b@deltatee.com> Date: Wed, 8 Jun 2022 16:07:13 -0600 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.8.0 Content-Language: en-US To: Song Liu Cc: open list , linux-raid , Christoph Hellwig , Donald Buczek , Guoqing Jiang , Xiao Ni , Stephen Bates , Martin Oliveira , David Sloan , Christoph Hellwig References: <20220608162756.144600-1-logang@deltatee.com> <20220608162756.144600-5-logang@deltatee.com> <08b80caa-d726-b9f3-7ce0-f486b8080ec5@deltatee.com> From: Logan Gunthorpe In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-SA-Exim-Connect-IP: 70.73.163.230 X-SA-Exim-Rcpt-To: song@kernel.org, linux-kernel@vger.kernel.org, linux-raid@vger.kernel.org, hch@infradead.org, buczek@molgen.mpg.de, guoqing.jiang@linux.dev, xni@redhat.com, sbates@raithlin.com, Martin.Oliveira@eideticom.com, David.Sloan@eideticom.com, hch@lst.de X-SA-Exim-Mail-From: logang@deltatee.com X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net X-Spam-Level: X-Spam-Status: No, score=-3.3 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A,SPF_HELO_PASS, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 Subject: Re: [PATCH v4 04/11] md/raid5: Ensure array is suspended for calls to log_exit() X-SA-Exim-Version: 4.2.1 (built Sat, 13 Feb 2021 17:57:42 +0000) X-SA-Exim-Scanned: Yes (on ale.deltatee.com) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022-06-08 16:02, Song Liu wrote: > On Wed, Jun 8, 2022 at 11:21 AM Logan Gunthorpe wrote: >> >> >> >> On 2022-06-08 11:59, Song Liu wrote: >>> On Wed, Jun 8, 2022 at 9:28 AM Logan Gunthorpe wrote: >>>> >>>> The raid5-cache code relies on there being no IO in flight when >>>> log_exit() is called. There are two places where this is not >>>> guaranteed so add mddev_suspend() and mddev_resume() calls to these >>>> sites. >>>> >>>> The site in raid5_remove_disk() has a comment saying that it is >>>> called in raid5d and thus cannot wait for pending writes; however that >>>> does not appear to be correct anymore (if it ever was) as >>>> raid5_remove_disk() is called from hot_remove_disk() which only >>>> appears to be called in the md_ioctl(). Thus, the comment is removed, >>>> as well as the racy check and replaced with calls to suspend/resume. >>>> >>>> The site in raid5_change_consistency_policy() is in the error path, >>>> and another similar call site already has suspend/resume calls just >>>> below it; so it should be equally safe to make that change here. >>>> >>>> Signed-off-by: Logan Gunthorpe >>>> Reviewed-by: Christoph Hellwig >>>> --- >>>> drivers/md/raid5.c | 18 ++++++------------ >>>> 1 file changed, 6 insertions(+), 12 deletions(-) >>>> >>>> diff --git a/drivers/md/raid5.c b/drivers/md/raid5.c >>>> index 5d09256d7f81..3ad37dd4c5cd 100644 >>>> --- a/drivers/md/raid5.c >>>> +++ b/drivers/md/raid5.c >>>> @@ -7938,18 +7938,9 @@ static int raid5_remove_disk(struct mddev *mddev, struct md_rdev *rdev) >>>> >>>> print_raid5_conf(conf); >>>> if (test_bit(Journal, &rdev->flags) && conf->log) { >>>> - /* >>>> - * we can't wait pending write here, as this is called in >>>> - * raid5d, wait will deadlock. >>>> - * neilb: there is no locking about new writes here, >>>> - * so this cannot be safe. >>>> - */ >>>> - if (atomic_read(&conf->active_stripes) || >>>> - atomic_read(&conf->r5c_cached_full_stripes) || >>>> - atomic_read(&conf->r5c_cached_partial_stripes)) { >>>> - return -EBUSY; >>>> - } >>>> + mddev_suspend(mddev); >>> >>> Unfortunately, the comment about deadlock is still true, and we cannot call >>> mddev_suspend here. To trigger it: >> >> Ah, yes. What a tangle. I think we can just drop this patch. Now that we >> are removing RCU it isn't actually necessary to fix the bug I was >> seeing. It's still probably broken as the comment notes though. > > How about we keep the suspend/resume in raid5_change_consistency_policy()? > Like the one I just pushed to md-next: > > https://git.kernel.org/pub/scm/linux/kernel/git/song/md.git/commit/?h=md-next&id=ac1506992459fe45a085c1f93df74d51c381887b I'm good with that. Thanks. I'll do some testing on md-next shortly. Thanks, Logan