Received: by 2002:a05:7412:419a:b0:f3:1519:9f41 with SMTP id i26csp935173rdh; Fri, 24 Nov 2023 01:13:29 -0800 (PST) X-Google-Smtp-Source: AGHT+IGOjdfBJUqAF0zCDK/ossBumA4gDxLs/vRfqIGDLOr2bDP6fTkKqCBo4+bt7bzBj7RspfHG X-Received: by 2002:a05:6a00:1c8f:b0:68f:cbd3:5b01 with SMTP id y15-20020a056a001c8f00b0068fcbd35b01mr7907869pfw.13.1700817209584; Fri, 24 Nov 2023 01:13:29 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1700817209; cv=none; d=google.com; s=arc-20160816; b=rpQe15Tedt8Dg3RKIAcYEqicDcITAxFGDsWSVMyzZRaZHOaYLoCad1tPQKw5TUPzYM sKMTsaEiCbH+FiMaMNm5tBmEWyb1w9xjWc1+wMk36nsp979IDM1y+l4lnH/TtNx7+ue+ Jg3krkw3wE+kQ/5iAkEJh7j8TlrrUKedFqfuUAl7vgHU94/HsFh8nPt7teKKs0zX0pko LqOfyiOjZFZMMM1mQXD9gSozWkrfTiAllXxbeJE68LOPxQ85/VV4gCHjOMJYjM+/XOc8 Rp3uKZnWeKTnrYkiTSTBrYTsWCTFcpJoeYdHAsYfjNyckl5Ph5869At/ZqhBfavYBhJX x82Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject; bh=4H3/VjhDboVrr3BXHpL2ELbUErCVlmUzRQpAXDyJVrE=; fh=y757WKqhp7YLoLu5MeP1hGH0XkyEMry3I2kvtdXeVZY=; b=k7hYWdoyzAWkUBdKYFLL0fEtgUfcMpqNpvRaHyIq5DjNyljf9O6iHmI5jAxUsfgTZz 5IHAw+Zs2gvg+GQI1T1HEjTau/Q2nGWfGU4teLwKthwaT6aiAawxyJnXxed2wb78zh6L TL/kdVkYkvyNS8PG2h53BHB5LVdnvvb9kW2OrevDkBrZIn5OxWJFflDSsgJBscURX6kF EBYy6UJ2RIEolwxMNaIs2SPJDf48hkL8aUi7FGgUMCFhfI+STgxfQFXZFlmH327i+Hbg k+N8oyNPM+Yq1B73TRN+GM9HXjGJelgLMKXV6jHxykM++ksHNTruyITVKQ+QSc7FmTG7 22tg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from howler.vger.email (howler.vger.email. [23.128.96.34]) by mx.google.com with ESMTPS id y17-20020a056a00191100b006cbef428b5bsi2281810pfi.288.2023.11.24.01.13.29 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 24 Nov 2023 01:13:29 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) client-ip=23.128.96.34; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id 9227A804BB72; Fri, 24 Nov 2023 01:13:26 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232837AbjKXJNC (ORCPT + 99 others); Fri, 24 Nov 2023 04:13:02 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38352 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230317AbjKXJNA (ORCPT ); Fri, 24 Nov 2023 04:13:00 -0500 Received: from dggsgout11.his.huawei.com (dggsgout11.his.huawei.com [45.249.212.51]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 05EA3D68; Fri, 24 Nov 2023 01:13:07 -0800 (PST) Received: from mail.maildlp.com (unknown [172.19.93.142]) by dggsgout11.his.huawei.com (SkyGuard) with ESMTP id 4Sc8Qh0zm3z4f3m6w; Fri, 24 Nov 2023 17:13:00 +0800 (CST) Received: from mail02.huawei.com (unknown [10.116.40.112]) by mail.maildlp.com (Postfix) with ESMTP id 73ABF1A0516; Fri, 24 Nov 2023 17:13:04 +0800 (CST) Received: from [10.174.176.73] (unknown [10.174.176.73]) by APP1 (Coremail) with SMTP id cCh0CgA3iA4faWBl+w0vBw--.5572S3; Fri, 24 Nov 2023 17:13:04 +0800 (CST) Subject: Re: [PATCH -next v2 2/6] md: remove flag RemoveSynchronized To: Song Liu , Yu Kuai Cc: linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org, yi.zhang@huawei.com, yangerkun@huawei.com, "yukuai (C)" References: <20231021102059.3198284-1-yukuai1@huaweicloud.com> <20231021102059.3198284-3-yukuai1@huaweicloud.com> From: Yu Kuai Message-ID: Date: Fri, 24 Nov 2023 17:13:03 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-CM-TRANSID: cCh0CgA3iA4faWBl+w0vBw--.5572S3 X-Coremail-Antispam: 1UD129KBjvJXoW3Gw1DJFy8Cr47Xry8GFyrCrg_yoWxJw45p3 93XFy3Cr4UXr4jkr4DXrWUCFyrXw17KrW8Cr93ua4Svas2vryDW34rJFW5Wr98ZFZ3Za10 qF1UJwsrGF1xKFJanT9S1TB71UUUUUUqnTZGkaVYY2UrUUUUjbIjqfuFe4nvWSU5nxnvy2 9KBjDU0xBIdaVrnRJUUUkG14x267AKxVW8JVW5JwAFc2x0x2IEx4CE42xK8VAvwI8IcIk0 rVWrJVCq3wAFIxvE14AKwVWUJVWUGwA2ocxC64kIII0Yj41l84x0c7CEw4AK67xGY2AK02 1l84ACjcxK6xIIjxv20xvE14v26w1j6s0DM28EF7xvwVC0I7IYx2IY6xkF7I0E14v26r4U JVWxJr1l84ACjcxK6I8E87Iv67AKxVW0oVCq3wA2z4x0Y4vEx4A2jsIEc7CjxVAFwI0_Gc CE3s1le2I262IYc4CY6c8Ij28IcVAaY2xG8wAqx4xG64xvF2IEw4CE5I8CrVC2j2WlYx0E 2Ix0cI8IcVAFwI0_JrI_JrylYx0Ex4A2jsIE14v26r1j6r4UMcvjeVCFs4IE7xkEbVWUJV W8JwACjcxG0xvEwIxGrwACjI8F5VA0II8E6IAqYI8I648v4I1lc7I2V7IY0VAS07AlzVAY IcxG8wCF04k20xvY0x0EwIxGrwCFx2IqxVCFs4IE7xkEbVWUJVW8JwC20s026c02F40E14 v26r1j6r18MI8I3I0E7480Y4vE14v26r106r1rMI8E67AF67kF1VAFwI0_JF0_Jw1lIxkG c2Ij64vIr41lIxAIcVC0I7IYx2IY67AKxVWUJVWUCwCI42IY6xIIjxv20xvEc7CjxVAFwI 0_Gr0_Cr1lIxAIcVCF04k26cxKx2IYs7xG6r4j6FyUMIIF0xvEx4A2jsIE14v26r1j6r4U MIIF0xvEx4A2jsIEc7CjxVAFwI0_Gr0_Gr1UYxBIdaVFxhVjvjDU0xZFpf9x0JUZa9-UUU UU= X-CM-SenderInfo: 51xn3trlr6x35dzhxuhorxvhhfrp/ X-Spam-Status: No, score=-3.7 required=5.0 tests=HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,SPF_HELO_NONE,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on howler.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Fri, 24 Nov 2023 01:13:26 -0800 (PST) Hi, 在 2023/11/24 16:28, Song Liu 写道: > On Fri, Oct 20, 2023 at 7:25 PM Yu Kuai wrote: >> >> From: Yu Kuai >> >> rcu is not used correctly here, because synchronize_rcu() is called >> before replacing old value, for example: >> >> remove_and_add_spares // other path >> synchronize_rcu >> // called before replacing old value >> set_bit(RemoveSynchronized) >> rcu_read_lock() >> rdev = conf->mirros[].rdev >> pers->hot_remove_disk >> conf->mirros[].rdev = NULL; >> if (!test_bit(RemoveSynchronized)) >> synchronize_rcu >> /* >> * won't be called, and won't wait >> * for concurrent readers to be done. >> */ >> // access rdev after remove_and_add_spares() >> rcu_read_unlock() >> >> Fortunately, there is a separate rcu protection to prevent such rdev >> to be freed: >> >> md_kick_rdev_from_array //other path >> rcu_read_lock() >> rdev = conf->mirros[].rdev >> list_del_rcu(&rdev->same_set) >> >> rcu_read_unlock() >> /* >> * rdev can be removed from conf, but >> * rdev won't be freed. >> */ >> synchronize_rcu() >> free rdev >> >> Hence remove this useless flag and prepare to remove rcu protection to >> access rdev from 'conf'. >> >> Signed-off-by: Yu Kuai > > RemoveSynchronized is not removed from md.h. Yes, thanks for the notice. Kuai > > >> --- >> drivers/md/md-multipath.c | 9 --------- >> drivers/md/md.c | 37 ++++++------------------------------- >> drivers/md/raid1.c | 9 --------- >> drivers/md/raid10.c | 9 --------- >> drivers/md/raid5.c | 9 --------- >> 5 files changed, 6 insertions(+), 67 deletions(-) >> >> diff --git a/drivers/md/md-multipath.c b/drivers/md/md-multipath.c >> index d22276870283..aa77133f3188 100644 >> --- a/drivers/md/md-multipath.c >> +++ b/drivers/md/md-multipath.c >> @@ -258,15 +258,6 @@ static int multipath_remove_disk(struct mddev *mddev, struct md_rdev *rdev) >> goto abort; >> } >> p->rdev = NULL; >> - if (!test_bit(RemoveSynchronized, &rdev->flags)) { >> - synchronize_rcu(); >> - if (atomic_read(&rdev->nr_pending)) { >> - /* lost the race, try later */ >> - err = -EBUSY; >> - p->rdev = rdev; >> - goto abort; >> - } >> - } >> err = md_integrity_register(mddev); >> } >> abort: >> diff --git a/drivers/md/md.c b/drivers/md/md.c >> index 09686d8db983..68f3bb6e89cb 100644 >> --- a/drivers/md/md.c >> +++ b/drivers/md/md.c >> @@ -9250,44 +9250,19 @@ static int remove_and_add_spares(struct mddev *mddev, >> struct md_rdev *rdev; >> int spares = 0; >> int removed = 0; >> - bool remove_some = false; >> >> if (this && test_bit(MD_RECOVERY_RUNNING, &mddev->recovery)) >> /* Mustn't remove devices when resync thread is running */ >> return 0; >> >> rdev_for_each(rdev, mddev) { >> - if ((this == NULL || rdev == this) && >> - rdev->raid_disk >= 0 && >> - !test_bit(Blocked, &rdev->flags) && >> - test_bit(Faulty, &rdev->flags) && >> - atomic_read(&rdev->nr_pending)==0) { >> - /* Faulty non-Blocked devices with nr_pending == 0 >> - * never get nr_pending incremented, >> - * never get Faulty cleared, and never get Blocked set. >> - * So we can synchronize_rcu now rather than once per device >> - */ >> - remove_some = true; >> - set_bit(RemoveSynchronized, &rdev->flags); >> - } >> - } >> - >> - if (remove_some) >> - synchronize_rcu(); >> - rdev_for_each(rdev, mddev) { >> - if ((this == NULL || rdev == this) && >> - (test_bit(RemoveSynchronized, &rdev->flags) || >> - rdev_removeable(rdev))) { >> - if (mddev->pers->hot_remove_disk( >> - mddev, rdev) == 0) { >> - sysfs_unlink_rdev(mddev, rdev); >> - rdev->saved_raid_disk = rdev->raid_disk; >> - rdev->raid_disk = -1; >> - removed++; >> - } >> + if ((this == NULL || rdev == this) && rdev_removeable(rdev) && >> + !mddev->pers->hot_remove_disk(mddev, rdev)) { >> + sysfs_unlink_rdev(mddev, rdev); >> + rdev->saved_raid_disk = rdev->raid_disk; >> + rdev->raid_disk = -1; >> + removed++; >> } >> - if (remove_some && test_bit(RemoveSynchronized, &rdev->flags)) >> - clear_bit(RemoveSynchronized, &rdev->flags); >> } >> >> if (removed && mddev->kobj.sd) >> diff --git a/drivers/md/raid1.c b/drivers/md/raid1.c >> index c13088eae401..4348d670439d 100644 >> --- a/drivers/md/raid1.c >> +++ b/drivers/md/raid1.c >> @@ -1836,15 +1836,6 @@ static int raid1_remove_disk(struct mddev *mddev, struct md_rdev *rdev) >> goto abort; >> } >> p->rdev = NULL; >> - if (!test_bit(RemoveSynchronized, &rdev->flags)) { >> - synchronize_rcu(); >> - if (atomic_read(&rdev->nr_pending)) { >> - /* lost the race, try later */ >> - err = -EBUSY; >> - p->rdev = rdev; >> - goto abort; >> - } >> - } >> if (conf->mirrors[conf->raid_disks + number].rdev) { >> /* We just removed a device that is being replaced. >> * Move down the replacement. We drain all IO before >> diff --git a/drivers/md/raid10.c b/drivers/md/raid10.c >> index 4b5f34f320c8..33ab00323cae 100644 >> --- a/drivers/md/raid10.c >> +++ b/drivers/md/raid10.c >> @@ -2219,15 +2219,6 @@ static int raid10_remove_disk(struct mddev *mddev, struct md_rdev *rdev) >> goto abort; >> } >> *rdevp = NULL; >> - if (!test_bit(RemoveSynchronized, &rdev->flags)) { >> - synchronize_rcu(); >> - if (atomic_read(&rdev->nr_pending)) { >> - /* lost the race, try later */ >> - err = -EBUSY; >> - *rdevp = rdev; >> - goto abort; >> - } >> - } >> if (p->replacement) { >> /* We must have just cleared 'rdev' */ >> p->rdev = p->replacement; >> diff --git a/drivers/md/raid5.c b/drivers/md/raid5.c >> index 27a4dce51c92..a80be51b4825 100644 >> --- a/drivers/md/raid5.c >> +++ b/drivers/md/raid5.c >> @@ -8202,15 +8202,6 @@ static int raid5_remove_disk(struct mddev *mddev, struct md_rdev *rdev) >> goto abort; >> } >> *rdevp = NULL; >> - if (!test_bit(RemoveSynchronized, &rdev->flags)) { >> - lockdep_assert_held(&mddev->reconfig_mutex); >> - synchronize_rcu(); >> - if (atomic_read(&rdev->nr_pending)) { >> - /* lost the race, try later */ >> - err = -EBUSY; >> - rcu_assign_pointer(*rdevp, rdev); >> - } >> - } >> if (!err) { >> err = log_modify(conf, rdev, false); >> if (err) >> -- >> 2.39.2 >> > > . >