Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp1836856rwd; Thu, 25 May 2023 19:58:15 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ7sa8jq514ZbeEmA4vuyI8xHHfQXdw4P5WIpHm3bTLFEFE/27z3lqvOo3bn2Av9beFRFR8n X-Received: by 2002:a05:6a20:7348:b0:10f:96aa:9e4a with SMTP id v8-20020a056a20734800b0010f96aa9e4amr581308pzc.2.1685069895156; Thu, 25 May 2023 19:58:15 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1685069895; cv=none; d=google.com; s=arc-20160816; b=nBimT9D8dKmEH756TmnxBr7UBOb5Z+VA/BJvsnc9dA0ekuv9n4GvmtIPd64D1KGIEs XF39d2DfjXkv6nwIjOHpaYbXb9jHqyDsyCbnIMWUONINRkid4ldMGwBiwKzNRXr8Tvvi fKgtjOxDT8G/XzCrwNqdor9jEplApMKmmEAYvPqGCb/sFDSGnJJeV4ZyDRmLpmG4g+yc Hxs8/9DDBBz3RtHYiU9SaYLTM0SZYxHiv7Td0xe4kYbasQwO5jB2jBOdNrHojY/zndjd M+W/4t/6gOdDbJ4I8vhSMN4EJknSeemOkwrItideTem6ge+/w8zMtzduJavIEIVScZOf GCMg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:references:cc:to :subject; bh=+zWB4TUxFRVpUkKkRU/rghAgZVHj+CiOy72iNHa2lVo=; b=bwZKdIzq5alGHqYIotzK51xoiFSgveVnZ6sRa1LuTxD9I9A/Ox6J6j8T2JzZl1ImxX VIniYATJyphKKO4/gKXXeMv1skrgzgiTnB2rLmKZR5LZT+5zbGYE0DLK9lsQuRFV1r0a kKw/CvKSnyC6PTOpjNzfqiDFJf+xbBJpcFh6ZT9BHcSfkKfbj7ct90uv7yoLgAIkPGLs FuNYTskq8Ize71k68HH4ErQRKhs6idemsDIy8Sqslf6++4lPPz+/zEBKVx8iN3iEpVWh +7+f2vA+NfZj8k87tIYP6qP0Q3w9FV9NIWAaPp+s53m85eAxMvkQOG0HoeQHdUQXXSH2 oAjg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id t191-20020a6381c8000000b0053f25333ab7si2620281pgd.759.2023.05.25.19.58.01; Thu, 25 May 2023 19:58:15 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229885AbjEZCzq (ORCPT + 99 others); Thu, 25 May 2023 22:55:46 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:35150 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229631AbjEZCzp (ORCPT ); Thu, 25 May 2023 22:55:45 -0400 Received: from dggsgout11.his.huawei.com (dggsgout11.his.huawei.com [45.249.212.51]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 99559BB; Thu, 25 May 2023 19:55:43 -0700 (PDT) Received: from mail02.huawei.com (unknown [172.30.67.153]) by dggsgout11.his.huawei.com (SkyGuard) with ESMTP id 4QS8gH5Hmxz4f3pCR; Fri, 26 May 2023 10:55:39 +0800 (CST) Received: from [10.174.176.73] (unknown [10.174.176.73]) by APP4 (Coremail) with SMTP id gCh0CgD3X7OsH3BkvufoKA--.48588S3; Fri, 26 May 2023 10:55:40 +0800 (CST) Subject: Re: [PATCH 2/3] md/raid10: fix incorrect done of recovery To: Li Nan , Yu Kuai , song@kernel.org, shli@fb.com, allenpeng@synology.com, alexwu@synology.com, bingjingc@synology.com, neilb@suse.de Cc: linux-raid@vger.kernel.org, linux-kernel@vger.kernel.org, yi.zhang@huawei.com, houtao1@huawei.com, yangerkun@huawei.com, "yukuai (C)" References: <20230522115449.2203939-1-linan666@huaweicloud.com> <20230522115449.2203939-3-linan666@huaweicloud.com> <801a3a11-9a2c-dca2-cec4-4a9c71d3afb6@huaweicloud.com> <10e164cc-149f-baf6-de52-0b7d3c9468f6@huaweicloud.com> From: Yu Kuai Message-ID: <1398a108-90ab-3790-eb43-faeaacda2c99@huaweicloud.com> Date: Fri, 26 May 2023 10:55:39 +0800 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: <10e164cc-149f-baf6-de52-0b7d3c9468f6@huaweicloud.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 8bit X-CM-TRANSID: gCh0CgD3X7OsH3BkvufoKA--.48588S3 X-Coremail-Antispam: 1UD129KBjvJXoW7AFW5XF45ZF1xAr4UCFWfKrg_yoW8KrW3pw s7JFZIqryUG3s5Aw1jkryUAFyrt348t34UJr1xWa4fXFZIqryqgFy8Xr4vgFyDXr48tF1U tw1jqFW3uFy2yaDanT9S1TB71UUUUUUqnTZGkaVYY2UrUUUUjbIjqfuFe4nvWSU5nxnvy2 9KBjDU0xBIdaVrnRJUUU9214x267AKxVW8JVW5JwAFc2x0x2IEx4CE42xK8VAvwI8IcIk0 rVWrJVCq3wAFIxvE14AKwVWUJVWUGwA2ocxC64kIII0Yj41l84x0c7CEw4AK67xGY2AK02 1l84ACjcxK6xIIjxv20xvE14v26w1j6s0DM28EF7xvwVC0I7IYx2IY6xkF7I0E14v26r4U JVWxJr1l84ACjcxK6I8E87Iv67AKxVW0oVCq3wA2z4x0Y4vEx4A2jsIEc7CjxVAFwI0_Gc CE3s1le2I262IYc4CY6c8Ij28IcVAaY2xG8wAqx4xG64xvF2IEw4CE5I8CrVC2j2WlYx0E 2Ix0cI8IcVAFwI0_JrI_JrylYx0Ex4A2jsIE14v26r1j6r4UMcvjeVCFs4IE7xkEbVWUJV W8JwACjcxG0xvEwIxGrwACjI8F5VA0II8E6IAqYI8I648v4I1lFIxGxcIEc7CjxVA2Y2ka 0xkIwI1lc7I2V7IY0VAS07AlzVAYIcxG8wCF04k20xvY0x0EwIxGrwCFx2IqxVCFs4IE7x kEbVWUJVW8JwC20s026c02F40E14v26r1j6r18MI8I3I0E7480Y4vE14v26r106r1rMI8E 67AF67kF1VAFwI0_Jw0_GFylIxkGc2Ij64vIr41lIxAIcVC0I7IYx2IY67AKxVWUJVWUCw CI42IY6xIIjxv20xvEc7CjxVAFwI0_Gr0_Cr1lIxAIcVCF04k26cxKx2IYs7xG6rWUJVWr Zr1UMIIF0xvEx4A2jsIE14v26r1j6r4UMIIF0xvEx4A2jsIEc7CjxVAFwI0_Gr0_Gr1UYx BIdaVFxhVjvjDU0xZFpf9x0JUZa9-UUUUU= X-CM-SenderInfo: 51xn3trlr6x35dzhxuhorxvhhfrp/ X-CFilter-Loop: Reflected X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,NICE_REPLY_A, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi, 在 2023/05/25 22:00, Li Nan 写道: > > > 在 2023/5/22 21:54, Yu Kuai 写道: >> Hi, >> >> 在 2023/05/22 19:54, linan666@huaweicloud.com 写道: >>> From: Li Nan >>> >>> Recovery will go to giveup and let chunks_skipped++ in >>> raid10_sync_request() if there are some bad_blocks, and it will return >>> max_sector when chunks_skipped >= geo.raid_disks. Now, recovery fail and >>> data is inconsistent but user think recovery is done, it is wrong. >>> >>> Fix it by set mirror's recovery_disabled and spare device shouln't be >>> added to here. >>> >>> Signed-off-by: Li Nan >>> --- >>>   drivers/md/raid10.c | 16 +++++++++++++++- >>>   1 file changed, 15 insertions(+), 1 deletion(-) >>> >>> diff --git a/drivers/md/raid10.c b/drivers/md/raid10.c >>> index e21502c03b45..70cc87c7ee57 100644 >>> --- a/drivers/md/raid10.c >>> +++ b/drivers/md/raid10.c >>> @@ -3303,6 +3303,7 @@ static sector_t raid10_sync_request(struct >>> mddev *mddev, sector_t sector_nr, >>>       int chunks_skipped = 0; >>>       sector_t chunk_mask = conf->geo.chunk_mask; >>>       int page_idx = 0; >>> +    int error_disk = -1; >>>       /* >>>        * Allow skipping a full rebuild for incremental assembly >>> @@ -3386,7 +3387,18 @@ static sector_t raid10_sync_request(struct >>> mddev *mddev, sector_t sector_nr, >>>           return reshape_request(mddev, sector_nr, skipped); >>>       if (chunks_skipped >= conf->geo.raid_disks) { >>> -        /* if there has been nothing to do on any drive, >>> +        pr_err("md/raid10:%s: %s fail\n", mdname(mddev), >>> +            test_bit(MD_RECOVERY_SYNC, &mddev->recovery) ?  "resync" >>> : "recovery"); >> >> Line exceed 80 columns, and following. >>> +        if (error_disk >= 0 && !test_bit(MD_RECOVERY_SYNC, >>> &mddev->recovery)) { >> >> Resync has the same problem, right? >> > > Yes. But I have no idea to fix it. md_error disk nor set > recovery_disabled is a good solution. So, just print error message now. > Do you have any ideas? I'll look into this, in the meadtime, I don't suggest to apply this patch because this is just temporary solution that only fix half of the problem. Thanks, Kuai