From: Greg Kroah-Hartman
To:
 linux-kernel@vger.kernel.org
Cc: Greg Kroah-Hartman, stable@vger.kernel.org, Lars Ellenberg,
 Christoph Böhmwalder, Jens Axboe
Subject: [PATCH 4.14 036/284] drbd: fix potential silent data corruption
Date: Mon, 18 Apr 2022 14:10:17 +0200
Message-Id: <20220418121211.721427897@linuxfoundation.org>
In-Reply-To: <20220418121210.689577360@linuxfoundation.org>
References: <20220418121210.689577360@linuxfoundation.org>

From: Lars Ellenberg

commit f4329d1f848ac35757d9cc5487669d19dfc5979c upstream.

Scenario:
---------

bio chain generated by blk_queue_split().
Some split bio fails and propagates its error status to the "parent" bio.
But then the (last part of the) parent bio itself completes without error.

We would clobber the already recorded error status with BLK_STS_OK,
causing silent data corruption.

Reproducer:
-----------

How to trigger this in the real world within seconds:

DRBD on top of degraded parity raid,
small stripe_cache_size, large read_ahead setting.
Drop page cache (sysctl vm.drop_caches=1, fadvise "DONTNEED",
umount and mount again, "reboot").

Cause significant read ahead.

Large read ahead request is split by blk_queue_split().
Parts of the read ahead that are already in the stripe cache,
or find an available stripe cache to use, can be serviced.
Parts of the read ahead that would need "too much work",
would need to wait for a "stripe_head" to become available,
are rejected immediately.

For larger read ahead requests that are split in many pieces, it is very
likely that some "splits" will be serviced, but then the stripe cache is
exhausted/busy, and the remaining ones will be rejected.

Signed-off-by: Lars Ellenberg
Signed-off-by: Christoph Böhmwalder
Cc: # 4.13.x
Link: https://lore.kernel.org/r/20220330185551.3553196-1-christoph.boehmwalder@linbit.com
Signed-off-by: Jens Axboe
Signed-off-by: Greg Kroah-Hartman
---
 drivers/block/drbd/drbd_req.c |    3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

--- a/drivers/block/drbd/drbd_req.c
+++ b/drivers/block/drbd/drbd_req.c
@@ -207,7 +207,8 @@ void start_new_tl_epoch(struct drbd_conn
 void complete_master_bio(struct drbd_device *device,
 		struct bio_and_error *m)
 {
-	m->bio->bi_status = errno_to_blk_status(m->error);
+	if (unlikely(m->error))
+		m->bio->bi_status = errno_to_blk_status(m->error);
 	bio_endio(m->bio);
 	dec_ap_bio(device);
 }
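
The clobbering described under "Scenario" can be illustrated outside the
kernel. What follows is a minimal user-space sketch, not DRBD code, and it
makes several assumptions: struct sketch_bio, SKETCH_STS_OK,
SKETCH_STS_IOERR, complete_always(), complete_on_error_only() and
final_status() are hypothetical names invented for this illustration, and
the numeric status values are arbitrary. It only models a parent whose
status was already set by a failed split and which then completes its own
part without error.

```c
/* Hypothetical stand-ins for blk_status_t values; numbers are arbitrary. */
enum sketch_status { SKETCH_STS_OK = 0, SKETCH_STS_IOERR = 10 };

/* Hypothetical stand-in for the one struct bio field that matters here. */
struct sketch_bio {
	enum sketch_status status;
};

/* Models the old behaviour: the status is written unconditionally. */
static void complete_always(struct sketch_bio *bio, int error)
{
	bio->status = error ? SKETCH_STS_IOERR : SKETCH_STS_OK;
}

/* Models the patched behaviour: the status is written only on error. */
static void complete_on_error_only(struct sketch_bio *bio, int error)
{
	if (error)
		bio->status = SKETCH_STS_IOERR;
}

/*
 * A split has already failed and recorded its error in the parent.
 * The parent's own (last) part then completes with error == 0.
 * Returns the status the submitter would finally observe.
 */
static enum sketch_status final_status(int patched)
{
	struct sketch_bio parent = { SKETCH_STS_IOERR };

	if (patched)
		complete_on_error_only(&parent, 0);
	else
		complete_always(&parent, 0);

	return parent.status;
}
```

In this model final_status(0) yields SKETCH_STS_OK, so the earlier failure
is lost, while final_status(1) keeps SKETCH_STS_IOERR, which is the effect
the one-line change above aims for.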