From: Lars Ellenberg
To: Jens Axboe, linux-kernel@vger.kernel.org, linux-block@vger.kernel.org
Cc: drbd-dev@lists.linbit.com
Subject: [PATCH 11/17] drbd: avoid spurious self-outdating with concurrent disconnect / down
Date: Thu, 20 Dec 2018 17:23:38 +0100
Message-Id: <20181220162344.8430-12-lars.ellenberg@linbit.com>
X-Mailer: git-send-email 2.17.1
In-Reply-To: <20181220162344.8430-1-lars.ellenberg@linbit.com>
References: <20181220162344.8430-1-lars.ellenberg@linbit.com>
X-Mailing-List: linux-kernel@vger.kernel.org

If peers are "simultaneously" told to disconnect from each other,
either explicitly, or implicitly by taking down the resource,
then with bad timing one side may see its disconnect "fail" with
a result of "state change failed by peer", and interpret this as
"please outdate yourself".
Try to catch this by checking for current connection status,
and possibly retry as local-only state change instead.

Signed-off-by: Lars Ellenberg
---
 drivers/block/drbd/drbd_nl.c | 7 +++++++
 1 file changed, 7 insertions(+)

diff --git a/drivers/block/drbd/drbd_nl.c b/drivers/block/drbd/drbd_nl.c
index 1958eb33b643..82915880c5e9 100644
--- a/drivers/block/drbd/drbd_nl.c
+++ b/drivers/block/drbd/drbd_nl.c
@@ -2711,8 +2711,10 @@ int drbd_adm_connect(struct sk_buff *skb, struct genl_info *info)
 
 static enum drbd_state_rv conn_try_disconnect(struct drbd_connection *connection, bool force)
 {
+	enum drbd_conns cstate;
 	enum drbd_state_rv rv;
 
+repeat:
 	rv = conn_request_state(connection, NS(conn, C_DISCONNECTING),
 			force ? CS_HARD : 0);
 
@@ -2730,6 +2732,11 @@ static enum drbd_state_rv conn_try_disconnect(struct drbd_connection *connection
 
 		break;
 	case SS_CW_FAILED_BY_PEER:
+		spin_lock_irq(&connection->resource->req_lock);
+		cstate = connection->cstate;
+		spin_unlock_irq(&connection->resource->req_lock);
+		if (cstate <= C_WF_CONNECTION)
+			goto repeat;
 		/* The peer probably wants to see us outdated. */
 		rv = conn_request_state(connection, NS2(conn, C_DISCONNECTING,
 							disk, D_OUTDATED), 0);
-- 
2.17.1
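
For readers who want to see the race in isolation, the retry logic can be
modelled in user space. This is only a rough sketch under stated
assumptions, not DRBD code: struct mock_conn, the reduced enums, and
mock_request_disconnect() are hypothetical stand-ins invented for
illustration, and the real state machine has many more states and return
codes. The only things taken from the patch are the idea of re-reading the
connection state after a "failed by peer" result and retrying when that
state is at or below the "waiting for connection" level.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, heavily reduced stand-ins for the DRBD connection states.
 * Only the relative ordering matters for this sketch. */
enum conn_state { STANDALONE, DISCONNECTING, WF_CONNECTION, CONNECTED };
enum state_rv { RV_SUCCESS, RV_FAILED_BY_PEER };

struct mock_conn {
	enum conn_state cstate;
	bool peer_races;   /* peer disconnects concurrently (the race) */
	bool peer_refuses; /* peer stays connected and rejects the change */
	bool outdated;     /* set if we outdated our own disk */
	int attempts;      /* number of state change requests issued */
};

/* Mock of a cluster-wide "go to disconnecting" request. */
static enum state_rv mock_request_disconnect(struct mock_conn *c)
{
	c->attempts++;
	if (c->cstate == CONNECTED) {
		if (c->peer_races) {
			/* Peer tore the link down at the same moment:
			 * our request fails, and the connection is gone. */
			c->peer_races = false;
			c->cstate = WF_CONNECTION;
			return RV_FAILED_BY_PEER;
		}
		if (c->peer_refuses)
			return RV_FAILED_BY_PEER; /* still connected */
	}
	/* No peer involved (or peer agreed): the change succeeds. */
	c->cstate = STANDALONE;
	return RV_SUCCESS;
}

/* Mirrors the shape of the patched function: retry instead of outdating
 * when the connection has already dropped. */
static enum state_rv try_disconnect(struct mock_conn *c)
{
	enum state_rv rv;

	assert(c != NULL);
repeat:
	rv = mock_request_disconnect(c);
	if (rv == RV_FAILED_BY_PEER) {
		if (c->cstate <= WF_CONNECTION)
			goto repeat; /* spurious failure: retry locally */
		/* Peer is still there and refused: outdate ourselves. */
		c->outdated = true;
		c->cstate = STANDALONE;
		rv = RV_SUCCESS;
	}
	return rv;
}
```

In this model a connection with peer_races set is disconnected on the
second attempt without being outdated, while one with peer_refuses set is
outdated after a single attempt, which is the distinction the patch is
meant to draw.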