Received: by 10.223.176.5 with SMTP id f5csp3375390wra; Mon, 29 Jan 2018 12:18:43 -0800 (PST) X-Google-Smtp-Source: AH8x224FAhv9yK2U5ICAAZ2FvLb9SNyFzniorqQ2oU7+U2fnH+7dUzfOeLBedfiEEsa3Bsk6GE0U X-Received: by 10.99.114.93 with SMTP id c29mr22606223pgn.419.1517257123585; Mon, 29 Jan 2018 12:18:43 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517257123; cv=none; d=google.com; s=arc-20160816; b=McpQVrbnju7pUUfbClQatKMy6pkm7zS4W8YMYS6CrNGIAjn3UVlyoqpVreNVSWHI09 hRlw7A5zoFZXBAFow769pzk6rdSy21uitjUHdqGyK9pAYPiTqAMbrGmBy88bcSyl55vK Q8we5tc6jy6sW5xmwhBLDPlUBR2J5/VBphFaFm5CTN34p92EM+lFE06gMZc/0YLBbNJI aqzCMHHBeiOQ55XkEPQBD64vzvsftfqw9eqMmKjr6tu1JjoI9OFOrZbIokbErA9MVy5T hVC5sQbNX6MZiQhsoOnf8N8tidUq1OWA9LWq4YvPNWQwmQHGjn2zJ9I/WrcOeM/uH++j CZkQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :in-reply-to:message-id:date:subject:cc:to:from :arc-authentication-results; bh=CX0+HaVe5WExq6BO1YMvFhgu0EuNXE1IIvST29WEXoQ=; b=i8CrB2gtxC2IbRQhQGmvsegNUIxGLykjODxpBEoIcR1EI2D74J3M/cwtDH+YPjr7ef 3kKXv/w9dCwWCryclOegNpzeFLpFY6niltZZ45XvUSwTVSaL9ZGSVDSL5EwGxBm5wfOu aec40ipGIRhH+oWWR1Tj9VrGI+PlZ3TcW2Tfdj+DIPuZfVcZI0dfAuCdcpxn/o8Uuhb9 FFw19qX20IYbP2IisleDKYTLwXcnww0m+2Qzc6hx12ilhWvDlu5xkHZJAV+dKh3nBBCZ xffnpc+6P/SSPmi9DI0Hs/VeiV+juVqa+VzxeLPx73FfJ+r9FpjbkWRD4yOUf4PpgOBj OTsA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w10si232826pgc.243.2018.01.29.12.18.29; Mon, 29 Jan 2018 12:18:43 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932548AbeA2URg (ORCPT + 99 others); Mon, 29 Jan 2018 15:17:36 -0500 Received: from mail.linuxfoundation.org ([140.211.169.12]:46974 "EHLO mail.linuxfoundation.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S932540AbeA2URb (ORCPT ); Mon, 29 Jan 2018 15:17:31 -0500 Received: from localhost (LFbn-1-12258-90.w90-92.abo.wanadoo.fr [90.92.71.90]) by mail.linuxfoundation.org (Postfix) with ESMTPSA id 81E482EC5; Mon, 29 Jan 2018 13:00:05 +0000 (UTC) From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Monty Pavel , Joe Thornber , Mike Snitzer Subject: [PATCH 3.18 16/52] dm btree: fix serious bug in btree_split_beneath() Date: Mon, 29 Jan 2018 13:56:34 +0100 Message-Id: <20180129123628.903108090@linuxfoundation.org> X-Mailer: git-send-email 2.16.1 In-Reply-To: <20180129123628.168904217@linuxfoundation.org> References: <20180129123628.168904217@linuxfoundation.org> User-Agent: quilt/0.65 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 3.18-stable review patch. If anyone has any objections, please let me know. ------------------ From: Joe Thornber commit bc68d0a43560e950850fc69b58f0f8254b28f6d6 upstream. When inserting a new key/value pair into a btree we walk down the spine of btree nodes performing the following 2 operations: i) space for a new entry ii) adjusting the first key entry if the new key is lower than any in the node. If the _root_ node is full, the function btree_split_beneath() allocates 2 new nodes, and redistibutes the root nodes entries between them. The root node is left with 2 entries corresponding to the 2 new nodes. btree_split_beneath() then adjusts the spine to point to one of the two new children. This means the first key is never adjusted if the new key was lower, ie. operation (ii) gets missed out. This can result in the new key being 'lost' for a period; until another low valued key is inserted that will uncover it. This is a serious bug, and quite hard to make trigger in normal use. A reproducing test case ("thin create devices-in-reverse-order") is available as part of the thin-provision-tools project: https://github.com/jthornber/thin-provisioning-tools/blob/master/functional-tests/device-mapper/dm-tests.scm#L593 Fix the issue by changing btree_split_beneath() so it no longer adjusts the spine. Instead it unlocks both the new nodes, and lets the main loop in btree_insert_raw() relock the appropriate one and make any neccessary adjustments. Reported-by: Monty Pavel Signed-off-by: Joe Thornber Signed-off-by: Mike Snitzer Signed-off-by: Greg Kroah-Hartman --- drivers/md/persistent-data/dm-btree.c | 19 ++----------------- 1 file changed, 2 insertions(+), 17 deletions(-) --- a/drivers/md/persistent-data/dm-btree.c +++ b/drivers/md/persistent-data/dm-btree.c @@ -572,23 +572,8 @@ static int btree_split_beneath(struct sh pn->keys[1] = rn->keys[0]; memcpy_disk(value_ptr(pn, 1), &val, sizeof(__le64)); - /* - * rejig the spine. This is ugly, since it knows too - * much about the spine - */ - if (s->nodes[0] != new_parent) { - unlock_block(s->info, s->nodes[0]); - s->nodes[0] = new_parent; - } - if (key < le64_to_cpu(rn->keys[0])) { - unlock_block(s->info, right); - s->nodes[1] = left; - } else { - unlock_block(s->info, left); - s->nodes[1] = right; - } - s->count = 2; - + unlock_block(s->info, left); + unlock_block(s->info, right); return 0; }