Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp10425321ybi; Wed, 24 Jul 2019 23:00:26 -0700 (PDT) X-Google-Smtp-Source: APXvYqyKsUktVqS5gQvdWkt6NzPjVbOuBMMs2M0Nv5w6NJ8mITp8mHgvhlh+FirfzyvV2LRalhwd X-Received: by 2002:a17:902:fe14:: with SMTP id g20mr84553265plj.54.1564034425944; Wed, 24 Jul 2019 23:00:25 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1564034425; cv=none; d=google.com; s=arc-20160816; b=wTB4ErXjX+OaWYWr4CqZxOhEMpoA1rEtf6tO32cvh4nyydY2VGD9Dr+3idnuWVsoiu +3rl6FMf18+F8trMCdMdVVInEMloLRVtx8Bo6ERG0cbsQXk9rJ4rzmxTz5CToniS1sr0 lIDhQHvf7Cmg74PvNwnBdnF+YGQGffMqeSephKAvfaycpgtB2fAIZrT4iIGoq7t0N74l l02WUuj/NRA6A+CAbRH/seiQ8Hr/W6TUP6JPUu2BT1nISala9TxjSv3KDRjLUi/3FaHc LiAAkOQ698Dy9I2f965eNpsRRK24cwo3RnV7aFUaPtOI1m93DEknjx8K8dtFJ6l0mV16 upUw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=tUv3upMUIDE/Fw3fqkx3Gha2q3GMZmMO7yzO0aJjkbU=; b=IgVQXxmlqUk1oxxSVVOccgHxCwl4Q11AqHHKpjRz5mQbFwXHs5HCewbMlBUljZgOu/ 6gI8ADs27jwjYiq/YlLeoyqisR4Y6a4FwKWbv8dF8SdPnLTnGvDv3UPYspTooVOb3xcy dO8E+ekxDUkpNfnTjayBJ/zc6lBAPGoSuvZQoOSkn2CkHfVuAnkE2bE/d98yzIDEjANp UOHVECRfWpRDHY0RTwvg0Rrt+deHIPMUdVZvCWxZmGDm+5oPENNgZ/07KlvrETS96raC 8V6gP4YK/vDbA4ek3jFXnVyzzsujAi18ZgL8tJGq4XK200WauRozaqypzZSuilkrxc8I 1MUQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="SvTX//mI"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id a17si18822062pff.195.2019.07.24.23.00.11; Wed, 24 Jul 2019 23:00:25 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b="SvTX//mI"; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2404814AbfGYFm4 (ORCPT + 99 others); Thu, 25 Jul 2019 01:42:56 -0400 Received: from mail.kernel.org ([198.145.29.99]:57738 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2390943AbfGYFmr (ORCPT ); Thu, 25 Jul 2019 01:42:47 -0400 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id BDF0E21880; Thu, 25 Jul 2019 05:42:45 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1564033366; bh=8frdZ2c/nwbuf5ayWbpcbZ1EZJwyPNhQLipVNts1tZ8=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=SvTX//mIXP3HXoieLVL+iRQuk96lZzS+/TTJ5cE+b0CYs3MRAGDXNV0XrLXhCYkhJ jgn4GNXtlmFGm/izgOPpsaR6441QnWk4f8rHgGJI2qsEi8s5fJY7U8YQ5PYeVFykKD kl4w9eab7WTLixhgRcXYjGzrfnXl4eo4vvvlN6zQ= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Coly Li , Tang Junhui , Jens Axboe Subject: [PATCH 4.19 184/271] bcache: Revert "bcache: fix high CPU occupancy during journal" Date: Wed, 24 Jul 2019 21:20:53 +0200 Message-Id: <20190724191710.904921845@linuxfoundation.org> X-Mailer: git-send-email 2.22.0 In-Reply-To: <20190724191655.268628197@linuxfoundation.org> References: <20190724191655.268628197@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Coly Li commit 249a5f6da57c28a903c75d81505d58ec8c10030d upstream. This reverts commit c4dc2497d50d9c6fb16aa0d07b6a14f3b2adb1e0. This patch enlarges a race between normal btree flush code path and flush_btree_write(), which causes deadlock when journal space is exhausted. Reverts this patch makes the race window from 128 btree nodes to only 1 btree nodes. Fixes: c4dc2497d50d ("bcache: fix high CPU occupancy during journal") Signed-off-by: Coly Li Cc: stable@vger.kernel.org Cc: Tang Junhui Signed-off-by: Jens Axboe Signed-off-by: Greg Kroah-Hartman --- drivers/md/bcache/bcache.h | 2 - drivers/md/bcache/journal.c | 47 ++++++++++++++------------------------------ drivers/md/bcache/util.h | 2 - 3 files changed, 15 insertions(+), 36 deletions(-) --- a/drivers/md/bcache/bcache.h +++ b/drivers/md/bcache/bcache.h @@ -708,8 +708,6 @@ struct cache_set { #define BUCKET_HASH_BITS 12 struct hlist_head bucket_hash[1 << BUCKET_HASH_BITS]; - - DECLARE_HEAP(struct btree *, flush_btree); }; struct bbio { --- a/drivers/md/bcache/journal.c +++ b/drivers/md/bcache/journal.c @@ -390,12 +390,6 @@ err: } /* Journalling */ -#define journal_max_cmp(l, r) \ - (fifo_idx(&c->journal.pin, btree_current_write(l)->journal) < \ - fifo_idx(&(c)->journal.pin, btree_current_write(r)->journal)) -#define journal_min_cmp(l, r) \ - (fifo_idx(&c->journal.pin, btree_current_write(l)->journal) > \ - fifo_idx(&(c)->journal.pin, btree_current_write(r)->journal)) static void btree_flush_write(struct cache_set *c) { @@ -403,35 +397,25 @@ static void btree_flush_write(struct cac * Try to find the btree node with that references the oldest journal * entry, best is our current candidate and is locked if non NULL: */ - struct btree *b; - int i; + struct btree *b, *best; + unsigned int i; atomic_long_inc(&c->flush_write); - retry: - spin_lock(&c->journal.lock); - if (heap_empty(&c->flush_btree)) { - for_each_cached_btree(b, c, i) - if (btree_current_write(b)->journal) { - if (!heap_full(&c->flush_btree)) - heap_add(&c->flush_btree, b, - journal_max_cmp); - else if (journal_max_cmp(b, - heap_peek(&c->flush_btree))) { - c->flush_btree.data[0] = b; - heap_sift(&c->flush_btree, 0, - journal_max_cmp); - } - } - - for (i = c->flush_btree.used / 2 - 1; i >= 0; --i) - heap_sift(&c->flush_btree, i, journal_min_cmp); - } + best = NULL; - b = NULL; - heap_pop(&c->flush_btree, b, journal_min_cmp); - spin_unlock(&c->journal.lock); + for_each_cached_btree(b, c, i) + if (btree_current_write(b)->journal) { + if (!best) + best = b; + else if (journal_pin_cmp(c, + btree_current_write(best)->journal, + btree_current_write(b)->journal)) { + best = b; + } + } + b = best; if (b) { mutex_lock(&b->write_lock); if (!btree_current_write(b)->journal) { @@ -873,8 +857,7 @@ int bch_journal_alloc(struct cache_set * j->w[0].c = c; j->w[1].c = c; - if (!(init_heap(&c->flush_btree, 128, GFP_KERNEL)) || - !(init_fifo(&j->pin, JOURNAL_PIN, GFP_KERNEL)) || + if (!(init_fifo(&j->pin, JOURNAL_PIN, GFP_KERNEL)) || !(j->w[0].data = (void *) __get_free_pages(GFP_KERNEL, JSET_BITS)) || !(j->w[1].data = (void *) __get_free_pages(GFP_KERNEL, JSET_BITS))) return -ENOMEM; --- a/drivers/md/bcache/util.h +++ b/drivers/md/bcache/util.h @@ -113,8 +113,6 @@ do { \ #define heap_full(h) ((h)->used == (h)->size) -#define heap_empty(h) ((h)->used == 0) - #define DECLARE_FIFO(type, name) \ struct { \ size_t front, back, size, mask; \