Received: by 2002:a25:31c3:0:0:0:0:0 with SMTP id x186csp6576145ybx; Mon, 11 Nov 2019 11:12:42 -0800 (PST) X-Google-Smtp-Source: APXvYqx+mNwDRKNJ4hpqimTl7TIToZK/89PDvrxofKUqGnoQx4cuKWVeE3OrW+1BeUqKIBpzgUj4 X-Received: by 2002:aa7:dc1a:: with SMTP id b26mr28306382edu.139.1573499562721; Mon, 11 Nov 2019 11:12:42 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1573499562; cv=none; d=google.com; s=arc-20160816; b=TPmGD6f1FwKfp/R5ZNTFlnCJ1W7XunaRHb6/LdyPeFK8SICGrsa+NUZDcK6Of5am56 yYBAv+aVcLmd7tsSIIExQxIB/sXquxA9+Jx3a9QKNQ8NOAy8jhhJekZfnZXWKdkK9UGT Aw8oMsJaLest+1VVNmvSVdpSxenhuaaWLH3kbd68R7BG9eY+SuAXA0P8WSlO5hpMwZFZ 5LjoIt/4p5TWTyf+geUqVs27++1LO2N2NurVqKLdNpzN6mjRNUEPz7Ygfq8qDcZ59je6 8vQMEcs0UaYUfn3s3D8TCfjFE75/x1B03CYAgfF56eP8+p3AkmNiX6D/OnA+gZvDvbh6 qKdg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :user-agent:references:in-reply-to:message-id:date:subject:cc:to :from:dkim-signature; bh=C0FlQuqCctI+q4iDnpHp+ILHnEA2mcONg4W5ZprwkGI=; b=f3MfM72SvNM+q/MArz09e4xi9zxWlJLt7WcxIDx2Y4xreXqkoeNNWKNIzCB3G9MF/G alWGlgrwRcHZeaXTp37UmabAMwwLetzTK1u/RMVbWHkWHQ4QRpE7hrLz/+o9kqjV76+2 RYYW2FWzk6iyYSlm3BzZ6Q2ywNwW3EJF7rprpvAfATF7OXbfiieZEo4niNUZB1PxVAi7 gA5HiVBOTqXzJiMiPWdt85G9GW3qiUFY4xJYYnxSq+CoRb4CJLCYyBI/yiWF9xRFzmww cMzONSOHt2hn5tGm2L/VS8l4TnTBwwJkKPP6LhdS3gAYoA9pgCasnPZAnySvimIWhQ3l gRiA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=cTFoLSfA; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id l4si10010432edr.286.2019.11.11.11.12.19; Mon, 11 Nov 2019 11:12:42 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=cTFoLSfA; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727113AbfKKTKd (ORCPT + 99 others); Mon, 11 Nov 2019 14:10:33 -0500 Received: from mail.kernel.org ([198.145.29.99]:48674 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1727786AbfKKSby (ORCPT ); Mon, 11 Nov 2019 13:31:54 -0500 Received: from localhost (83-86-89-107.cable.dynamic.v4.ziggo.nl [83.86.89.107]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 1DF57214E0; Mon, 11 Nov 2019 18:31:52 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1573497113; bh=MnjhQlZwaODzj+JadDJkhh9j+AWvo5XL4w5QwgNBMyU=; h=From:To:Cc:Subject:Date:In-Reply-To:References:From; b=cTFoLSfAxCJevBnHf6DN8idr93rN+ipzkB3n4UcpbW+H3hE6PHBaPNEefGydgeSDK Qg5tmJXIg1+JfMW1F4sbBSHKltYTnAR7zpzDYz12UCv/qd9IT6iTipV+Iwp+kR+ggP wfiHuHNcEbVoCD1EE6RoHSAftDdHVO9JM3rm+Cd4= From: Greg Kroah-Hartman To: linux-kernel@vger.kernel.org Cc: Greg Kroah-Hartman , stable@vger.kernel.org, Konstantin Khlebnikov , Dennis Zhou , Tejun Heo , Jens Axboe Subject: [PATCH 4.4 43/43] cgroup,writeback: dont switch wbs immediately on dead wbs if the memcg is dead Date: Mon, 11 Nov 2019 19:28:57 +0100 Message-Id: <20191111181329.189491027@linuxfoundation.org> X-Mailer: git-send-email 2.24.0 In-Reply-To: <20191111181246.772983347@linuxfoundation.org> References: <20191111181246.772983347@linuxfoundation.org> User-Agent: quilt/0.66 MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Tejun Heo commit 65de03e251382306a4575b1779c57c87889eee49 upstream. cgroup writeback tries to refresh the associated wb immediately if the current wb is dead. This is to avoid keeping issuing IOs on the stale wb after memcg - blkcg association has changed (ie. when blkcg got disabled / enabled higher up in the hierarchy). Unfortunately, the logic gets triggered spuriously on inodes which are associated with dead cgroups. When the logic is triggered on dead cgroups, the attempt fails only after doing quite a bit of work allocating and initializing a new wb. While c3aab9a0bd91 ("mm/filemap.c: don't initiate writeback if mapping has no dirty pages") alleviated the issue significantly as it now only triggers when the inode has dirty pages. However, the condition can still be triggered before the inode is switched to a different cgroup and the logic simply doesn't make sense. Skip the immediate switching if the associated memcg is dying. This is a simplified version of the following two patches: * https://lore.kernel.org/linux-mm/20190513183053.GA73423@dennisz-mbp/ * http://lkml.kernel.org/r/156355839560.2063.5265687291430814589.stgit@buzz Cc: Konstantin Khlebnikov Fixes: e8a7abf5a5bd ("writeback: disassociate inodes from dying bdi_writebacks") Acked-by: Dennis Zhou Signed-off-by: Tejun Heo Signed-off-by: Jens Axboe Signed-off-by: Greg Kroah-Hartman --- fs/fs-writeback.c | 9 ++++++--- 1 file changed, 6 insertions(+), 3 deletions(-) --- a/fs/fs-writeback.c +++ b/fs/fs-writeback.c @@ -582,10 +582,13 @@ void wbc_attach_and_unlock_inode(struct spin_unlock(&inode->i_lock); /* - * A dying wb indicates that the memcg-blkcg mapping has changed - * and a new wb is already serving the memcg. Switch immediately. + * A dying wb indicates that either the blkcg associated with the + * memcg changed or the associated memcg is dying. In the first + * case, a replacement wb should already be available and we should + * refresh the wb immediately. In the second case, trying to + * refresh will keep failing. */ - if (unlikely(wb_dying(wbc->wb))) + if (unlikely(wb_dying(wbc->wb) && !css_is_dying(wbc->wb->memcg_css))) inode_switch_wbs(inode, wbc->wb_id); }