Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp4798276imm; Mon, 15 Oct 2018 23:28:41 -0700 (PDT) X-Google-Smtp-Source: ACcGV60uTFyBYBnV5124Lq/zIe5Sndz6Iny0qBWSK+43xDbAjRAnCQbUDv7+cvIVxCMCfXl2dbHs X-Received: by 2002:a63:e442:: with SMTP id i2-v6mr19137094pgk.381.1539671321818; Mon, 15 Oct 2018 23:28:41 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539671321; cv=none; d=google.com; s=arc-20160816; b=F4AQIW7vQIHBhr9FDCjmaif/kIcjodVn22g+rqaocEFRepFMbd9dsxyzHWyiWbzIDY Y64Uh45B58/eEMlczMEE/kMg/+5tXoM3x6IA6MU2ZrjdCJ/4uTeCt40Wzd/Jmo1kA936 18173gWdFdcUkqgE6SdM0QhLQohOY5xnSHfYFoqzyrRcFkMAAIG2V36VWBVPuGBfo36S 7h0/+bSbBYNjmDju22oIXyKk6RdJVUGdaK/ZVlPS/xrxy544HF6MgL+UUcIs7kT8CzL2 Jj4vl150QioJmjGKHTIbTa6yhOd4GdZ82btpi7lJUChPE9oHtaPToUxQO1K79dQxNIUu H0Ew== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:autocrypt:openpgp:from:references:cc:to:subject; bh=EJYwqJrapyyblba5obXPu77srPM/yrFgfnhxeiVKJNI=; b=AkrX3iYtv1Go9Nm0+l0qimNViTaT7N+tJMpwwsSpyMnlQo2N0fcfUvUtsszqjhrt8C upek0ggQiqZt9YCyGeizDzRpfHxO4Xcaz8CqiQVebh/O0Nxo85eRPz/k7Q4QR0zli/8i P6hfztLT7cFzQIjZXLPO7sGUGolf2C0XvkoTbH6H/quxGiRMtfgLGD5CZWj7Wm+mAh4J xVW1WvOHJ1p95Ooj9uYafayAXubwrUlts04s+5mbyV45KOa+k9UBj3E2UvNqcXLMeSc7 SGgRpXnad5XzhV/7nMWTt7k+G3V7NvnhjevXE0uAxiqNLklcoZsEGLPS1VQVe3TCZh7x HGsQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id u1-v6si13242427pgq.1.2018.10.15.23.28.26; Mon, 15 Oct 2018 23:28:41 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727447AbeJPOQl (ORCPT + 99 others); Tue, 16 Oct 2018 10:16:41 -0400 Received: from mx2.suse.de ([195.135.220.15]:39994 "EHLO mx1.suse.de" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1727093AbeJPOQl (ORCPT ); Tue, 16 Oct 2018 10:16:41 -0400 X-Virus-Scanned: by amavisd-new at test-mx.suse.de Received: from relay1.suse.de (unknown [195.135.220.254]) by mx1.suse.de (Postfix) with ESMTP id 771C2AD83; Tue, 16 Oct 2018 06:27:46 +0000 (UTC) Subject: Re: [PATCH] bcache: add separate workqueue for journal_write to avoid deadlock To: Eddie Chapman , guoju Cc: kent.overstreet@gmail.com, linux-bcache@vger.kernel.org, linux-kernel@vger.kernel.org, s.priebe@profihost.ag References: <1538055919-5757-1-git-send-email-fangguoju@gmail.com> <9b1abe65-c6f5-77b4-290e-e6af21723cfc@ehuk.net> <17ea92fa-a07b-2391-aba3-377382c63d9b@ehuk.net> From: Coly Li Openpgp: preference=signencrypt Autocrypt: addr=colyli@suse.de; prefer-encrypt=mutual; keydata= xsFNBFYX6S8BEAC9VSamb2aiMTQREFXK4K/W7nGnAinca7MRuFUD4JqWMJ9FakNRd/E0v30F qvZ2YWpidPjaIxHwu3u9tmLKqS+2vnP0k7PRHXBYbtZEMpy3kCzseNfdrNqwJ54A430BHf2S GMVRVENiScsnh4SnaYjFVvB8SrlhTsgVEXEBBma5Ktgq9YSoy5miatWmZvHLFTQgFMabCz/P j5/xzykrF6yHo0rHZtwzQzF8rriOplAFCECp/t05+OeHHxjSqSI0P/G79Ll+AJYLRRm9til/ K6yz/1hX5xMToIkYrshDJDrUc8DjEpISQQPhG19PzaUf3vFpmnSVYprcWfJWsa2wZyyjRFkf J51S82WfclafNC6N7eRXedpRpG6udUAYOA1YdtlyQRZa84EJvMzW96iSL1Gf+ZGtRuM3k49H 1wiWOjlANiJYSIWyzJjxAd/7Xtiy/s3PRKL9u9y25ftMLFa1IljiDG+mdY7LyAGfvdtIkanr iBpX4gWXd7lNQFLDJMfShfu+CTMCdRzCAQ9hIHPmBeZDJxKq721CyBiGAhRxDN+TYiaG/UWT 7IB7LL4zJrIe/xQ8HhRO+2NvT89o0LxEFKBGg39yjTMIrjbl2ZxY488+56UV4FclubrG+t16 r2KrandM7P5RjR+cuHhkKseim50Qsw0B+Eu33Hjry7YCihmGswARAQABzRhDb2x5IExpIDxj b2x5bGlAc3VzZS5kZT7CwX8EEwEIACkFAlYX6ZACGyMFCQlmAYAHCwkIBwMCAQYVCAIJCgsE FgIDAQIeAQIXgAAKCRDHOQeTa334/CncD/9B97EIjcDOm0TS164bpMlsbZWEm8GQnV6nVzm8 QsywPRM8S8nqkqX1atTYl/fTdJsasH8mgryUqL0eHBPs5RmJhDk3YgYsTrzbOjMdsdRwv24W J5RXdulRag2XDPIhSP7rWsOSh66gljdAp8XQQZD0zFXi4IytoAuLtx8RMjzzKk1iP6uz8MIv em7iFu6NYcHd3cmvSPo7CnBVaG0dZ6P2p2gS7ydSWOGsWkNh/XM4ojJaX1ZdCeFR0XLS76Gi 6e01DoN2UsqZE/TQu1czYMMA1uM/Es6ZTYgobTrrnNB79ctqgtbBrjME5sOHLX40ccbBI3QB Ta4opSp8VqUMXw/yd5ckLPocnkJBTVxuaOfRhpxr6gWeudrkMetMj+39yeklskP7up0JvAUG 7/HjjqwWR7xAaZHmZORYsIxJ9ploBb8eSqHHx+7489ZDNLP+WCsAonpKTdJNAzGJClnLFxKS DY4cOPs7o4IFBk6dVXJWMqyLGwmMQ51Pq6BID4epaAuuBAL6x7n7NrFPuS68Fn/VaxqMEld9 L2eCi4cv++1AJyMF3iQKT56I8BjHEuf0wo1tmZ3BgBT19xRsEl7YItixxtYQm66Pb4lSQQmE Ep+uQNwaqPpeAU+vkDg/0Q+dhPTsvwx0OAI30HwhuzNA8OIfHBx7dJNm0b0fg5x0pg3LDM7B TQRWF+kvARAA2T/tnJeA0RWkmgZrNPFvP7JnOU9gjmIQKMoGZ+9awew45pdmXb6y0Y0fEG59 EP9i9oBlFXOt6SZ2645V0sdi3wBRNEpX2CCddWhXRfcO0b6lgckIwyaK92dH1rzxMaZTYDL8 aQ9FNEK1U+XSBk8fYWnXowpf7oNPS6+jD0J/muPqrGkVsIAkh2iLg5B98yNTCV4ql1xSlMyf xcseke9q6ojDxx9p38JjLusDlwF2+/rF42c+T6PRiYNjnBHPq6VLSlCRsnkLJwg8VHKiV2Qw Yvxp4TwnK2kLqokOxBlriX45Odb2iP61uG2ZAPchDwfawWJ4G8+3EMplLH8bk0/DkpYcYz95 eGSGRSiIQ2kHmTI/KbpgXxFVMoheilUn4HzUP+T6TEeP6Zhm0aqwABJYa0T2ykJwpBlg6/Mx vgIzdSheqx2hYACDu07WfhdvI6uK3i5Lq9DebUBcMMBcMc0TnXix7mYy+3hLXJzZ80pFx3My 5FeJEN/r6/+xpuuZkH51aYOiacKVa2w2EHjhZcWfPhhEWOQ2oOCoCmv+HEmV9sf+fipEMfcB 8GnJMOYAwrwHWfkPNZ5urUcRGAQYlQ0GWKju97LYE2cq5McpFG0CMvDyPoO1zAwjJz4g53EK oH/eikd3L8OMDfEK4AOsUaPMTnNgt1+40zEFMrQs/dDMldUAEQEAAcLBZQQYAQgADwUCVhfp LwIbDAUJCWYBgAAKCRDHOQeTa334/PtREACDN8W/pHeHyPW/mTt6MEe/GICG5YdlBW5ft7HY Cf6rTz+uLZolGc5SYKuJJ0JC/L2Ifh3BWmwLIOxV868KB3oEfmGszBY+4n/icLyIEAkkthBb 2V5sP5KgB3bOg7mSFBxfHi2pyO9K9d+Lr+UkORjCGyV33QFrcN+OQdPDactontnQglB7xm2K phGWqxoqepHCqFIulZ3yKGhQhmdpyz0J19Ry6GkxPE85MG/NC98D5+4Yn/V3G+yZpbGsuFhE CP26JvdXh1jNCUdU46pEjZwu0GXBIo6r1cb1v+swfYB86NeFUHWtvxamh8i6RBl1FLDhN6xb r9f7M++xoADyzPQYQPQUxWK+iG6lz3qVVq5312z/is3fcdyESPNs09DMT43xCCBr9UOMq6dZ IC9EsSeMYv4librfuSRqH4R0MuVbVWLJFg/Q7s+nbPb2YjhqIYr51hBDyXpzUDoIz43maIPk UmCNKa43mNFktMrwU21J5lVXEwBuTY6JlHOAl0Fgo28X+eTa8fx2Uiz9OVgWe03ebJGIGowe XTgqVWJMsKM1tmW+QFmgtczDGRYCZ6OQYpqt0SoTg1yx5MN4RzUtlLka2qLfPiOGUUN3qNJ5 nP+spvF+s+dHtLjjhy7AL86N01a6S0rwaClVVv0XTucvIntwccIx0CZfUKlfn5BWnB64Ig== Message-ID: Date: Tue, 16 Oct 2018 14:27:38 +0800 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.12; rv:52.0) Gecko/20100101 Thunderbird/52.9.1 MIME-Version: 1.0 In-Reply-To: <17ea92fa-a07b-2391-aba3-377382c63d9b@ehuk.net> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2018/10/4 下午10:07, Eddie Chapman wrote: > On 28/09/18 03:32, Coly Li wrote: >> >> On 9/27/18 11:53 PM, Eddie Chapman wrote: >>> On 27/09/18 16:23, Coly Li wrote: >>>> >>>> On 9/27/18 9:45 PM, guoju wrote: >>>>> After write SSD completed, bcache schedule journal_write work to >>>>> system_wq, that is a public workqueue in system, without >>>>> WQ_MEM_RECLAIM>>>> flag. system_wq is also a bound wq, and there >>>>> may be no idle kworker on >>>>> current processor. Creating a new kworker may unfortunately need to >>>>> reclaim memory first, by shrinking cache and slab used by vfs, which >>>>> depends on bcache device. That's a deadlock. >>>>> >>>>> This patch create a new workqueue for journal_write with >>>>> WQ_MEM_RECLAIM >>>>> flag. It's rescuer thread will work to avoid the deadlock. >>>>> >>>>> Signed-off-by: guoju >>>> >>>> Nice catch, this fix is quite important. I will try to submit to >>>> Jens ASAP. >>>> >>>> Thanks. >>>> >>>> Coly Li >>> >>> Once this goes into 4.19, would this be a candidate for backporting >>> to any stable kernels, or does it only fix something introduced in >>> this cycle? >>> >> This bug exists in upstream for quite long time, it should be applied >> to all stable kernels which it can be applied. And it is Cced to >> stable@vger.kernel.org already. >> >> Coly Li > > Thanks Coly! :-) > > Just to let you know, I applied this (and couple of other cherry picks) > to a couple of 4.14 boxes last night, so far so good, running without > issues. However, this one needed this recent commit upstream as a > pre-requisite: > > 16c1fdf4cfd6c0091e59b93ec2cb7e99973f8244 > bcache: do not assign in if condition in bcache_init() > > in order to be able to apply it. > > This is because the context of the second hunk for > drivers/md/bcache/super.c (in this journal_write workqueue patch) > contains code added by that commit > 16c1fdf4cfd6c0091e59b93ec2cb7e99973f8244. > > So I guess either 16c1fdf4cfd6c0091e59b93ec2cb7e99973f8244 also needs > tagging for stable, or perhaps a backport of this journal_write > workqueue will have to be created for earlier kernels, with different > context for that hunk? > > Eddie Hi Eddie, Yes I missed the patch dependency, thanks for the hint:-) Guoju Fang or I will take care of the back port to stable tree. Coly Li