Received: by 2002:ac0:946b:0:0:0:0:0 with SMTP id j40csp1551845imj; Sun, 17 Feb 2019 08:29:33 -0800 (PST) X-Google-Smtp-Source: AHgI3IYE8JAChPjhxGtuROSjVKNbD/ucMp5v0YU0GlUM3oz4hzRRL9s9/7bOsBqWatZDpmEsEN3X X-Received: by 2002:a17:902:bb89:: with SMTP id m9mr20523572pls.320.1550420973489; Sun, 17 Feb 2019 08:29:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1550420973; cv=none; d=google.com; s=arc-20160816; b=UxcHE5Upg+Ex5zkVwu7ohg0RwToVywlwCPOaHOw6xzXvU9W0IEWf5jqXb/nocBHIha Dc5gr3ECWefndv6KnNcKP2wUsO7b3pCVgpgkQD656VDWoHnJOFSHebvuuICJvyMBXvrl jxyueskk4VaR+iZPinSuaRl6jUhX5JJl/k7reDxva9Cj4urf8EJkYkHp0FjnLucXqiHY ZVelTQ42fzjoL/v3fDCjbNxl5bS1MJfLqJzxArnEMilyBCpkLg1AbGMFpPs4rPMS1eMw lZ769+vfTQozpBhSniQFsYHtzW1EFu2Xnde4R7DiyycuB60+Rypo6ntvz2hibj8xDcYW 1DdA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=lQwvVyKA58R51I7WCjQMjDGcbV9iwXrsl3If86ynv6Q=; b=L7MF2KHa/37QuyA0xtbPIP/QeED5pG3x+OCsVuGY3OF9tAFWtsQ2yXEo7RgPKeFC1B /rCjC6lF8Yf1T3nYJYTKy8053nMsddY/U/excC7c5VILLWH6cCFIs2oe0vZYYS0jm82b xLQmzr8BPohCIO+/YJdnO2D0uOkLDmaCojKEratse8I82Ar+bk0cJuk5xQnu4v48fueS EXrqZqnFbQh7dWpCWAJnuCdWtd1qlLD6AJTT+rcUKdZ0rjimjIWg0AnT5RUmukctU4HF q0dGCXcMxqhQZiNwEOLAjsNA2BHZnDVPUJf4u9Y9upAK/I1mrLufSKlle0J1zA9FJ7ay wEGQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id c11si8674743pfc.162.2019.02.17.08.29.17; Sun, 17 Feb 2019 08:29:33 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728184AbfBQPhe (ORCPT + 99 others); Sun, 17 Feb 2019 10:37:34 -0500 Received: from relay3-d.mail.gandi.net ([217.70.183.195]:57955 "EHLO relay3-d.mail.gandi.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1728110AbfBQPhd (ORCPT ); Sun, 17 Feb 2019 10:37:33 -0500 X-Originating-IP: 78.194.159.98 Received: from gandi.net (unknown [78.194.159.98]) (Authenticated sender: thibaut@sautereau.fr) by relay3-d.mail.gandi.net (Postfix) with ESMTPSA id 29E3660004; Sun, 17 Feb 2019 15:37:30 +0000 (UTC) Date: Sun, 17 Feb 2019 16:37:29 +0100 From: Thibaut Sautereau To: stable@vger.kernel.org Cc: Jianchao Wang , m19@florianstecker.de, Jens Axboe , linux-block@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] blk-mq: fix a hung issue when fsync Message-ID: <20190217153729.GA3835@gandi.net> References: <1548838916-25051-1-git-send-email-jianchao.w.wang@oracle.com> <319fffef-2fa8-afff-8f93-1ce8fd721581@kernel.dk> MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline In-Reply-To: <319fffef-2fa8-afff-8f93-1ce8fd721581@kernel.dk> User-Agent: Mutt/1.11.3 (2019-02-01) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jan 30, 2019 at 08:54:09AM -0700, Jens Axboe wrote: > On 1/30/19 2:01 AM, Jianchao Wang wrote: > > Florian reported a io hung issue when fsync(). It should be > > triggered by following race condition. > > > > data + post flush a flush > > > > blk_flush_complete_seq > > case REQ_FSEQ_DATA > > blk_flush_queue_rq > > issued to driver blk_mq_dispatch_rq_list > > try to issue a flush req > > failed due to NON-NCQ command > > .queue_rq return BLK_STS_DEV_RESOURCE > > > > request completion > > req->end_io // doesn't check RESTART > > mq_flush_data_end_io > > case REQ_FSEQ_POSTFLUSH > > blk_kick_flush > > do nothing because previous flush > > has not been completed > > blk_mq_run_hw_queue > > insert rq to hctx->dispatch > > due to RESTART is still set, do nothing > > > > To fix this, replace the blk_mq_run_hw_queue in mq_flush_data_end_io > > with blk_mq_sched_restart to check and clear the RESTART flag. > > Applied, thanks. > > -- > Jens Axboe Can this be applied to stable kernels please? It's commit 85bd6e61f34dffa8ec2dc75ff3c02ee7b2f1cbce upstream. Thanks, -- Thibaut