Received: by 10.223.176.5 with SMTP id f5csp77551wra; Tue, 6 Feb 2018 17:34:12 -0800 (PST) X-Google-Smtp-Source: AH8x227XwM+O38mzz2p5gbw3ijj4KoKqaBMw/TJhhT1KHx1+lhz7L1ECFQTUZszzXPSVkz9u1HkK X-Received: by 10.101.86.137 with SMTP id v9mr3414785pgs.353.1517967252278; Tue, 06 Feb 2018 17:34:12 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1517967252; cv=none; d=google.com; s=arc-20160816; b=JIN+H2njWpXt1NG+xpBE21Bp6o5n/OAxBNFPeNsB3VJ+HpQIVXGMf6cbwQ+lcDFKtq +lIqou80Npot9GTwNjp16Wl7yU//ldb7+RDsDxmH7xgnfF9iCwUiHTnv5IDAJgLCroXF 3Upr6MuZhwhMxhwzWy1wsJC2JQcqJrtY2bE9NMaveyk63zEfH2C2+dOAMyJbrVvIoid+ +OOWegT5w16mJAYaEzwanQ80MXPYdfm117IPkJoPdX55LJwRgnqux+nZMqfbTIyqyGUP BPEWrwdIFhEvEw19KlfLmd9yTlfYrbpWXaDjAmZEU6keJyxw3Nqh1hNmed4PRRAFVppL Q4zA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=IKCfP7dJolImnG1lFy+qVaWFVMXvHgy+Vgm3E7WniHc=; b=EXAwXC/xdcvh9Xen/ip0BDENGp8JnKRWcp/a+IUlA+N4R3wiqltFkIpl5ZyDUrcyYC +8dBnCyae3ECwRzM+2EwoKQ5FDapOR0fQxrbZ9+e6kcE3OgZ6aaBkdZ7/mwra7EuZaAJ v6YO/2VSc9sPKYxlQR26gg4DEBkrfM4cbpML13EqDpWfFvwoHMEie286nndJT4Olfi8F tSr4GCBlCvXUHzykn9tuX+ZopMfxFxrSFV8dme91HY+pSUiH9+UyGHXYvBa/DKPrHjvl uBm3oqYzgnyBmSOHWF/L9W9F4kE35PeWaHkE5+gAlyyOlPRAysIvZofJan6AFxl/QdxO CRsA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 190si216411pgh.536.2018.02.06.17.33.57; Tue, 06 Feb 2018 17:34:12 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S932274AbeBGBcl (ORCPT + 99 others); Tue, 6 Feb 2018 20:32:41 -0500 Received: from mx3-rdu2.redhat.com ([66.187.233.73]:55950 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S932184AbeBGBck (ORCPT ); Tue, 6 Feb 2018 20:32:40 -0500 X-Greylist: delayed 466 seconds by postgrey-1.27 at vger.kernel.org; Tue, 06 Feb 2018 20:32:40 EST Received: from smtp.corp.redhat.com (int-mx05.intmail.prod.int.rdu2.redhat.com [10.11.54.5]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 3DCE6818B103; Wed, 7 Feb 2018 01:24:54 +0000 (UTC) Received: from ming.t460p (ovpn-12-20.pek2.redhat.com [10.72.12.20]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 1A5F4AB58E; Wed, 7 Feb 2018 01:24:41 +0000 (UTC) Date: Wed, 7 Feb 2018 09:24:37 +0800 From: Ming Lei To: wenxiong Cc: Keith Busch , wenxiong@vmlinux.vnet.ibm.com, linux-nvme@lists.infradead.org, axboe@fb.com, linux-kernel@vger.kernel.org, wenxiong@us.ibm.com Subject: Re: [PATCH]nvme-pci: Fixes EEH failure on ppc Message-ID: <20180207012353.GD13470@ming.t460p> References: <1517867380-18790-1-git-send-email-wenxiong@vmlinux.vnet.ibm.com> <20180206163347.GG31110@localhost.localdomain> <787e4960b62a03b3888c67e73d7e1ee2@linux.vnet.ibm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <787e4960b62a03b3888c67e73d7e1ee2@linux.vnet.ibm.com> User-Agent: Mutt/1.9.1 (2017-09-22) X-Scanned-By: MIMEDefang 2.79 on 10.11.54.5 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.8]); Wed, 07 Feb 2018 01:24:54 +0000 (UTC) X-Greylist: inspected by milter-greylist-4.5.16 (mx1.redhat.com [10.11.55.8]); Wed, 07 Feb 2018 01:24:54 +0000 (UTC) for IP:'10.11.54.5' DOMAIN:'int-mx05.intmail.prod.int.rdu2.redhat.com' HELO:'smtp.corp.redhat.com' FROM:'ming.lei@redhat.com' RCPT:'' Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Feb 06, 2018 at 02:01:05PM -0600, wenxiong wrote: > On 2018-02-06 10:33, Keith Busch wrote: > > On Mon, Feb 05, 2018 at 03:49:40PM -0600, wenxiong@vmlinux.vnet.ibm.com > > wrote: > > > @@ -1189,6 +1183,12 @@ static enum blk_eh_timer_return > > > nvme_timeout(struct request *req, bool reserved) > > > struct nvme_command cmd; > > > u32 csts = readl(dev->bar + NVME_REG_CSTS); > > > > > > + /* If PCI error recovery process is happening, we cannot reset or > > > + * the recovery mechanism will surely fail. > > > + */ > > > + if (pci_channel_offline(to_pci_dev(dev->dev))) > > > + return BLK_EH_HANDLED; > > > + > > > > This patch will tell the block layer to complete the request and > > consider > > it a success, but it doesn't look like the command actually completed at > > all. You're going to get data corruption this way, right? Is returning > > BLK_EH_HANDLED immediately really the right thing to do here? > > Hi Ming, > > Can you help checking if it is ok if returning BLK_EH_HANDLEDED in this > case? Hi Wenxiong, Looks Keith is correct, and this timed out request will be completed by block layer and NVMe driver if BLK_EH_HANDLED is returned, but this IO isn't completed actually, so either data loss(write) or read failure is caused. Maybe BLK_EH_RESET_TIMER is fine under this situation. Thanks, Ming