Received: by 2002:a25:ab43:0:0:0:0:0 with SMTP id u61csp1084390ybi; Fri, 24 May 2019 17:01:27 -0700 (PDT) X-Google-Smtp-Source: APXvYqxA41IlrWv69a+FcsnbQs7y9yuWRqthiGuwQQrs6sYhAaLwHd8n+GdZkGaPjwWomBYvGEVx X-Received: by 2002:a63:1344:: with SMTP id 4mr107515469pgt.448.1558742486926; Fri, 24 May 2019 17:01:26 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1558742486; cv=none; d=google.com; s=arc-20160816; b=zBX6thEhaClqiru3jy9bpdA1/82yJR/jEGPwWUVS4RKoZUlKK2qfFFeUxrcpSGS7xV K96jArp+bJFcBwz9N1wYU1zoAZjqCRkweRl5oRQ1peIyG+1fxVbUbPRkwhMXpWGKTknA tn7/6RJ3TCWhhBy27pNDH8TCa0i3+sruGq75DnVjW4Tg7/MTx3GgZoHi/hmXUF84zy7Z iVcJTujIEoGDXGGo2WGW8CmFApYlcHKiwLuDbJ1kAOdRDoU5uFWAwYaXiR5FLi/E+9cG Z9VeAHX8wwcOqe7JmAoZL1/T+MVPmHLxhFaoBYNfHU1huUqZkuBeB/X1OExywMl2mDTC n5cQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:in-reply-to :mime-version:user-agent:date:message-id:from:cc:references:to :subject:dkim-signature; bh=mYxaTaB4BfCfxzRuaJ3NuA5Kz4LPWPU+aIoam+b5CUo=; b=f8YsAquErpO7pjeXkvcSpXxWWZonHteBMnGVw+IOBwebSfZQSn1b+ePGef+RIbvKpk PuIz1JWEBxatuZ5H51oyow4y/srkEBMDeL4p0kmoxPgv0gEYTOzadbgf3dyqA6dQH8ao Lzrw7jwksV648sETzzAE08cc8pxf6mCp5LKCLt8GIxDodZj9ORL6G32WW+PE8AF+Pep5 wS5PZ8F1s/HSS9FROv4Iu0fZ+mjXePWDB+96gEaKwimiSsw1KWKSiNiUVxXLpWJB+hQh K4DtEIhCDfMvEA5O7iwnfe6vKQzcRB8RAYRvduSh38Dz/aDNaJdCXXsMxek2ZL6+LDJx BB4A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2018-07-02 header.b=p5gaDl7g; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id e67si6382135pgc.11.2019.05.24.17.01.10; Fri, 24 May 2019 17:01:26 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@oracle.com header.s=corp-2018-07-02 header.b=p5gaDl7g; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=oracle.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726114AbfEYAAB (ORCPT + 99 others); Fri, 24 May 2019 20:00:01 -0400 Received: from userp2130.oracle.com ([156.151.31.86]:50904 "EHLO userp2130.oracle.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726015AbfEYAAB (ORCPT ); Fri, 24 May 2019 20:00:01 -0400 Received: from pps.filterd (userp2130.oracle.com [127.0.0.1]) by userp2130.oracle.com (8.16.0.27/8.16.0.27) with SMTP id x4ONx1TC017589; Fri, 24 May 2019 23:59:01 GMT DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=oracle.com; h=subject : to : references : cc : from : message-id : date : mime-version : in-reply-to : content-type : content-transfer-encoding; s=corp-2018-07-02; bh=mYxaTaB4BfCfxzRuaJ3NuA5Kz4LPWPU+aIoam+b5CUo=; b=p5gaDl7gb886ZgRhCLD79+fT1q4JqhGzCBzrgiYQn8xn3AEDxQX1vfa22xQnKQXK+LAN OwSdZTJW80I9XA4yne1ROjx4FOUE8CnUYAhyWGR2V+lv+WV2AnX0b+OAoUSQRHKQOvj7 vWm5wx+mcFGdKV3IzpMdYzIxgj43zr6ULA/AylmLYEvplT1oqbPYDurG7auVX3/WyoAG nBNJwSb3kdSeSmZ2mnIv2MCu6uwDUWlyKJTEIBphLHdrwzAxAOgnAS6zqYgeoulZE77z /FsfHxcqMTB9oL/rcXrlykKesBweFZtribiglKXI8dmeJE8ANu2ILOnfyfstkULXy3Ab Lg== Received: from userp3030.oracle.com (userp3030.oracle.com [156.151.31.80]) by userp2130.oracle.com with ESMTP id 2smsk5uqrg-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 24 May 2019 23:59:00 +0000 Received: from pps.filterd (userp3030.oracle.com [127.0.0.1]) by userp3030.oracle.com (8.16.0.27/8.16.0.27) with SMTP id x4ONw3Je172558; Fri, 24 May 2019 23:59:00 GMT Received: from aserv0122.oracle.com (aserv0122.oracle.com [141.146.126.236]) by userp3030.oracle.com with ESMTP id 2smshg4dcu-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 24 May 2019 23:59:00 +0000 Received: from abhmp0015.oracle.com (abhmp0015.oracle.com [141.146.116.21]) by aserv0122.oracle.com (8.14.4/8.14.4) with ESMTP id x4ONwvQ9023032; Fri, 24 May 2019 23:58:57 GMT Received: from [10.191.9.152] (/10.191.9.152) by default (Oracle Beehive Gateway v4.0) with ESMTP ; Fri, 24 May 2019 23:58:57 +0000 Subject: Re: [5.2-rc1 regression]: nvme vs. hibernation To: Jiri Kosina References: <20190524154429.GE15192@localhost.localdomain> Cc: Keith Busch , Jens Axboe , Sagi Grimberg , linux-kernel@vger.kernel.org, linux-nvme@lists.infradead.org, Keith Busch , Hannes Reinecke , Christoph Hellwig From: Dongli Zhang Message-ID: <92a15981-dfdc-0ac9-72ee-920555a3c1a4@oracle.com> Date: Sat, 25 May 2019 07:58:49 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:45.0) Gecko/20100101 Thunderbird/45.3.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9267 signatures=668687 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 suspectscore=0 malwarescore=0 phishscore=0 bulkscore=0 spamscore=0 mlxscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1905240163 X-Proofpoint-Virus-Version: vendor=nai engine=6000 definitions=9267 signatures=668687 X-Proofpoint-Spam-Details: rule=notspam policy=default score=0 priorityscore=1501 malwarescore=0 suspectscore=0 phishscore=0 bulkscore=0 spamscore=0 clxscore=1011 lowpriorityscore=0 mlxscore=0 impostorscore=0 mlxlogscore=999 adultscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.0.1-1810050000 definitions=main-1905240163 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Jiri, Looks this has been discussed in the past. http://lists.infradead.org/pipermail/linux-nvme/2019-April/023234.html I created a fix for a case but not good enough. http://lists.infradead.org/pipermail/linux-nvme/2019-April/023277.html Perhaps people would have better solution. Dongli Zhang On 05/25/2019 06:27 AM, Jiri Kosina wrote: > On Fri, 24 May 2019, Keith Busch wrote: > >>> Something is broken in Linus' tree (4dde821e429) with respec to >>> hibernation on my thinkpad x270, and it seems to be nvme related. >>> >>> I reliably see the warning below during hibernation, and then sometimes >>> resume sort of works but the machine misbehaves here and there (seems like >>> lost IRQs), sometimes it never comes back from the hibernated state. >>> >>> I will not have too much have time to look into this over weekend, so I am >>> sending this out as-is in case anyone has immediate idea. Otherwise I'll >>> bisect it on monday (I don't even know at the moment what exactly was the >>> last version that worked reliably, I'll have to figure that out as well >>> later). >> >> I believe the warning call trace was introduced when we converted nvme to >> lock-less completions. On device shutdown, we'll check queues for any >> pending completions, and we temporarily disable the interrupts to make >> sure that queues interrupt handler can't run concurrently. > > Yeah, the completion changes were the primary reason why I brought this up > with all of you guys in CC. > >> On hibernation, most CPUs are offline, and the interrupt re-enabling >> is hitting this warning that says the IRQ is not associated with any >> online CPUs. >> >> I'm sure we can find a way to fix this warning, but I'm not sure that >> explains the rest of the symptoms you're describing though. > > It seems to be more or less reliable enough for bisect. I'll try that on > monday and will let you know. > > Thanks, >