Received: by 10.192.165.148 with SMTP id m20csp4235075imm; Mon, 30 Apr 2018 14:28:04 -0700 (PDT) X-Google-Smtp-Source: AB8JxZpFJXuEfCuvswg2cMOGISLyPLhJ5uxVRMJc2W/ZCWqQQ/vPARbY6mcCFN1JkQTh5Dt5mZQ5 X-Received: by 2002:a17:902:59ce:: with SMTP id d14-v6mr14114989plj.253.1525123684298; Mon, 30 Apr 2018 14:28:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1525123684; cv=none; d=google.com; s=arc-20160816; b=BIYqHwpOZZOHQNps0MbR1fgpnwiRS5NKJ8JeegiZtMyN06/pGj8G0ZRpoIV9KDVYOg OPd0pUr7hne3GxfCyFU+RuqdrZw9Miky/1PuGvhKud1oMStxLgfZGty3Yex28zChPnxs YS7sH/ioLB10R1104RTXXbSKZ4li2pGA7hsxPW4DA5fHoVK0Ol/qIWuz8FQFbSKw1GI+ ekqZl99WYnrthMwp8KYBK7vPk+t3yIq2Qd85nVIGNBMZSJEFkvd8G3jcB13NK5BifKfO YPaqlQNG9ERuBXgzAcGitH0cb3jKr9w67VjWIaIAzLU/JWtXj0+TiNFKfz/FbwIWQhOL ha1w== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dmarc-filter :dkim-signature:dkim-signature:arc-authentication-results; bh=w2hkW3wgXlXClDWlRQgBa+pBl82MJ/FJ1YsbrNTjilE=; b=YmGsuvcgpL6RXc5JCCmtLHcdipf1pzPxqycqwgF4AohiWaFrZ6yaHlJcqMjg8v3WB3 5uTt9HpPI+XngGUIoAYFZZTW+f5Xf0WIQ7ScjByb1gZ60ydY+db+06ZhS5Tq+RARK6Kk HdYm9ww/5oDl4GNPk2vNJ5Qwvbq08hk7kksuAVZm4yMqT9Y1I8jupL5eZuut/JBYs31T 2Hwcn1vzi3rNDEe7gcUPlTWvERmJ+3wJ7oXpbk22xtQGwO4R6Pwot+1six5Ogv/UB+wE Gu7lw3VO84sTjLmywrR2y78H+ZeRYxkSWrVR1MTU4Q/SEskYiNfXKMYuPsV9A2zOiov1 DDQA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=cumdB8T9; dkim=pass header.i=@codeaurora.org header.s=default header.b=cumdB8T9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id x9-v6si8324360plv.159.2018.04.30.14.27.49; Mon, 30 Apr 2018 14:28:04 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b=cumdB8T9; dkim=pass header.i=@codeaurora.org header.s=default header.b=cumdB8T9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753683AbeD3V1h (ORCPT + 99 others); Mon, 30 Apr 2018 17:27:37 -0400 Received: from smtp.codeaurora.org ([198.145.29.96]:45398 "EHLO smtp.codeaurora.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751084AbeD3V1g (ORCPT ); Mon, 30 Apr 2018 17:27:36 -0400 Received: by smtp.codeaurora.org (Postfix, from userid 1000) id BA97960767; Mon, 30 Apr 2018 21:27:35 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1525123655; bh=B2G9sb532hOQdPp7jq8kexDtj/D9NRscnmnhNw6i+ZY=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=cumdB8T9Yr+6ZtKWGSbZrLWpWuZPYBK8ZzGP/DWEXiHPlyuU8UjKN3gMAIt+oo25D EL7/rtrcK43H/U7n3AXCo+/bKgEOaU+NOU+uR9Pph+bmrq6qjFRNY8YQjiS1EoQqqK wGUiZVpq/fZF9q6GCHiQFlK1rj9+YBX4v4IYIOE0= X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on pdx-caf-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.8 required=2.0 tests=ALL_TRUSTED,BAYES_00, DKIM_SIGNED,T_DKIM_INVALID autolearn=no autolearn_force=no version=3.4.0 Received: from [192.168.0.105] (cpe-174-109-247-98.nc.res.rr.com [174.109.247.98]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: okaya@smtp.codeaurora.org) by smtp.codeaurora.org (Postfix) with ESMTPSA id DE4D760251; Mon, 30 Apr 2018 21:27:33 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1525123655; bh=B2G9sb532hOQdPp7jq8kexDtj/D9NRscnmnhNw6i+ZY=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=cumdB8T9Yr+6ZtKWGSbZrLWpWuZPYBK8ZzGP/DWEXiHPlyuU8UjKN3gMAIt+oo25D EL7/rtrcK43H/U7n3AXCo+/bKgEOaU+NOU+uR9Pph+bmrq6qjFRNY8YQjiS1EoQqqK wGUiZVpq/fZF9q6GCHiQFlK1rj9+YBX4v4IYIOE0= DMARC-Filter: OpenDMARC Filter v1.3.2 smtp.codeaurora.org DE4D760251 Authentication-Results: pdx-caf-mail.web.codeaurora.org; dmarc=none (p=none dis=none) header.from=codeaurora.org Authentication-Results: pdx-caf-mail.web.codeaurora.org; spf=none smtp.mailfrom=okaya@codeaurora.org Subject: Re: pciehp 0000:00:1c.0:pcie004: Timeout on hotplug command 0x1038 (issued 65284 msec ago) To: Bjorn Helgaas Cc: Paul Menzel , Dave Young , linux-pci@vger.kernel.org, kexec@lists.infradead.org, linux-kernel@vger.kernel.org, Lukas Wunner , Eric Biederman , Bjorn Helgaas , Vivek Goyal References: <8770820b-85a0-172b-7230-3a44524e6c9f@molgen.mpg.de> <20180427192207.GG8199@bhelgaas-glaptop.roam.corp.google.com> <20180427211255.GI8199@bhelgaas-glaptop.roam.corp.google.com> <20180428005620.GB1675@dhcp-128-65.nay.redhat.com> <20180428011845.GC1675@dhcp-128-65.nay.redhat.com> <3ebc908fb196168bf0373875ffc5679e@codeaurora.org> <20180430211740.GG95643@bhelgaas-glaptop.roam.corp.google.com> From: Sinan Kaya Message-ID: <7285da70-2c3e-c3b7-62e1-fdbb55a77729@codeaurora.org> Date: Mon, 30 Apr 2018 17:27:32 -0400 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.7.0 MIME-Version: 1.0 In-Reply-To: <20180430211740.GG95643@bhelgaas-glaptop.roam.corp.google.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 4/30/2018 5:17 PM, Bjorn Helgaas wrote: >> What should we do about this? >> >> Since there is an actual HW errata involved, should we quirk this >> root port and not wait as if remove/shutdown doesn't exist? > I was hoping to avoid a quirk because AFAIK all Intel parts have this > issue so it will be an ongoing maintenance issue. I tried to avoid > the timeout delays, e.g., with 40b960831cfa ("PCI: pciehp: Compute > timeout from hotplug command start time"). > > But we still see the alarming messages, so we should probably add a > quirk to get rid of those. > > But I haven't given up on the idea of getting rid of the > pciehp_remove() path. I'm not convinced yet that we actually need to > do anything to shut this device down. I don't like the assumption > that kexec requires this. The kexec is fundamentally just a branch, > and anything we do before the branch (i.e., in the old kernel), we > should also be able to do after the branch (i.e., in the kexec-ed > kernel). > In my experience with kexec, MSI type edge interrupts are harmless. You might just see a few unhandled interrupt messages during boot if something is pending from the first kernel. It is the level interrupts that are more concerning. It remains pending until the interrupt source is cleared. CPU never returns from the interrupt handler to actually continue booting the second kernel. Execution doesn't reach to PCIe hp driver initialization for acknowledging the interrupt. How about remove() only if MSI is disabled? Most root port interrupts are MSI based anyhow. -- Sinan Kaya Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc. Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.