Received: by 2002:ac0:a581:0:0:0:0:0 with SMTP id m1-v6csp1026540imm; Fri, 22 Jun 2018 09:06:19 -0700 (PDT) X-Google-Smtp-Source: ADUXVKKJWPPKDNmeJTiwX6CAWeBjIyPtoyFxAX4WkjGIZ3/Fa6HnTxz3SVikZGd6BmZEV0m2CUSz X-Received: by 2002:a63:65c2:: with SMTP id z185-v6mr1933571pgb.276.1529683579112; Fri, 22 Jun 2018 09:06:19 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1529683579; cv=none; d=google.com; s=arc-20160816; b=oKLEKkhTgbosdJF5PpWcb70EILWAoIZTS8xQV30s7NGybkFltyyn4mRIt7DrhNzqg6 0OYcvNxoOBW1e1MCkfiXImrXsfB0jdnPgeZZ3032x1S7DmfbA65QqlTSYtTh1nHF5YgP /P9GsUxQSFcYdTUEPulVXiWNbdvSPH0y7TwYDrU9GBC9T2f0OOq8YBCq5xPEY2URMCSd VU/KSm0WoVV2K/iAW3fAEFcfCm9P32I3u4+Sm3+WPmh5HElkLG2NyIdY7OzC96bDF8rv uJnobowki37mkYMN8bOjU627rcm8UW4g4tYYTOXmShP6Zs+6gXbC8JdyP7P6RwJaEk5/ Q3Dw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dmarc-filter :dkim-signature:dkim-signature:arc-authentication-results; bh=O3/tH+8y4kp9WL3H2/r0pMU6wR93dwq4R6DZ+Nm66FY=; b=MWPa0u8MtvPFU8W5CRN2F3fafXk1iuw0ezMVG7JrzcWHVyJzCzrwBPAlbFJHL3q7wK 3ser73zVSIblWNaSq11YJMXoWfO14vO4l7z9o48yIIoSVJ2evVPWJGr0NvsszYc2GGbf s7kmxLK4NlnYPjnYedV031aCBWMvXDmt+hzz8leIiJpNoJUg3WRpgVD+LZw1UYHExls+ O9sbuQDDXYzCnxe2OvV9hy+O1nUHeu64XA+JjYt6cPMLoNAH+ad2sKPOcBnhEl1KnuMs ZoOboJzlaK4AozRetwmUjvrmJGKJeJC+ELLUVxArpTmw8VZpnZ6TAb27Q5rYExtt7X6C E7/g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b="pCVPyW/v"; dkim=pass header.i=@codeaurora.org header.s=default header.b=UNAb77NO; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g14-v6si7609648plq.41.2018.06.22.09.06.03; Fri, 22 Jun 2018 09:06:19 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@codeaurora.org header.s=default header.b="pCVPyW/v"; dkim=pass header.i=@codeaurora.org header.s=default header.b=UNAb77NO; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S934005AbeFVQEI (ORCPT + 99 others); Fri, 22 Jun 2018 12:04:08 -0400 Received: from smtp.codeaurora.org ([198.145.29.96]:36732 "EHLO smtp.codeaurora.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1751317AbeFVQEF (ORCPT ); Fri, 22 Jun 2018 12:04:05 -0400 Received: by smtp.codeaurora.org (Postfix, from userid 1000) id 40AA860AFF; Fri, 22 Jun 2018 16:04:05 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1529683445; bh=UgkMRQ9YzVRTr/yFymVoDYJLVIAuQ5mjwDNqMVRZGlI=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=pCVPyW/vo4G9yuKA0PXmzfPRaVf46u2LCTX0/RaTAN9737PNYxe731Bi1PdGe8C2h OdwJebhxtxvJFla2Jr/vbKWw+SZq2kOEkKyCSM3lCm943wqv/4kQJPDdR26fs5Rgsr LzOaDoYIlOQZ8DrMV3ZS8DGy/VfFSMBz8H55Kee0= X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on pdx-caf-mail.web.codeaurora.org X-Spam-Level: X-Spam-Status: No, score=-2.8 required=2.0 tests=ALL_TRUSTED,BAYES_00, DKIM_SIGNED,T_DKIM_INVALID autolearn=no autolearn_force=no version=3.4.0 Received: from [192.168.0.117] (cpe-174-109-247-98.nc.res.rr.com [174.109.247.98]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) (Authenticated sender: okaya@smtp.codeaurora.org) by smtp.codeaurora.org (Postfix) with ESMTPSA id 1A608605FD; Fri, 22 Jun 2018 16:04:02 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=codeaurora.org; s=default; t=1529683444; bh=UgkMRQ9YzVRTr/yFymVoDYJLVIAuQ5mjwDNqMVRZGlI=; h=Subject:To:Cc:References:From:Date:In-Reply-To:From; b=UNAb77NOU7r4l25V11wYoOnyy8AtA5QeaL9RQKscRHh5JMCC8n4XFVHB/jZtLBPQB L3QfgOlAULsCb0r9VV1L+4IKsLt9RgOkqIulqvDtFNfssMHhuu9/tzr5tcgacJb+WZ ehKigAyOIhth0E8WUzIum0lQxQrUwE2X+YLEK5n0= DMARC-Filter: OpenDMARC Filter v1.3.2 smtp.codeaurora.org 1A608605FD Authentication-Results: pdx-caf-mail.web.codeaurora.org; dmarc=none (p=none dis=none) header.from=codeaurora.org Authentication-Results: pdx-caf-mail.web.codeaurora.org; spf=none smtp.mailfrom=okaya@codeaurora.org Subject: Re: [PATCH 1/2] IB/hfi1: Try slot reset before secondary bus reset To: Bjorn Helgaas Cc: linux-pci@vger.kernel.org, sulrich@codeaurora.org, timur@codeaurora.org, Mike Marciniszyn , "open list:HFI1 DRIVER" , linux-arm-msm@vger.kernel.org, Dennis Dalessandro , open list , Jason Gunthorpe , Doug Ledford , linux-arm-kernel@lists.infradead.org, Alex Williamson References: <1524167784-5911-1-git-send-email-okaya@codeaurora.org> <20180619214346.GD33049@bhelgaas-glaptop.roam.corp.google.com> <2593baec-8a28-a3e7-7ebf-7c21addda0b8@codeaurora.org> <20180622140125.GD108993@bhelgaas-glaptop.roam.corp.google.com> From: Sinan Kaya Message-ID: <98c4531b-0b64-b9e1-ef95-562d1613d893@codeaurora.org> Date: Fri, 22 Jun 2018 12:04:02 -0400 User-Agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:52.0) Gecko/20100101 Thunderbird/52.8.0 MIME-Version: 1.0 In-Reply-To: <20180622140125.GD108993@bhelgaas-glaptop.roam.corp.google.com> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 6/22/2018 10:01 AM, Bjorn Helgaas wrote: >> Since Oza's DPC/AER patch to refactor fatal error handling, both >> hotplug driver and AER/DPC driver will try removing devices and >> perform enumeration on link events/AER events. >> >> Perfect environment for race condition without a change. > Yeah, this looks like a bit of a mess. I guess we're getting two > interrupts (AER interrupt and hotplug interrupt) and we should > coordinate their handling somehow. I don't have a proposal. This > race could happen independent of the device reset paths, of course. I was hoping for pci_reset_slot() to avoid this but it turns out the root port does not have any slot pointers set on my system even though hotplug is enabled. Slot pointer is only set for the child objects. https://patchwork.kernel.org/patch/10351515/ * A PCI bus may host multiple slots, each slot may support a reset mechanism * independent of other slots. For instance, some slots may support slot power * control. In the case of a 1:1 bus to slot architecture, this function may * wrap the bus reset to avoid spurious slot related events such as hotplug. * Generally a slot reset should be attempted before a bus reset. All of the * function of the slot and any subordinate buses behind the slot are reset * through this function. PCI config space of all devices in the slot and * behind the slot is saved before and restored after reset. -- Sinan Kaya Qualcomm Datacenter Technologies, Inc. as an affiliate of Qualcomm Technologies, Inc. Qualcomm Technologies, Inc. is a member of the Code Aurora Forum, a Linux Foundation Collaborative Project.