Date: Mon, 16 Apr 2018 08:27:53 -0500
From: Bjorn Helgaas
To: poza@codeaurora.org
Cc: Sinan Kaya, Bjorn Helgaas, Philippe Ombredanne, Thomas Gleixner,
    Greg Kroah-Hartman, Kate Stewart, linux-pci@vger.kernel.org,
    linux-kernel@vger.kernel.org, Dongdong Liu, Keith Busch, Wei Zhang,
    Timur Tabi
Subject: Re: [PATCH v13 0/6] Address error and recovery for AER and DPC
Message-ID: <20180416132753.GA28657@bhelgaas-glaptop.roam.corp.google.com>
In-Reply-To: <9301606a70a213c180d9e6764b002cf9@codeaurora.org>
References: <1523284914-2037-1-git-send-email-poza@codeaurora.org>
    <20180416031600.GB80087@bhelgaas-glaptop.roam.corp.google.com>
    <5b4e667f-bead-a007-78dd-e42d3194f232@codeaurora.org>
    <9301606a70a213c180d9e6764b002cf9@codeaurora.org>

On Mon, Apr 16, 2018 at 11:33:13AM +0530, poza@codeaurora.org wrote:
> On 2018-04-16 09:23, Sinan Kaya wrote:
> > On 4/15/2018 11:16 PM, Bjorn Helgaas wrote:
> > > On Mon, Apr 09, 2018 at 10:41:48AM -0400, Oza Pawandeep wrote:
> > > > This patch set brings in error handling support for DPC
> > > >
> > > > The current implementation of AER and error message broadcasting
> > > > to the EP driver is tightly coupled and limited to AER service
> > > > driver. It is important to factor out broadcasting and other link
> > > > handling callbacks. So that not only when AER gets triggered, but
> > > > also when DPC get triggered (for e.g. ERR_FATAL), callbacks are
> > > > handled appropriately.
> > > >
> > > > DPC should behave identical to AER as far as error handling is
> > > > concerned.
> > > > DPC should remove the devices and not to do recovery for hotplug
> > > > enabled system.
> > >
> > > Is there a specific bug that's fixed by these patches? I didn't see
> > > one mentioned in the changelogs.
> > >
> >
> > There is no actual bug.
> >
> > We realized that DPC and hotplug is heavily integrated today. We
> > have use cases for systems without hotplug support but still
> > support DPC. That's the problem we are trying to solve with this
> > patchset.

Apparently there's a problem with systems that have DPC but not
hotplug. It will be extremely helpful if you can articulate what that
problem is and include it in the appropriate changelog.

> Adding to what Sinan said;
>
> DPC should handle the error handling and recovery similar to AER,
> because finally both are attempting recovery in some or the other
> way, and for that error handling and recovery framework has to be
> loosely coupled. It achieves uniformity and transparency to the
> error handling agents such as AER, DPC, with respect to recovery and
> error handling.
>
> So, this patch-set tries to unify lot of things between error agents
> and make them behave in a well defined way. (be it error (FATAL,
> NON_FATAL) handling or recovery).

I totally support this objective.

Bjorn