Date: Mon, 16 Apr 2018 19:42:44 +0530
From: poza@codeaurora.org
To: Bjorn Helgaas
Cc: Sinan Kaya, Bjorn Helgaas, Philippe Ombredanne, Thomas Gleixner,
    Greg Kroah-Hartman, Kate Stewart, linux-pci@vger.kernel.org,
    linux-kernel@vger.kernel.org, Dongdong Liu, Keith Busch, Wei Zhang,
    Timur Tabi
Subject: Re: [PATCH v13 0/6] Address error and recovery for AER and DPC
In-Reply-To: <20180416132753.GA28657@bhelgaas-glaptop.roam.corp.google.com>
References: <1523284914-2037-1-git-send-email-poza@codeaurora.org>
    <20180416031600.GB80087@bhelgaas-glaptop.roam.corp.google.com>
    <5b4e667f-bead-a007-78dd-e42d3194f232@codeaurora.org>
    <9301606a70a213c180d9e6764b002cf9@codeaurora.org>
    <20180416132753.GA28657@bhelgaas-glaptop.roam.corp.google.com>
Message-ID: <0b94f5a75fbfec9063e2c07006be3fbb@codeaurora.org>

On 2018-04-16 18:57, Bjorn Helgaas wrote:
> On Mon, Apr 16, 2018 at 11:33:13AM +0530, poza@codeaurora.org wrote:
>> On 2018-04-16 09:23, Sinan Kaya wrote:
>> > On 4/15/2018 11:16 PM, Bjorn Helgaas wrote:
>> > > On Mon, Apr 09, 2018 at 10:41:48AM -0400, Oza Pawandeep wrote:
>> > > > This patch set brings in error handling support for DPC
>> > > >
>> > > > The current implementation of AER and error message broadcasting
>> > > > to the EP driver is tightly coupled and limited to AER service
>> > > > driver.
>> > > > It is important to factor out broadcasting and other link handling
>> > > > callbacks. So that not only when AER gets triggered, but also
>> > > > when DPC get triggered (for e.g. ERR_FATAL), callbacks are handled
>> > > > appropriately.
>> > > >
>> > > > DPC should behave identical to AER as far as error handling is
>> > > > concerned.
>> > > > DPC should remove the devices and not to do recovery for hotplug
>> > > > enabled system.
>> > >
>> > > Is there a specific bug that's fixed by these patches? I didn't see
>> > > one mentioned in the changelogs.
>> > >
>> >
>> > There is no actual bug.
>> >
>> > We realized that DPC and hotplug is heavily integrated today. We
>> > have use cases for systems without hotplug support but still
>> > support DPC. That's the problem we are trying to solve with this
>> > patchset.
>
> Apparently there's a problem with systems that have DPC but not
> hotplug. It will be extremely helpful if you can articulate what that
> problem is and include it in the appropriate changelog.
>
>> Adding to what Sinan said;
>>
>> DPC should handle the error handling and recovery similar to AER,
>> because finally both are attempting recovery in some or the other
>> way, and for that error handling and recovery framework has to be
>> loosely coupled. It achieves uniformity and transparency to the
>> error handling agents such as AER, DPC, with respect to recovery and
>> error handling.
>>
>> So, this patch-set tries to unify lot of things between error agents
>> and make them behave in a well defined way. (be it error (FATAL,
>> NON_FATAL) handling or recovery).
>
> I totally support this objective.

Thanks Bjorn, I will include this objective in the changelog along with
Sinan's text.

I am not clear on one last thing, Bjorn: do we need the last patch,
patch-6, which handles the hotplug case?

Also, I think we could take this patch-set as a basic attempt to unify
the code, which it does.
In follow-up patches we can then improve on things such as whether to
take different actions for FATAL and NON_FATAL cases, and I can make the
needed changes to AER and DPC.

Please let me know how this sounds.

>
> Bjorn