Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp245406pxf; Wed, 17 Mar 2021 04:25:36 -0700 (PDT) X-Google-Smtp-Source: ABdhPJy0VyQRz0LBfUo61ynGS3qt6hP8X7EXOgIt0eYimXE7GgW4tu7+pKkXpyZqv4hrK64y5YEb X-Received: by 2002:a17:906:3643:: with SMTP id r3mr3619238ejb.527.1615980336173; Wed, 17 Mar 2021 04:25:36 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1615980336; cv=none; d=google.com; s=arc-20160816; b=vN9xgzztY0HoU4kVfKUwocYf1XVdcGTPza07nXDHnZQa+vY9a9znYLiiIsY3mn5rAg uoCehOeLlwTrOsAbD2qj9vCHm+RgcEr4JH7ZG9rahwZ0qCwR/nx6qsm3TQoHvYPCaoEm aES+tCqMkPbA+0OLnLJICwaoe6jFno/+T3M5r9P+uDUt4pIggQsi/rASiHiHkbJFiB9v lZBBTW1ft6s2feBvK/i8KeXo/ImRNU5OOQkqbo5oxr/hviz6cBxIjFVllQ0hjhHkM1K3 oWuJ7ht1n2wAHdRTI8cpiwfOW9GNXiZCFqiOhREbJbChz4/4xr3ax32jJpOVnuhuBuzV 2abg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:dkim-signature; bh=yuYMamyLlkfeJo7AYiuzHkHjqAKzjEJXvLHS65JAKG4=; b=PbDluH2slw3tpBDV3DwwIuhcEr6Xmdy1FXKS8UkgTz6oZar+SLitStQjM72uF7S2kF 5t5sEbbVuFjs1BjI8pZ0nI0DmSv+LcQH0YgHgXiiWzcZrhKssm+O/OqP6xH2qfPTkuXV Sh+pu+dGw8vueEz1+GkuBBZVuIenryFwdQ3T+uL1ZLFECglXbKmnQFifMCN2yqoP49e8 I7i09TEc9pFHDVYhJEpV2TQQ78R/KZ+IToSqJaYo4XtB+oykd6TEtNUWKr9veLA24Pbi bG/zYrzrH04BZeIf/9Eoebxj6JHD9Tqa74wT/FB/p1Igh6zNXe8P9qEYfTh9ClM1C4f7 GP3A== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=PISAlKPs; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id w26si15925634ejn.699.2021.03.17.04.25.13; Wed, 17 Mar 2021 04:25:36 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=PISAlKPs; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231346AbhCQLYI (ORCPT + 99 others); Wed, 17 Mar 2021 07:24:08 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:43262 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S230171AbhCQLYD (ORCPT ); Wed, 17 Mar 2021 07:24:03 -0400 Received: from mail-pf1-x429.google.com (mail-pf1-x429.google.com [IPv6:2607:f8b0:4864:20::429]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 6B7B5C06174A; Wed, 17 Mar 2021 04:23:52 -0700 (PDT) Received: by mail-pf1-x429.google.com with SMTP id q5so883738pfh.10; Wed, 17 Mar 2021 04:23:52 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=date:from:to:cc:subject:message-id:references:mime-version :content-disposition:content-transfer-encoding:in-reply-to; bh=yuYMamyLlkfeJo7AYiuzHkHjqAKzjEJXvLHS65JAKG4=; b=PISAlKPs5eJf3xPfNBvEoM5HPOFAxcvj7RITsYjZ2llpeOY5u1kgpaNm+9hfnPiwa3 fH2GQ+z+Rp7UT3r89amdgPZ4bV8noqhAYVnwDMxS0lkzoR4vcoarpkaxVrVdnK7jJfPs 2zfRPNOvAWH05T7tKksu+SIMwK3IsRiqvvJWC/k2NCAIZhy8UNvo+LqemYYTU8TtnuDk Viq7VAxSnzF7b6nIGtlZpJq3Zn7gA7bspMC9phvs2dh7tmTKQL2YUQLbdX2TDq3q9APc ryKQkg1gwn8jfS4NPzBuzjlpzhum2cGRpSybwEmHxALHGQ6YXaZKtspsIbUPsAL0dz8l jlfw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:date:from:to:cc:subject:message-id:references :mime-version:content-disposition:content-transfer-encoding :in-reply-to; bh=yuYMamyLlkfeJo7AYiuzHkHjqAKzjEJXvLHS65JAKG4=; b=WAwM730H6C07eMy7LSI7bFzLETOAkG5JgG+pzj3O1P7eNk7ny1QKSgKsHJQYztPqQo U6n9xsugpCXplolRGWru7QXOiM3m5M/wgfdF7SF/09uK+tyO9MWHNL/oAHCBjPGYTqOl N/kHi6lS3psHqh3Nt17O47GjZ7H96N34zlw1qEAokEod6NTv2RmBVjlZJwbsWjI7wOoQ iSzkcYbJihHGY2VEo8rN99ftiqlgRlfk5SeCNkqUSg902E7MWi3QD9+PCiLkNIoBMwl3 SgBYEULBRzrwJcDJQ+HcnS21M0MmDCKQNAdrnKbIGl28XjT8hvweM/woejFH70priT8u NbyA== X-Gm-Message-State: AOAM5319LFClXVgvHkEuMTVkjJMkT0lpljAxazdNH2pz3UqMzSjmbiNH CEJuSYk0o6MFYweWlttpPQ0= X-Received: by 2002:aa7:980a:0:b029:20c:5402:5de9 with SMTP id e10-20020aa7980a0000b029020c54025de9mr3792986pfl.18.1615980231948; Wed, 17 Mar 2021 04:23:51 -0700 (PDT) Received: from localhost ([103.248.31.158]) by smtp.gmail.com with ESMTPSA id u9sm19270567pgc.59.2021.03.17.04.23.50 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Wed, 17 Mar 2021 04:23:51 -0700 (PDT) Date: Wed, 17 Mar 2021 16:53:09 +0530 From: Amey Narkhede To: Leon Romanovsky Cc: alex.williamson@redhat.com, raphael.norwitz@nutanix.com, linux-pci@vger.kernel.org, bhelgaas@google.com, linux-kernel@vger.kernel.org, alay.shah@nutanix.com, suresh.gumpula@nutanix.com, shyam.rajendran@nutanix.com, felipe@nutanix.com Subject: Re: [PATCH 4/4] PCI/sysfs: Allow userspace to query and set device reset mechanism Message-ID: <20210317112309.nborigwfd26px2mj@archlinux> References: <20210315134323.llz2o7yhezwgealp@archlinux> <20210315135226.avwmnhkfsgof6ihw@pali> <20210315083409.08b1359b@x1.home.shazbot.org> <20210315153341.miip637z35mwv7fv@archlinux> <20210315102950.230de1d6@x1.home.shazbot.org> <20210315183226.GA14801@raphael-debian-dev> <20210317102447.73no7mhox75xetlf@archlinux> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 21/03/17 01:02PM, Leon Romanovsky wrote: > On Wed, Mar 17, 2021 at 03:54:47PM +0530, Amey Narkhede wrote: > > On 21/03/17 06:20AM, Leon Romanovsky wrote: > > > On Mon, Mar 15, 2021 at 06:32:32PM +0000, Raphael Norwitz wrote: > > > > On Mon, Mar 15, 2021 at 10:29:50AM -0600, Alex Williamson wrote: > > > > > On Mon, 15 Mar 2021 21:03:41 +0530 > > > > > Amey Narkhede wrote: > > > > > > > > > > > On 21/03/15 05:07PM, Leon Romanovsky wrote: > > > > > > > On Mon, Mar 15, 2021 at 08:34:09AM -0600, Alex Williamson wrote: > > > > > > > > On Mon, 15 Mar 2021 14:52:26 +0100 > > > > > > > > Pali Roh?r wrote: > > > > > > > > > > > > > > > > > On Monday 15 March 2021 19:13:23 Amey Narkhede wrote: > > > > > > > > > > slot reset (pci_dev_reset_slot_function) and secondary bus > > > > > > > > > > reset(pci_parent_bus_reset) which I think are hot reset and > > > > > > > > > > warm reset respectively. > > > > > > > > > > > > > > > > > > No. PCI secondary bus reset = PCIe Hot Reset. Slot reset is just another > > > > > > > > > type of reset, which is currently implemented only for PCIe hot plug > > > > > > > > > bridges and for PowerPC PowerNV platform and it just call PCI secondary > > > > > > > > > bus reset with some other hook. PCIe Warm Reset does not have API in > > > > > > > > > kernel and therefore drivers do not export this type of reset via any > > > > > > > > > kernel function (yet). > > > > > > > > > > > > > > > > Warm reset is beyond the scope of this series, but could be implemented > > > > > > > > in a compatible way to fit within the pci_reset_fn_methods[] array > > > > > > > > defined here. Note that with this series the resets available through > > > > > > > > pci_reset_function() and the per device reset attribute is sysfs remain > > > > > > > > exactly the same as they are currently. The bus and slot reset > > > > > > > > methods used here are limited to devices where only a single function is > > > > > > > > affected by the reset, therefore it is not like the patch you proposed > > > > > > > > which performed a reset irrespective of the downstream devices. This > > > > > > > > series only enables selection of the existing methods. Thanks, > > > > > > > > > > > > > > Alex, > > > > > > > > > > > > > > I asked the patch author here [1], but didn't get any response, maybe > > > > > > > you can answer me. What is the use case scenario for this functionality? > > > > > > > > > > > > > > Thanks > > > > > > > > > > > > > > [1] https://lore.kernel.org/lkml/YE389lAqjJSeTolM@unreal/ > > > > > > > > > > > > > Sorry for not responding immediately. There were some buggy wifi cards > > > > > > which needed FLR explicitly not sure if that behavior is fixed in > > > > > > drivers. Also there is use a case at Nutanix but the engineer who > > > > > > is involved is on PTO that is why I did not respond immediately as > > > > > > I don't know the details yet. > > > > > > > > > > And more generally, devices continue to have reset issues and we > > > > > impose a fixed priority in our ordering. We can and probably should > > > > > continue to quirk devices when we find broken resets so that we have > > > > > the best default behavior, but it's currently not easy for an end user > > > > > to experiment, ie. this reset works, that one doesn't. We might also > > > > > have platform issues where a given reset works better on a certain > > > > > platform. Exposing a way to test these things might lead to better > > > > > quirks. In the case I think Pali was looking for, they wanted a > > > > > mechanism to force a bus reset, if this was in reference to a single > > > > > function device, this could be accomplished by setting a priority for > > > > > that mechanism, which would translate to not only the sysfs reset > > > > > attribute, but also the reset mechanism used by vfio-pci. Thanks, > > > > > > > > > > Alex > > > > > > > > > > > > > To confirm from our end - we have seen many such instances where default > > > > reset methods have not worked well on our platform. Debugging these > > > > issues is painful in practice, and this interface would make it far > > > > easier. > > > > > > > > Having an interface like this would also help us better communicate the > > > > issues we find with upstream. Allowing others to more easily test our > > > > (or other entities') findings should give better visibility into > > > > which issues apply to the device in general and which are platform > > > > specific. In disambiguating the former from the latter, we should be > > > > able to better quirk devices for everyone, and in the latter cases, this > > > > interface allows for a safer and more elegant solution than any of the > > > > current alternatives. > > > > > > So to summarize, we are talking about test and debug interface to > > > overcome HW bugs, am I right? > > > > > > My personal experience shows that once the easy workaround exists > > > (and write to generally available sysfs is very simple), the vendors > > > and users desire for proper fix decreases drastically. IMHO, we will > > > see increase of copy/paste in SO and blog posts, but reduce in quirks. > > > > > > My 2-cents. > > > > > I agree with your point but at least it gives the userspace ability > > to use broken device until bug is fixed in upstream. > > As I said, I don't expect many fixes once "userspace" will be able to > use cheap workaround. There is no incentive to fix it. > > > This is also applicable for obscure devices without upstream > > drivers for example custom FPGA based devices. > > This is not relevant to upstream kernel. Those vendors ship everything > custom, they don't need upstream, we don't need them :) > By custom I meant hobbyists who could tinker with their custom FPGA. > > Another main application which I forgot to mention is virtualization > > where vmm wants to reset the device when the guest is reset, > > to emulate machine reboot as closely as possible. > > It can work in very narrow case, because reset will cause to device > reprobe and most likely the driver will be different from the one that > started reset. I can imagine that net devices will lose their state and > config after such reset too. > Not sure if I got that 100% right. The pci_reset_function() function saves and restores device state over the reset. > IMHO, it will be saner for everyone if virtualization don't try such resets. > > Thanks > The exists reset sysfs attribute was added for exactly this case though. Thanks, Amey