Date: Mon, 22 Mar 2021 11:10:03 -0600
From: Alex Williamson
To: Leon Romanovsky
Cc: "Enrico Weigelt, metux IT consult", Amey Narkhede,
    raphael.norwitz@nutanix.com, linux-pci@vger.kernel.org,
    bhelgaas@google.com, linux-kernel@vger.kernel.org,
    alay.shah@nutanix.com, suresh.gumpula@nutanix.com,
    shyam.rajendran@nutanix.com, felipe@nutanix.com
Subject: Re: [PATCH 4/4] PCI/sysfs: Allow userspace to query and set device reset mechanism
Message-ID: <20210322111003.50d64f2c@omen.home.shazbot.org>
References: <20210317113140.3de56d6c@omen.home.shazbot.org>
    <20210318103935.2ec32302@omen.home.shazbot.org>
    <20210319102313.179e9969@omen.home.shazbot.org>
    <20210320085942.3cefcc48@x1.home.shazbot.org>
X-Mailing-List: linux-kernel@vger.kernel.org

On Sun, 21 Mar 2021 10:40:55 +0200
Leon Romanovsky wrote:

> On Sat, Mar 20, 2021 at 08:59:42AM -0600, Alex Williamson wrote:
> > On Sat, 20 Mar 2021 11:10:08 +0200
> > Leon Romanovsky wrote:
> > > On Fri, Mar 19, 2021 at 10:23:13AM -0600, Alex Williamson wrote:
> > > >
> > > > What if we taint the kernel or pci_warn() for cases where either all
> > > > the reset methods are disabled, ie. 'echo none > reset_method', or any
> > > > time a device specific method is disabled?
> > >
> > > What does it mean "none"? Does it mean nothing supported? If yes, I think that
> > > pci_warn() will be enough. At least for me, taint is usable during debug stages,
> > > probably if device doesn't crash no one will look to see /proc/sys/kernel/tainted.
> >
> > "none" as implemented in this patch, clearing the enabled function
> > reset methods.
>
> It is far from intuitive, the empty string will be easier to understand,
> because "none" means no reset at all.

"No reset at all" is what "none" achieves, the
pci_dev.reset_methods_enabled bitmap is cleared. We can use an empty
string, but I think we want a way to clear all enabled resets and a way
to return it to the default. I could see arguments for an empty string
serving either purpose, so this version proposed explicitly using
"none" and "default", as included in the ABI update.

> > > > I'd almost go so far as to prevent disabling a device specific reset
> > > > altogether, but for example should a device specific reset that fixes
> > > > an aspect of FLR behavior prevent using a bus reset? I'd prefer in that
> > > > case if direct FLR were disabled via a device flag introduced with the
> > > > quirk and the remaining resets can still be selected by preference.
> > >
> > > I don't know enough to discuss the PCI details, but you raised good point.
> > > This sysfs is user visible API that is presented as is from device point
> > > of view. It can be easily run into problems if PCI/core doesn't work with
> > > user's choice.
> > >
> > >
> > > > Theoretically all the other reset methods work and are available, it's
> > > > only a policy decision which to use, right?
> > >
> > > But this patch was presented as a way to overcome situations where
> > > supported != working and user magically knows which reset type to set.
> >
> > It's not magic, the new sysfs attributes expose which resets are
> > enabled and the order that they're used, the user can simply select the
> > next one. Being able to bypass a broken reset method is a helpful side
> > effect of getting to select a preferred reset method.
>
> Magic in a sense that user has no idea what those resets mean, the
> expectation is that he will blindly iterate till something works.

Which ought to actually be a safe thing to do. We should have quirks to
exclude resets that are known broken but still probe as present and I'd
be perfectly fine if we issue a warning if the user disables all resets
for a given device.

> > > If you want to take this patch to be policy decision tool,
> > > it will need to accept "reset_type1,reset_type2,..." sort of input,
> > > so fallback will work natively.
> >
> > I don't see that as a requirement. We have fall-through support in the
> > kernel, but for a given device we're really only ever going to make use
> > of one of those methods. If a user knows enough about a device to have
> > a preference, I think it can be singular. That also significantly
> > simplifies the interface and supporting code. Thanks,
>
> I'm struggling to get requirements from this thread. You talked about
> policy decision to overtake fallback mechanism, Amey wanted to avoid
> quirks.
>
> Do you have an example of such devices or we are talking about
> theoretical case?

Look at any device that already has a reset quirk and the process it
took to get there. Those are more than just theoretical cases.

For policy preference, I already described how I've configured QEMU to
prefer a bus reset rather than a PM reset due to lack of specification
regarding the scope of a PM "soft reset". This interface would allow a
system policy to do that same thing.

I don't think anyone is suggesting this as a means to avoid quirks that
would resolve reset issues and create the best default general
behavior. This provides a mechanism to test various reset methods, and
thereby identify broken methods, and set a policy. Sure, that policy
might be to avoid a broken reset in the interim before it gets quirked
and there's potential for abuse there, but I think the benefits
outweigh the risks.

> And I don't see why simple line parser with loop iterator over strchr()
> suddenly becomes complicated code.

Setting multiple bits in a bitmap is easy.
How do you then go on to allow the user to specify an ordering
preference? If you have an algorithm you'd like to propose that allows
the user to manage the ordering when enabling multiple methods without
substantially increasing the complexity, please share.

IMO, a given device will generally use one reset method and it seems
sufficient to restrict user preference to achieve all the use cases
I've noted. Thanks,

Alex
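
For illustration only, the semantics argued over in this message can be
sketched in userspace C. This is a rough model under stated assumptions,
not the code from the patch: the method names, their default order, and
the helpers reset_method_store() and mask_from_list() are all
hypothetical, and only the idea of an enabled-methods bitmap that "none"
clears and "default" restores comes from the discussion. The second
helper shows the point about ordering: a strchr() loop can set several
bits easily, but the resulting bitmap is the same whatever order the
user listed the methods in.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical method names, in an assumed default priority order. */
enum { RESET_DEV_SPECIFIC, RESET_FLR, RESET_PM, RESET_BUS, RESET_COUNT };

static const char *const reset_names[RESET_COUNT] = {
	"device_specific", "flr", "pm", "bus",
};

/* Assumption: the device probed as supporting every method. */
#define RESET_DEFAULT_MASK ((1u << RESET_COUNT) - 1)

/*
 * Model of a write to the attribute: "none" clears the bitmap,
 * "default" restores it, a single method name enables only that method.
 * Returns 0 on success, -1 on unrecognized input. A real sysfs handler
 * would also have to tolerate a trailing newline; this sketch does not.
 */
int reset_method_store(unsigned int *enabled, const char *buf)
{
	if (strcmp(buf, "none") == 0) {
		*enabled = 0;
		return 0;
	}
	if (strcmp(buf, "default") == 0) {
		*enabled = RESET_DEFAULT_MASK;
		return 0;
	}
	for (int i = 0; i < RESET_COUNT; i++) {
		if (strcmp(buf, reset_names[i]) == 0) {
			*enabled = 1u << i;
			return 0;
		}
	}
	return -1;
}

/*
 * Comma-separated variant: walk the list with strchr() and set one bit
 * per recognized name. Unknown or empty tokens are silently skipped.
 * The bitmap records which methods were named, not their order.
 */
unsigned int mask_from_list(const char *list)
{
	unsigned int mask = 0;
	const char *p = list;

	while (*p) {
		const char *comma = strchr(p, ',');
		size_t len = comma ? (size_t)(comma - p) : strlen(p);

		for (int i = 0; i < RESET_COUNT; i++) {
			if (strlen(reset_names[i]) == len &&
			    strncmp(p, reset_names[i], len) == 0)
				mask |= 1u << i;
		}
		p += len;
		if (*p == ',')
			p++;
	}
	return mask;
}
```

In this model mask_from_list("bus,pm") and mask_from_list("pm,bus")
return the same value, so honoring a user-specified order would need
some additional state, such as an ordered array of method indices.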