Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp1160723pxu; Fri, 27 Nov 2020 00:50:43 -0800 (PST) X-Google-Smtp-Source: ABdhPJy6UjCBlgtyN4aqOruvM8fxbsLGbnmbTz8paN3VMUEjqesmYQReRHmAfa5FnUfVpgRVEnFx X-Received: by 2002:a17:906:8246:: with SMTP id f6mr344388ejx.343.1606467043026; Fri, 27 Nov 2020 00:50:43 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606467043; cv=none; d=google.com; s=arc-20160816; b=AoVHSgapcWNm8DMX6RC0INCJBXsTHJS7VN1T9aNxz45CepqEyCQEfaMr/E0Xn/KYRG uVhwjOKJMewEAwnvKwY9znz9SqYWhhwJ8MS8ON2pYZSulNL80pHJF2Tm754Hp4JzuLjU Pw3ckEQh5MxdwnCPyZ5pMbArVntYySS9rZhg99p3ZtgKR0EYR0Ux1ZQnszeVgpDlVMqN tcQnayFz82uWX7fY8vTV5OHaNemykQNBmXHCFLz42Bufi2uwmiM9xY9uh84NkNwemu+d Rc2ysMWRBHgcONo88kL7BE1s+6HhXqRw3vwliBaJj3zhLCNRexSymD37Yw2D6Rz6g3oC Z8QQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-language:content-transfer-encoding :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:reply-to:ironport-sdr:ironport-sdr; bh=MHg6gpPaZDsR4dKAXomtBMPY0pw7qo7P7E6XCAkNS0o=; b=Vq4S5m17/i4X3dGg7rRnEutF84fto4CSPujMT9+YuDxV9LDfTYW1q5o+2nXYpj4fxo L9WLSXtaZCya8YZL5/vvY/CInjrC/oNnD2k7PPV1qMnB3B/l3QMrdEWvR/LIUikZmbeC lfKbaRr/IlLcN9li8TgPlUfJhjqeOVr4PYuCDAhLp5aSzau/4zUTfBg1OBBU3MUigjNx Q5gM8c3Duf+KEx3e/ude6Hd2GPetwRZ2wcqiWRyeoidM6GxnldPai1T3UewISCFfmrSb cxhzkVLBEhIZa038cbkKVEHjEiHuaZRslWMVMv5XRGDRSi2T53rY+jw3mZ2YusLwQRwe blPA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id j30si4458838edd.363.2020.11.27.00.50.20; Fri, 27 Nov 2020 00:50:43 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=intel.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392374AbgK0Dxn (ORCPT + 99 others); Thu, 26 Nov 2020 22:53:43 -0500 Received: from mga11.intel.com ([192.55.52.93]:35589 "EHLO mga11.intel.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726908AbgK0Dxn (ORCPT ); Thu, 26 Nov 2020 22:53:43 -0500 IronPort-SDR: LR+0kKPDMj5bllNuAcdT5UrcbTI0DZDWelZRIRXoZbR08f+K726MmBJCHj/yhpwHJ1hsOiY2BD 3afOZje0JlMw== X-IronPort-AV: E=McAfee;i="6000,8403,9817"; a="168846461" X-IronPort-AV: E=Sophos;i="5.78,373,1599548400"; d="scan'208";a="168846461" X-Amp-Result: SKIPPED(no attachment in message) X-Amp-File-Uploaded: False Received: from fmsmga004.fm.intel.com ([10.253.24.48]) by fmsmga102.fm.intel.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 26 Nov 2020 19:53:41 -0800 IronPort-SDR: PDWKXGulJfBdxugdnRyel4ohEwnJXRGbAIqIgVbi6EFUzAKbj/ApUKl9U57dGHoooYYR2o3SuR goe97CCAk3Eg== X-ExtLoop1: 1 X-IronPort-AV: E=Sophos;i="5.78,373,1599548400"; d="scan'208";a="363006640" Received: from unknown (HELO [10.239.160.22]) ([10.239.160.22]) by fmsmga004.fm.intel.com with ESMTP; 26 Nov 2020 19:53:40 -0800 Reply-To: Colin.Xu@intel.com Subject: Re: [RFC PATCH] vfio/pci: Allow force needs_pm_restore as specified by device:vendor To: Alex Williamson Cc: "kvm@vger.kernel.org" , "linux-kernel@vger.kernel.org" , "Fonn, Swee Yee" References: <20201125021824.27411-1-colin.xu@intel.com> <20201125085312.63510f9f@w520.home> From: Colin Xu Message-ID: <7e7a83ca-8530-1afa-4b85-2ef76fb99a5c@intel.com> Date: Fri, 27 Nov 2020 11:53:39 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.4.3 MIME-Version: 1.0 In-Reply-To: <20201125085312.63510f9f@w520.home> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 11/25/20 11:53 PM, Alex Williamson wrote: > On Wed, 25 Nov 2020 10:18:24 +0800 > Colin Xu wrote: > >> Force specific device listed in params pm_restore_ids to follow >> device state save/restore as needs_pm_restore. >> Some device has NoSoftRst so will skip current state save/restore enabled >> by needs_pm_restore. However once the device experienced power state >> D3<->D0 transition, either by idle_d3 or the guest driver changes PM_CTL, >> the guest driver won't get correct devie state although the configure >> space doesn't change. > It sounds like you're describing a device that incorrectly exposes > NoSoftRst when there is in fact some sort of internal reset that > requires reprogramming config space. What device requires this? How > is a user to know when this option is required? It seems like this > would be better handled via a quirk in PCI core that sets a device flag > that the NoSoftRst value is incorrect for the specific affected > devices. Thanks, > > Alex Thanks for the feedback. The device found are: Comet Lake PCH Serial IO I2C Controller [8086:06e8] [8086:06e9] Yes you're right, there is no straight way for user to know the device. The above device I found is during pass through them to VM. Although adding such param may help in certain scenario, it still too device-specific but not common in most cases. I'll try the pci quirk way. Colin > > >> Cc: Swee Yee Fonn >> Signed-off-by: Colin Xu >> --- >> drivers/vfio/pci/vfio_pci.c | 66 ++++++++++++++++++++++++++++++++++++- >> 1 file changed, 65 insertions(+), 1 deletion(-) >> >> diff --git a/drivers/vfio/pci/vfio_pci.c b/drivers/vfio/pci/vfio_pci.c >> index e6190173482c..50a4141c9e1d 100644 >> --- a/drivers/vfio/pci/vfio_pci.c >> +++ b/drivers/vfio/pci/vfio_pci.c >> @@ -34,6 +34,15 @@ >> #define DRIVER_AUTHOR "Alex Williamson " >> #define DRIVER_DESC "VFIO PCI - User Level meta-driver" >> >> +#define VFIO_MAX_PM_DEV 32 >> +struct vfio_pm_devs { >> + struct { >> + unsigned short vendor; >> + unsigned short device; >> + } ids[VFIO_MAX_PM_DEV]; >> + u32 count; >> +}; >> + >> static char ids[1024] __initdata; >> module_param_string(ids, ids, sizeof(ids), 0); >> MODULE_PARM_DESC(ids, "Initial PCI IDs to add to the vfio driver, format is \"vendor:device[:subvendor[:subdevice[:class[:class_mask]]]]\" and multiple comma separated entries can be specified"); >> @@ -64,6 +73,10 @@ static bool disable_denylist; >> module_param(disable_denylist, bool, 0444); >> MODULE_PARM_DESC(disable_denylist, "Disable use of device denylist. Disabling the denylist allows binding to devices with known errata that may lead to exploitable stability or security issues when accessed by untrusted users."); >> >> +static char pm_restore_ids[1024] __initdata; >> +module_param_string(pm_restore_ids, pm_restore_ids, sizeof(pm_restore_ids), 0); >> +MODULE_PARM_DESC(pm_restore_ids, "comma separated device in format of \"vendor:device\""); >> + >> static inline bool vfio_vga_disabled(void) >> { >> #ifdef CONFIG_VFIO_PCI_VGA >> @@ -260,10 +273,50 @@ static bool vfio_pci_nointx(struct pci_dev *pdev) >> return false; >> } >> >> +static struct vfio_pm_devs pm_devs = {0}; >> +static void __init vfio_pci_fill_pm_ids(void) >> +{ >> + char *p, *id; >> + int idx = 0; >> + >> + /* no ids passed actually */ >> + if (pm_restore_ids[0] == '\0') >> + return; >> + >> + /* add ids specified in the module parameter */ >> + p = pm_restore_ids; >> + while ((id = strsep(&p, ","))) { >> + unsigned int vendor, device = PCI_ANY_ID; >> + int fields; >> + >> + if (!strlen(id)) >> + continue; >> + >> + fields = sscanf(id, "%x:%x", &vendor, &device); >> + >> + if (fields != 2) { >> + pr_warn("invalid vendor:device string \"%s\"\n", id); >> + continue; >> + } >> + >> + if (idx < VFIO_MAX_PM_DEV) { >> + pm_devs.ids[idx].vendor = vendor; >> + pm_devs.ids[idx].device = device; >> + pm_devs.count++; >> + idx++; >> + pr_info("add [%04x:%04x] for needs_pm_restore\n", >> + vendor, device); >> + } else { >> + pr_warn("Exceed maximum %d, skip adding [%04x:%04x] for needs_pm_restore\n", >> + VFIO_MAX_PM_DEV, vendor, device); >> + } >> + } >> +} >> + >> static void vfio_pci_probe_power_state(struct vfio_pci_device *vdev) >> { >> struct pci_dev *pdev = vdev->pdev; >> - u16 pmcsr; >> + u16 pmcsr, idx; >> >> if (!pdev->pm_cap) >> return; >> @@ -271,6 +324,16 @@ static void vfio_pci_probe_power_state(struct vfio_pci_device *vdev) >> pci_read_config_word(pdev, pdev->pm_cap + PCI_PM_CTRL, &pmcsr); >> >> vdev->needs_pm_restore = !(pmcsr & PCI_PM_CTRL_NO_SOFT_RESET); >> + >> + for (idx = 0; idx < pm_devs.count; idx++) { >> + if (vdev->pdev->vendor == pm_devs.ids[idx].vendor && >> + vdev->pdev->device == pm_devs.ids[idx].device) { >> + vdev->needs_pm_restore = true; >> + pr_info("force [%04x:%04x] to needs_pm_restore\n", >> + vdev->pdev->vendor, vdev->pdev->device); >> + break; >> + } >> + } >> } >> >> /* >> @@ -2423,6 +2486,7 @@ static int __init vfio_pci_init(void) >> goto out_driver; >> >> vfio_pci_fill_ids(); >> + vfio_pci_fill_pm_ids(); >> >> if (disable_denylist) >> pr_warn("device denylist disabled.\n"); -- Best Regards, Colin Xu