Received: by 2002:a05:7412:b10a:b0:f3:1519:9f41 with SMTP id az10csp2354248rdb; Sun, 3 Dec 2023 13:17:32 -0800 (PST) X-Google-Smtp-Source: AGHT+IHVw5kBgU15NKgm0cIZgAm3wp0Qpsu7EkhCvQnPdf4IRq9DI+tQ4Vj5EneSG9FgIknpwaOX X-Received: by 2002:a5e:a709:0:b0:7b4:28f8:d9 with SMTP id b9-20020a5ea709000000b007b428f800d9mr4639328iod.34.1701638251943; Sun, 03 Dec 2023 13:17:31 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701638251; cv=none; d=google.com; s=arc-20160816; b=LOBFjq8to/W0bvM7Dq3kiqMHdWiICiDE1IW2PYPD176t8+KoEOUt7zxBnaI2JXJnmA ew/WHN3Unx7qhc8i9x5bEJj5gdUc7N4Pw+yNtkLDe8ElDo/IBjAaKHqZ2cnZfuStTwWJ 8+b9m6lmMFdmtTS31YzWT58KgV4nHkOS1D1naeIE3LmYD5SbYrrmj3IzE9gKCwMTYPum 1teOTVOj9EZv9MTVvYOoo9977+lazGS2KGdWBW05aMrDpBOl19VGqeJrwFTo2InC+Xw3 JH1mrKZaztPjDiqnk2G9TcaNc6GU02D2Lu+84IErxfwtlyUfVhDOnJSY5kRYrwYpTVj9 FZSw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:message-id:subject:cc:to:from:date:dkim-signature; bh=YaXGORkgyD8V8K3kt/4/CeA1q7F0AqMm8u3Qa3gndrU=; fh=iFH+Dv/cTpugdM9huIrmYQk0qUY0zXejBDjC6IOAQKo=; b=JvVBPqMpZz65O3cwNRWrJE7XfOkUqjBWtn8ZzmfnOPSJZZu9uqJuoN2ANmOFgyvp7r V6xhwOxYdYe+looKfQcXQPWE6KhfV/bgz6Hd18yMoMfJDWHvmnSuXCmlZgLaVmcViZOg N+t9DY17beC5+3bcD6bsjSHkcgC3AsXlsasJWmEi5PiZU6ddZXtUnRArs28OxEEqeBku eD7GSrZlSQLDZT8YZmQuFgbaMj21DeCNkxjYm1d2TdhGfIvb++t7+L21OK60bGM1Hf17 opvdadc3N5deLzTkEgZlUpaAL04kgQHSVa4G020jHFNQ6d+iEwhvrjApH+F4nrIO7iy1 5RJQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=EPCIT76H; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:8 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from fry.vger.email (fry.vger.email. [2620:137:e000::3:8]) by mx.google.com with ESMTPS id k37-20020a63ff25000000b005ad8009e304si6846805pgi.784.2023.12.03.13.17.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 03 Dec 2023 13:17:31 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:8 as permitted sender) client-ip=2620:137:e000::3:8; Authentication-Results: mx.google.com; dkim=pass header.i=@ziepe.ca header.s=google header.b=EPCIT76H; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::3:8 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by fry.vger.email (Postfix) with ESMTP id 42F9B808A362; Sun, 3 Dec 2023 13:17:29 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at fry.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229698AbjLCVD0 (ORCPT + 99 others); Sun, 3 Dec 2023 16:03:26 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37688 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229450AbjLCVDZ (ORCPT ); Sun, 3 Dec 2023 16:03:25 -0500 Received: from mail-qv1-xf2e.google.com (mail-qv1-xf2e.google.com [IPv6:2607:f8b0:4864:20::f2e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 108C7AA for ; Sun, 3 Dec 2023 13:03:31 -0800 (PST) Received: by mail-qv1-xf2e.google.com with SMTP id 6a1803df08f44-67abd1879c0so5337566d6.2 for ; Sun, 03 Dec 2023 13:03:31 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ziepe.ca; s=google; t=1701637410; x=1702242210; darn=vger.kernel.org; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:from:to:cc:subject:date:message-id:reply-to; bh=YaXGORkgyD8V8K3kt/4/CeA1q7F0AqMm8u3Qa3gndrU=; b=EPCIT76HtPBzRKP4TG88oBEItjAruT/TTDWWfxqJSSEkJEoxLIuU6pjKPlLcgcHGYD WFz1IaJNLWkca7++6Qqc5Jc1VGIDPjKpNzXGyA0vcz2yFjRQFTXs8p8NYck2NLxke3Qj nCwFHy5GZ4viyiQ9urM/vDn7N2gvKceph+Bnlc2h5pb7Gq+lXuXWANfWvKk3a+BIuMsD kfmqABSLvCtDUQy5WQYcJ/9CB3Rf2ETDH+U/kjlbwc+Jb3KB9flQZ216XRoEIQof9VsU +cKuwGPSx5jrO2BLwNNdcZLv/iKeJfa4tpILcfpWlXL8Ljmo8qc17ycCEPV+y2aLa++a /DxA== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20230601; t=1701637410; x=1702242210; h=in-reply-to:content-disposition:mime-version:references:message-id :subject:cc:to:from:date:x-gm-message-state:from:to:cc:subject:date :message-id:reply-to; bh=YaXGORkgyD8V8K3kt/4/CeA1q7F0AqMm8u3Qa3gndrU=; b=Dy0P3us3DLokGq9v/ugvvWJMh4N2G3zVfGO6L3e9KJ/qwIOQt3df3FxX+nryumUZvC FKN25w9aF7miKjNsVewYLBoP9HIlZW1Q3YK1VEP1+3oxKfkUM6ghNLLa4fG7dgnC5yYv BC+bIQIScb44Dt4twm+aUbldM2+341Z9vd02fRfAbBrmlHNf+dC+kPW4UICLthwxOhxf skhBjeT8s/4IuJVfhctDGXT5S5dP9jLsHwoEe/X3JxdoB7mzsN1pGd1qMsUxJTnVH3Qy FAiYFyWVevjEQIf/cK1JoFc157ecRl/If1PGHQy+b4TvENS6inOZHy6zE4oUyQ2VU6Kv nSrg== X-Gm-Message-State: AOJu0YxBaKJ/PbxLzR2S/81DxQ1jQPHltbdqgJ0jj6IEvoTGpGoFWroD 8HWa14sA3D5a21miofN8SB3qfw== X-Received: by 2002:a0c:ec86:0:b0:67a:8d3c:22a8 with SMTP id u6-20020a0cec86000000b0067a8d3c22a8mr4170105qvo.65.1701637410157; Sun, 03 Dec 2023 13:03:30 -0800 (PST) Received: from ziepe.ca (hlfxns017vw-142-134-23-187.dhcp-dynamic.fibreop.ns.bellaliant.net. [142.134.23.187]) by smtp.gmail.com with ESMTPSA id k7-20020a0cabc7000000b0067a3991d002sm3712187qvb.30.2023.12.03.13.03.29 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Sun, 03 Dec 2023 13:03:29 -0800 (PST) Received: from jgg by wakko with local (Exim 4.95) (envelope-from ) id 1r9nEc-009I57-0J; Sun, 03 Dec 2023 10:14:14 -0400 Date: Sun, 3 Dec 2023 10:14:14 -0400 From: Jason Gunthorpe To: Baolu Lu Cc: Joerg Roedel , Will Deacon , Robin Murphy , Kevin Tian , Jean-Philippe Brucker , Nicolin Chen , Yi Liu , Jacob Pan , Yan Zhao , iommu@lists.linux.dev, kvm@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH v7 12/12] iommu: Improve iopf_queue_flush_dev() Message-ID: <20231203141414.GJ1489931@ziepe.ca> References: <20231115030226.16700-1-baolu.lu@linux.intel.com> <20231115030226.16700-13-baolu.lu@linux.intel.com> <20231201203536.GG1489931@ziepe.ca> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=0.2 required=5.0 tests=DATE_IN_PAST_06_12,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on fry.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (fry.vger.email [0.0.0.0]); Sun, 03 Dec 2023 13:17:29 -0800 (PST) On Sun, Dec 03, 2023 at 04:53:08PM +0800, Baolu Lu wrote: > On 12/2/23 4:35 AM, Jason Gunthorpe wrote: > > On Wed, Nov 15, 2023 at 11:02:26AM +0800, Lu Baolu wrote: > > > The iopf_queue_flush_dev() is called by the iommu driver before releasing > > > a PASID. It ensures that all pending faults for this PASID have been > > > handled or cancelled, and won't hit the address space that reuses this > > > PASID. The driver must make sure that no new fault is added to the queue. > > This needs more explanation, why should anyone care? > > > > More importantly, why is*discarding* the right thing to do? > > Especially why would we discard a partial page request group? > > > > After we change a translation we may have PRI requests in a > > queue. They need to be acknowledged, not discarded. The DMA in the > > device should be restarted and the device should observe the new > > translation - if it is blocking then it should take a DMA error. > > > > More broadly, we should just let things run their normal course. The > > domain to deliver the fault to should be determined very early. If we > > get a fault and there is no fault domain currently assigned then just > > restart it. > > > > The main reason to fence would be to allow the domain to become freed > > as the faults should be holding pointers to it. But I feel there are > > simpler options for that then this.. > > In the iommu_detach_device_pasid() path, the domain is about to be > removed from the pasid of device. The IOMMU driver performs the > following steps sequentially: I know that is why it does, but it doesn't explain at all why. > 1. Clears the pasid translation entry. Thus, all subsequent DMA > transactions (translation requests, translated requests or page > requests) targeting the iommu domain will be blocked. > > 2. Waits until all pending page requests for the device's PASID have > been reported to upper layers via the iommu_report_device_fault(). > However, this does not guarantee that all page requests have been > responded. > > 3. Free all partial page requests for this pasid since the page request > response is only needed for a complete request group. There's no > action required for the page requests which are not last of a request > group. But we expect the last to come eventually since everything should be grouped properly, so why bother doing this? Indeed if 2 worked, how is this even possible to have partials? > 5. Follow the IOMMU hardware requirements (for example, VT-d sepc, > section 7.10, Software Steps to Drain Page Requests & Responses) to > drain in-flight page requests and page group responses between the > remapping hardware queues and the endpoint device. > > With above steps done in iommu_detach_device_pasid(), the pasid could be > re-used for any other address space. As I said, that isn't even required. There is no issue with leaking PRI's across attachments. > > I suppose the driver is expected to stop calling > > iommu_report_device_fault() before calling this function, but that > > doesn't seem like it is going to be possible. Drivers should be > > implementing atomic replace for the PASID updates and in that case > > there is no momement when it can say the HW will stop generating PRI. > > Atomic domain replacement for a PASID is not currently implemented in > the core or driver. It is, the driver should implement set_dev_pasid in such a way that repeated calls do replacements, ideally atomically. This is what ARM SMMUv3 does after my changes. > Even if atomic replacement were to be implemented, > it would be necessary to ensure that all translation requests, > translated requests, page requests and responses for the old domain are > drained before switching to the new domain. Again, no it isn't required. Requests simply have to continue to be acked, it doesn't matter if they are acked against the wrong domain because the device will simply re-issue them.. Jason