Received: by 2002:a05:6358:489b:b0:bb:da1:e618 with SMTP id x27csp90389rwn; Wed, 7 Sep 2022 13:00:31 -0700 (PDT) X-Google-Smtp-Source: AA6agR4TeCi1bc8O+kcdqDtHgJ7YSZQxoV5fOFreauaQbDq3H+5Y3oNyTRIHKZI4c2Rxn2GrHp4e X-Received: by 2002:a17:907:968f:b0:742:1244:b02 with SMTP id hd15-20020a170907968f00b0074212440b02mr3346094ejc.496.1662580831521; Wed, 07 Sep 2022 13:00:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1662580831; cv=none; d=google.com; s=arc-20160816; b=jLZAc65Gt2Al/vEKqBQFrBu9qekgDx15oV3CNn61w8rnfXFZNDfhclJRz8GMBGnCu1 mqknox2oXUX2YF+z2C1Ub8RtxofjFePz4DoiaRGLPLdSZ8+PZg5iZkllq5IiqS1WwV6z DLbqS5/qo9dYrS1mIfDjY7T85Kg7iWasSFpuyijI1wtxEyxvAuF9ITvCoEyWIJT07khN xLdRuafPyV9F8/UQ3eZr0z0zWAChG2vMSQnhGRfKMsQE0SiDTiI8lF+CTr8NsG5NUxDL z8VN3V/90KMidR0+jcV/tDKBILFIxsWNi8v4BQD4Xc1klrXBf/h2l+tuWQbWLTrSPOKh ULQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id; bh=qhDNt61jTxxQE/554xYo4BBpmsNwQWk7dBvwih8Psa0=; b=KRLXOTvVe0a7ZfQLVhVvcsK6N5bOm/DFFO/7iSEm0eU3LhFXyIjaYY9bcoTRUGp3Uu Z/yjTNIy8GE0ppp9KCxDbSl6uWiOICvZyYz1U8VWpMmnZjgGIcoPv/WkI/3nCDqhvAyd ssqbyEY+4mfb97FTx6RseXakvjDzW9JFPRMW8sdz9rQzj7qAWdzBbQBYmxMLpjPJJgWO 9m9Bu4kpZnd3rcE3yHgpH+w+iLDrZC8z5DqdkxHyrHZMrX8CAiWjM/7HXpuj0Xs9FDjq SoORnis3Q8WmymEo2euRVyyiP/5fpkNtLDWFM85V73ur7vdqqQZCEi9Jt48MlzkIrATX mtvA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id bo4-20020a170906d04400b0073d56b67e4bsi188219ejb.801.2022.09.07.13.00.06; Wed, 07 Sep 2022 13:00:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=arm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229850AbiIGTli (ORCPT + 99 others); Wed, 7 Sep 2022 15:41:38 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:37038 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229523AbiIGTlg (ORCPT ); Wed, 7 Sep 2022 15:41:36 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 26615639D; Wed, 7 Sep 2022 12:41:34 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 913CA106F; Wed, 7 Sep 2022 12:41:40 -0700 (PDT) Received: from [10.57.15.197] (unknown [10.57.15.197]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id A19FF3F7B4; Wed, 7 Sep 2022 12:41:20 -0700 (PDT) Message-ID: <0b466705-3a17-1bbc-7ef2-5adadc22d1ae@arm.com> Date: Wed, 7 Sep 2022 20:41:13 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (Windows NT 10.0; rv:102.0) Gecko/20100101 Thunderbird/102.2.1 Subject: Re: [PATCH v6 1/5] iommu: Return -EMEDIUMTYPE for incompatible domain and device/group Content-Language: en-GB To: Jason Gunthorpe Cc: Joerg Roedel , Nicolin Chen , will@kernel.org, alex.williamson@redhat.com, suravee.suthikulpanit@amd.com, marcan@marcan.st, sven@svenpeter.dev, alyssa@rosenzweig.io, robdclark@gmail.com, dwmw2@infradead.org, baolu.lu@linux.intel.com, mjrosato@linux.ibm.com, gerald.schaefer@linux.ibm.com, orsonzhai@gmail.com, baolin.wang@linux.alibaba.com, zhang.lyra@gmail.com, thierry.reding@gmail.com, vdumpa@nvidia.com, jonathanh@nvidia.com, jean-philippe@linaro.org, cohuck@redhat.com, tglx@linutronix.de, shameerali.kolothum.thodi@huawei.com, thunder.leizhen@huawei.com, christophe.jaillet@wanadoo.fr, yangyingliang@huawei.com, jon@solid-run.com, iommu@lists.linux.dev, linux-kernel@vger.kernel.org, asahi@lists.linux.dev, linux-arm-kernel@lists.infradead.org, linux-arm-msm@vger.kernel.org, linux-s390@vger.kernel.org, linux-tegra@vger.kernel.org, virtualization@lists.linux-foundation.org, kvm@vger.kernel.org, kevin.tian@intel.com References: <20220815181437.28127-1-nicolinc@nvidia.com> <20220815181437.28127-2-nicolinc@nvidia.com> <9f91f187-2767-13f9-68a2-a5458b888f00@arm.com> From: Robin Murphy In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit X-Spam-Status: No, score=-11.1 required=5.0 tests=BAYES_00,NICE_REPLY_A, RCVD_IN_DNSWL_HI,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2022-09-07 18:00, Jason Gunthorpe wrote: > On Wed, Sep 07, 2022 at 03:23:09PM +0100, Robin Murphy wrote: >> On 2022-09-07 14:47, Jason Gunthorpe wrote: >>> On Wed, Sep 07, 2022 at 02:41:54PM +0200, Joerg Roedel wrote: >>>> On Mon, Aug 15, 2022 at 11:14:33AM -0700, Nicolin Chen wrote: >>>>> Provide a dedicated errno from the IOMMU driver during attach that the >>>>> reason attached failed is because of domain incompatability. EMEDIUMTYPE >>>>> is chosen because it is never used within the iommu subsystem today and >>>>> evokes a sense that the 'medium' aka the domain is incompatible. >>>> >>>> I am not a fan of re-using EMEDIUMTYPE or any other special value. What >>>> is needed here in EINVAL, but with a way to tell the caller which of the >>>> function parameters is actually invalid. >>> >>> Using errnos to indicate the nature of failure is a well established >>> unix practice, it is why we have hundreds of error codes and don't >>> just return -EINVAL for everything. >>> >>> What don't you like about it? >>> >>> Would you be happier if we wrote it like >>> >>> #define IOMMU_EINCOMPATIBLE_DEVICE xx >>> >>> Which tells "which of the function parameters is actually invalid" ? >> >> FWIW, we're now very close to being able to validate dev->iommu against >> where the domain came from in core code, and so short-circuit ->attach_dev >> entirely if they don't match. > > I don't think this is a long term direction. We have systems now with > a number of SMMU blocks and we really are going to see a need that > they share the iommu_domains so we don't have unncessary overheads > from duplicated io page table memory. > > So ultimately I'd expect to pass the iommu_domain to the driver and > the driver will decide if the page table memory it represents is > compatible or not. Restricting to only the same iommu instance isn't > good.. Who said IOMMU instance? As a reminder, the patch I currently have[1] is matching the driver (via the device ops), which happens to be entirely compatible with drivers supporting cross-instance domains. Mostly because we already have drivers that support cross-instance domains and callers that use them. >> At that point -EINVAL at the driver callback level could be assumed >> to refer to the domain argument, while anything else could be taken >> as something going unexpectedly wrong when the attach may otherwise >> have worked. I've forgotten if we actually had a valid case anywhere >> for "this is my device but even if you retry with a different domain >> it's still never going to work", but I think we wouldn't actually >> need that anyway - it should be clear enough to a caller that if >> attaching to an existing domain fails, then allocating a fresh >> domain and attaching also fails, that's the point to give up. > > The point was to have clear error handling, we either have permenent > errors or 'this domain will never work with this device error'. > > If we treat all error as temporary and just retry randomly it can > create a mess. For instance we might fail to attach to a perfectly > compatible domain due to ENOMEM or something and then go on to > successfully a create a new 2nd domain, just due to races. > > We can certainly code the try everything then allocate scheme, it is > just much more fragile than having definitive error codes. Again, not what I was suggesting. In fact the nature of iommu_attach_group() already rules out bogus devices getting this far, so all a driver currently has to worry about is compatibility of a device that it definitely probed with a domain that it definitely allocated. Therefore, from a caller's point of view, if attaching to an existing domain returns -EINVAL, try another domain; multiple different existing domains can be tried, and may also return -EINVAL for the same or different reasons; the final attempt is to allocate a fresh domain and attach to that, which should always be nominally valid and *never* return -EINVAL. If any attempt returns any other error, bail out down the usual "this should have worked but something went wrong" path. Even if any driver did have a nonsensical "nothing went wrong, I just can't attach my device to any of my domains" case, I don't think it would really need distinguishing from any other general error anyway. Once multiple drivers are in play, the only addition is that the "gatekeeper" check inside iommu_attach_group() may also return -EINVAL if the device is managed by a different driver, since that still fits the same "try again with a different domain" message to the caller. It's actually quite neat - basically the exact same thing we've tried to do with -EMEDIUMTYPE here, but more self-explanatory, since the fact is that a domain itself should never be invalid for attaching to via its own ops, and a group should never be inherently invalid for attaching to a suitable domain, it is only ever a particular combination of group (or device at the internal level) and domain that may not be valid together. Thus as long as we can maintain that basic guarantee that attaching a group to a newly allocated domain can only ever fail for resource allocation reasons and not some spurious "incompatibility", then we don't need any obscure trickery, and a single, clear, error code is in fact enough to say all that needs to be said. Whether iommu_attach_device() should also join the party and start rejecting non-singleton-group devices with a different error, or maintain its current behaviour since its legacy users already have their expectations set, is another matter in its own right. Cheers, Robin. [1] https://gitlab.arm.com/linux-arm/linux-rm/-/commit/683cdff1b2d4ae11f56e38d93b37e66e8c939fc9