Received: by 2002:a25:86ce:0:0:0:0:0 with SMTP id y14csp1418064ybm; Thu, 23 May 2019 00:16:12 -0700 (PDT) X-Google-Smtp-Source: APXvYqxC9vBjNtB3ytPjPiEsZeXz227I98kf5BZBzt8t5h7goKWsyQaMhVUpcS011gQ6yDvFq1FD X-Received: by 2002:a62:e303:: with SMTP id g3mr100846322pfh.220.1558595772730; Thu, 23 May 2019 00:16:12 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1558595772; cv=none; d=google.com; s=arc-20160816; b=x4ULYHXzTeLoqybudWtZ4GicZvWcvyle7KzjSTWHLHAVzn1nOHRj4RsJZxZqKF2i4+ b/KSillByhxvsy6Ff+xd0i2yqFU0ZfxLsm5RwyIBJjZwvPu3GHC5N8oMnOoCNLLTcBMd KGmgyjx1xsl/6UQ2QSl75e2Q1hVDitz4HK7+jzJx6eGRffBWR6SOzszubwa0WHp4FVUn y43vdHvXO+2sYxrGaMtpp5PkpjPOV3y54PotR6pC4kvcGIKdCuWHXPo5UedYKGRm98gL ftBIR77ua1yW8oKPUZhIfoEErSRAFWM0Q04AI6R4B5J4HgSE1qxC+U20iNUcL4I19upA 8Onw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=ZdKoQ0zzhBrt5n/Svv+5i2zQBIZ8iYDOfET4rVXvIZE=; b=kAnBu0hE9ZSgJOlfz5z275m/xrsy3kWVVNQKQQPkgVNxN5u4l9ImMpxcHfs9o6rEcv Tod1ocsPk9Y8n7aY5ZbIUsh8NnEH/RJioR3k2QUGrjBnMZo9R1lp1lQyR6fwJBzRsrAT rRriTPUShUD8enccyYakARoJsajymfZTm4xO1B9zyB0oqbai3j3E1dBPqCMbR1WdMss4 3ZflEOg8QzvZJPcJ9SJTHXk5Sr+pZei2ICsxifgKDWc2+xLK5lhB4X8rugg4PN6YJmue x5m0PCnjTcffTDxiRawkTHVmVP9ISdrjMLXR695GBYpHvrFjmo8VUgJJ6K9tHG3lfEDy 6iuA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id i8si11636266pgs.220.2019.05.23.00.15.56; Thu, 23 May 2019 00:16:12 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1727466AbfEWHOT (ORCPT + 99 others); Thu, 23 May 2019 03:14:19 -0400 Received: from mx1.redhat.com ([209.132.183.28]:35630 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726070AbfEWHOT (ORCPT ); Thu, 23 May 2019 03:14:19 -0400 Received: from smtp.corp.redhat.com (int-mx08.intmail.prod.int.phx2.redhat.com [10.5.11.23]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id E021F308FBB4; Thu, 23 May 2019 07:14:17 +0000 (UTC) Received: from [10.36.116.67] (ovpn-116-67.ams2.redhat.com [10.36.116.67]) by smtp.corp.redhat.com (Postfix) with ESMTPS id CE77419C4F; Thu, 23 May 2019 07:14:09 +0000 (UTC) Subject: Re: [PATCH v3 04/16] ioasid: Add custom IOASID allocator To: Jacob Pan Cc: iommu@lists.linux-foundation.org, LKML , Joerg Roedel , David Woodhouse , Alex Williamson , Jean-Philippe Brucker , Yi Liu , "Tian, Kevin" , Raj Ashok , Christoph Hellwig , Lu Baolu , Andriy Shevchenko References: <1556922737-76313-1-git-send-email-jacob.jun.pan@linux.intel.com> <1556922737-76313-5-git-send-email-jacob.jun.pan@linux.intel.com> <20190522124255.117de6f6@jacob-builder> From: Auger Eric Message-ID: Date: Thu, 23 May 2019 09:14:07 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:60.0) Gecko/20100101 Thunderbird/60.4.0 MIME-Version: 1.0 In-Reply-To: <20190522124255.117de6f6@jacob-builder> Content-Type: text/plain; charset=utf-8 Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.84 on 10.5.11.23 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.43]); Thu, 23 May 2019 07:14:18 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Jacob, On 5/22/19 9:42 PM, Jacob Pan wrote: > On Tue, 21 May 2019 11:55:55 +0200 > Auger Eric wrote: > >> Hi Jacob, >> >> On 5/4/19 12:32 AM, Jacob Pan wrote: >>> Sometimes, IOASID allocation must be handled by platform specific >>> code. The use cases are guest vIOMMU and pvIOMMU where IOASIDs need >>> to be allocated by the host via enlightened or paravirt interfaces. >>> >>> This patch adds an extension to the IOASID allocator APIs such that >>> platform drivers can register a custom allocator, possibly at boot >>> time, to take over the allocation. Xarray is still used for tracking >>> and searching purposes internal to the IOASID code. Private data of >>> an IOASID can also be set after the allocation. >>> >>> There can be multiple custom allocators registered but only one is >>> used at a time. In case of hot removal of devices that provides the >>> allocator, all IOASIDs must be freed prior to unregistering the >>> allocator. Default XArray based allocator cannot be mixed with >>> custom allocators, i.e. custom allocators will not be used if there >>> are outstanding IOASIDs allocated by the default XA allocator. >>> >>> Signed-off-by: Jacob Pan >>> --- >>> drivers/iommu/ioasid.c | 125 >>> +++++++++++++++++++++++++++++++++++++++++++++++++ 1 file changed, >>> 125 insertions(+) >>> >>> diff --git a/drivers/iommu/ioasid.c b/drivers/iommu/ioasid.c >>> index 99f5e0a..ed2915a 100644 >>> --- a/drivers/iommu/ioasid.c >>> +++ b/drivers/iommu/ioasid.c >>> @@ -17,6 +17,100 @@ struct ioasid_data { >>> }; >>> >>> static DEFINE_XARRAY_ALLOC(ioasid_xa); >>> +static DEFINE_MUTEX(ioasid_allocator_lock); >>> +static struct ioasid_allocator *active_custom_allocator; >>> + >>> +static LIST_HEAD(custom_allocators); >>> +/* >>> + * A flag to track if ioasid default allocator is in use, this will >>> + * prevent custom allocator from being used. The reason is that >>> custom allocator >>> + * must have unadulterated space to track private data with >>> xarray, there cannot >>> + * be a mix been default and custom allocated IOASIDs. >>> + */ >>> +static int default_allocator_active; >>> + >>> +/** >>> + * ioasid_register_allocator - register a custom allocator >>> + * @allocator: the custom allocator to be registered >>> + * >>> + * Custom allocators take precedence over the default xarray based >>> allocator. >>> + * Private data associated with the ASID are managed by ASID >>> common code >>> + * similar to data stored in xa. >>> + * >>> + * There can be multiple allocators registered but only one is >>> active. In case >>> + * of runtime removal of a custom allocator, the next one is >>> activated based >>> + * on the registration ordering. >>> + */ >>> +int ioasid_register_allocator(struct ioasid_allocator *allocator) >>> +{ >>> + struct ioasid_allocator *pallocator; >>> + int ret = 0; >>> + >>> + if (!allocator) >>> + return -EINVAL; >> is it really necessary? Sin't it the caller responsibility? > makes sense. will remove this one and below. >>> + >>> + mutex_lock(&ioasid_allocator_lock); >>> + /* >>> + * No particular preference since all custom allocators >>> end up calling >>> + * the host to allocate IOASIDs. We activate the first one >>> and keep >>> + * the later registered allocators in a list in case the >>> first one gets >>> + * removed due to hotplug. >>> + */ >>> + if (list_empty(&custom_allocators)) >>> + active_custom_allocator = allocator;> + >>> else { >>> + /* Check if the allocator is already registered */ >>> + list_for_each_entry(pallocator, >>> &custom_allocators, list) { >>> + if (pallocator == allocator) { >>> + pr_err("IOASID allocator already >>> registered\n"); >>> + ret = -EEXIST; >>> + goto out_unlock; >>> + } >>> + } >>> + } >>> + list_add_tail(&allocator->list, &custom_allocators); >>> + >>> +out_unlock: >>> + mutex_unlock(&ioasid_allocator_lock); >>> + return ret; >>> +} >>> +EXPORT_SYMBOL_GPL(ioasid_register_allocator); >>> + >>> +/** >>> + * ioasid_unregister_allocator - Remove a custom IOASID allocator >>> + * @allocator: the custom allocator to be removed >>> + * >>> + * Remove an allocator from the list, activate the next allocator >>> in >>> + * the order it was registered. >>> + */ >>> +void ioasid_unregister_allocator(struct ioasid_allocator >>> *allocator) +{ >>> + if (!allocator) >>> + return; >> is it really necessary? >>> + >>> + if (list_empty(&custom_allocators)) { >>> + pr_warn("No custom IOASID allocators active!\n"); >>> + return; >>> + } >>> + >>> + mutex_lock(&ioasid_allocator_lock); >>> + list_del(&allocator->list); >>> + if (list_empty(&custom_allocators)) { >>> + pr_info("No custom IOASID allocators\n")> >>> + /* >>> + * All IOASIDs should have been freed before the >>> last custom >>> + * allocator is unregistered. Unless default >>> allocator is in >>> + * use. >>> + */ >>> + BUG_ON(!xa_empty(&ioasid_xa) >>> && !default_allocator_active); >>> + active_custom_allocator = NULL; >>> + } else if (allocator == active_custom_allocator) { >> In case you are removing the active custom allocator don't you also >> need to check that all ioasids were freed. Otherwise you are likely >> to switch to a different allocator whereas the asid space is >> partially populated. > The assumption is that all custom allocators on the same guest will end > up calling the same host allocator. Having multiple custom allocators in > the list is just a way to support multiple (p)vIOMMUs with hotplug. > Therefore, we cannot nor need to free all PASIDs when one custom > allocator goes away. This is a different situation then switching > between default allocator and custom allocator, where custom allocator > has to start with a clean space. Although I understand your specific usecase, this framework may have other users, where custom allocators behave differently. Also the commit msg says:" In case of hot removal of devices that provides the allocator, all IOASIDs must be freed prior to unregistering the allocator." Thanks Eric > > >>> + active_custom_allocator = >>> list_entry(&custom_allocators, struct ioasid_allocator, list); >>> + pr_info("IOASID allocator changed"); >>> + } >>> + mutex_unlock(&ioasid_allocator_lock); >>> +} >>> +EXPORT_SYMBOL_GPL(ioasid_unregister_allocator); >>> >>> /** >>> * ioasid_set_data - Set private data for an allocated ioasid >>> @@ -68,6 +162,29 @@ ioasid_t ioasid_alloc(struct ioasid_set *set, >>> ioasid_t min, ioasid_t max, data->set = set; >>> data->private = private; >>> >>> + mutex_lock(&ioasid_allocator_lock); >>> + /* >>> + * Use custom allocator if available, otherwise use >>> default. >>> + * However, if there are active IOASIDs already been >>> allocated by default >>> + * allocator, custom allocator cannot be used. >>> + */ >>> + if (!default_allocator_active && active_custom_allocator) { >>> + id = active_custom_allocator->alloc(min, max, >>> active_custom_allocator->pdata); >>> + if (id == INVALID_IOASID) { >>> + pr_err("Failed ASID allocation by custom >>> allocator\n"); >>> + mutex_unlock(&ioasid_allocator_lock); >>> + goto exit_free; >>> + } >>> + /* >>> + * Use XA to manage private data also sanitiy >>> check custom >>> + * allocator for duplicates. >>> + */ >>> + min = id; >>> + max = id + 1; >>> + } else >>> + default_allocator_active = 1; >> nit: true? > yes, i can turn default_allocator_active into a bool type. > >>> + mutex_unlock(&ioasid_allocator_lock); >>> + >>> if (xa_alloc(&ioasid_xa, &id, data, XA_LIMIT(min, max), >>> GFP_KERNEL)) { pr_err("Failed to alloc ioasid from %d to %d\n", >>> min, max); goto exit_free;> @@ -91,9 +208,17 @@ void >>> ioasid_free(ioasid_t ioasid) { >>> struct ioasid_data *ioasid_data; >>> >>> + mutex_lock(&ioasid_allocator_lock); >>> + if (active_custom_allocator) >>> + active_custom_allocator->free(ioasid, >>> active_custom_allocator->pdata); >>> + mutex_unlock(&ioasid_allocator_lock); >>> + >>> ioasid_data = xa_erase(&ioasid_xa, ioasid); >>> >>> kfree_rcu(ioasid_data, rcu); >>> + >>> + if (xa_empty(&ioasid_xa)) >>> + default_allocator_active = 0; >> Isn't it racy? what if an xa_alloc occurs inbetween? >> >> > yes, i will move it under the mutex. Thanks. >>> } >>> EXPORT_SYMBOL_GPL(ioasid_free); >>> >>> >> >> Thanks >> >> Eric > > [Jacob Pan] >