Message-ID: <8b3d31ef-caf7-da92-fa95-0df378d5b091@arm.com>
Date: Tue, 3 May 2022 14:04:37 +0100
Subject: Re: [RESEND PATCH v8 00/11] Fix BUG_ON in vfio_iommu_group_notifier()
To: Jason Gunthorpe, Qian Cai
Cc: Lu Baolu, Joerg Roedel, Kevin Tian, Liu Yi L,
 iommu@lists.linux-foundation.org, linux-kernel@vger.kernel.org,
 Jean-Philippe Brucker
References: <20220418005000.897664-1-baolu.lu@linux.intel.com>
 <20220502161204.GA22@qian> <20220502164216.GP8364@nvidia.com>
From: Robin Murphy
In-Reply-To: <20220502164216.GP8364@nvidia.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On 2022-05-02 17:42, Jason Gunthorpe wrote:
> On Mon, May 02, 2022 at 12:12:04PM -0400, Qian Cai wrote:
>> On Mon, Apr 18, 2022 at 08:49:49AM +0800, Lu Baolu wrote:
>>> Hi Joerg,
>>>
>>> This is a resend version of v8 posted here:
>>> https://lore.kernel.org/linux-iommu/20220308054421.847385-1-baolu.lu@linux.intel.com/
>>> as we discussed in this thread:
>>> https://lore.kernel.org/linux-iommu/Yk%2Fq1BGN8pC5HVZp@8bytes.org/
>>>
>>> All patches can be applied perfectly except this one:
>>> - [PATCH v8 02/11] driver core: Add dma_cleanup callback in bus_type
>>> It conflicts with below refactoring commit:
>>> - 4b775aaf1ea99 "driver core: Refactor sysfs and drv/bus remove hooks"
>>> The conflict has been fixed in this post.
>>>
>>> No functional changes in this series. I suppress cc-ing this series to
>>> all v8 reviewers in order to avoid spam.
>>>
>>> Please consider it for your iommu tree.
>>
>> Reverting this series fixed an user-after-free while doing SR-IOV.
>>
>> BUG: KASAN: use-after-free in __lock_acquire
>> Read of size 8 at addr ffff080279825d78 by task qemu-system-aar/22429
>> CPU: 24 PID: 22429 Comm: qemu-system-aar Not tainted 5.18.0-rc5-next-20220502 #69
>> Call trace:
>>   dump_backtrace
>>   show_stack
>>   dump_stack_lvl
>>   print_address_description.constprop.0
>>   print_report
>>   kasan_report
>>   __asan_report_load8_noabort
>>   __lock_acquire
>>   lock_acquire.part.0
>>   lock_acquire
>>   _raw_spin_lock_irqsave
>>   arm_smmu_detach_dev
>>   arm_smmu_detach_dev at drivers/iommu/arm/arm-smmu-v3/arm-smmu-v3.c:2377
>>   arm_smmu_attach_dev
>
> Hum.
>
> So what has happened is that VFIO does this sequence:
>
>   iommu_detach_group()
>   iommu_domain_free()
>   iommu_group_release_dma_owner()
>
> Which, I think should be valid, API wise.
>
> From what I can see reading the code SMMUv3 blows up above because it
> doesn't have a detach_dev op:
>
>     .default_domain_ops = &(const struct iommu_domain_ops) {
>         .attach_dev = arm_smmu_attach_dev,
>         .map_pages = arm_smmu_map_pages,
>         .unmap_pages = arm_smmu_unmap_pages,
>         .flush_iotlb_all = arm_smmu_flush_iotlb_all,
>         .iotlb_sync = arm_smmu_iotlb_sync,
>         .iova_to_phys = arm_smmu_iova_to_phys,
>         .enable_nesting = arm_smmu_enable_nesting,
>         .free = arm_smmu_domain_free,
>     }
>
> But it is internally tracking the domain inside the master - so when
> the next domain is attached it does this:
>
> static void arm_smmu_detach_dev(struct arm_smmu_master *master)
> {
>     struct arm_smmu_domain *smmu_domain = master->domain;
>
>     spin_lock_irqsave(&smmu_domain->devices_lock, flags);
>
> And explodes as the domain has been freed but master->domain was not
> NULL'd.
>
> It worked before because iommu_detach_group() used to attach the
> default group and that was before the domain was freed in the above
> sequence.

Oof, I totally overlooked the significance of that little subtlety in
review :(

> I'm guessing SMMU3 needs to call it's arm_smmu_detach_dev(master) from
> the detach_dev op and null it's cached copy of the domain, but I don't
> know this driver.. Robin?

The original intent was that .detach_dev is deprecated in favour of
default domains, and when the latter are in use, a device is always
attached *somewhere* once probed (i.e. group->domain is never NULL). At
face value, the neatest fix IMO would probably be for SMMUv3's
.domain_free to handle smmu_domain->devices being non-empty and detach
them at that point. However that wouldn't be viable for virtio-iommu or
anyone else keeping an internal one-way association of devices to their
current domains.

If we're giving up entirely on that notion of .detach_dev going away
then all default-domain-supporting drivers probably want checking to
make sure that path hasn't bitrotted; both Arm SMMU drivers had it
proactively removed 6 years ago; virtio-iommu never had it at all; newer
drivers like apple-dart have some code there, but it won't have ever run
until now.

We *could* stay true to the original paradigm by introducing some real
usage of IOMMU_DOMAIN_BLOCKED, such that we could keep one or more of
those around to actively attach to instead of having groups in this
unattached limbo state, but that's a bigger job involving adding support
to drivers as well; too much for a quick fix now...

Robin.
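
For illustration only, here is a minimal userspace sketch of the stale
master->domain problem and of the ".domain_free detaches whatever is
still on its list" idea from the message above. Every toy_* name is
hypothetical and merely stands in for arm_smmu_master and
arm_smmu_domain; this is not the driver's code, locking is left out, and
it assumes the domain keeps a list of its attached devices (which, as
noted above, is not true of every driver).

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Hypothetical, simplified stand-ins for arm_smmu_master/arm_smmu_domain. */
struct toy_domain;

struct toy_master {
    struct toy_domain *domain;  /* driver-internal cached "current domain" */
    struct toy_master *next;    /* link in domain->devices */
};

struct toy_domain {
    struct toy_master *devices; /* masters currently attached */
};

/* Unlink the master from its cached domain and clear the cached pointer. */
static void toy_detach_dev(struct toy_master *master)
{
    struct toy_domain *d = master->domain;
    struct toy_master **pp;

    if (!d)
        return;
    for (pp = &d->devices; *pp; pp = &(*pp)->next) {
        if (*pp == master) {
            *pp = master->next;
            break;
        }
    }
    master->next = NULL;
    master->domain = NULL;
}

/* Attach always detaches from the cached domain first, as in the trace. */
static void toy_attach_dev(struct toy_domain *d, struct toy_master *master)
{
    toy_detach_dev(master);     /* a stale master->domain would be read here */
    master->domain = d;
    master->next = d->devices;
    d->devices = master;
}

/* Free path that tolerates a non-empty device list. */
static void toy_domain_free(struct toy_domain *d)
{
    while (d->devices)
        toy_detach_dev(d->devices);
    free(d);
}

/*
 * Replays "free the old domain, then attach a new one" against the toy
 * model. Returns 1 if the master ended up cleanly detached at the end.
 */
static int toy_replay_sequence(void)
{
    struct toy_master m = { 0 };
    struct toy_domain *a = calloc(1, sizeof(*a));
    struct toy_domain *b = calloc(1, sizeof(*b));

    if (!a || !b) {
        free(a);
        free(b);
        return 0;
    }
    toy_attach_dev(a, &m);
    toy_domain_free(a);         /* clears m.domain before freeing a */
    toy_attach_dev(b, &m);      /* no access to the freed domain */
    toy_domain_free(b);
    return m.domain == NULL;
}
```

Without the loop in toy_domain_free(), the second toy_attach_dev() call
would follow m.domain into freed memory, which is the same shape as the
KASAN report quoted above.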