Date: Wed, 18 Jan 2023 09:55:39 -0500
Subject: Re: [PATCH v4] vfio: fix potential deadlock on vfio group lock
To: "Tian, Kevin", Alex Williamson
Cc: "akrowiak@linux.ibm.com", "jjherne@linux.ibm.com", "farman@linux.ibm.com",
 "imbrenda@linux.ibm.com", "frankja@linux.ibm.com", "pmorel@linux.ibm.com",
 "david@redhat.com", "Christopherson, , Sean", "intel-gfx@lists.freedesktop.org",
 "cohuck@redhat.com", "linux-kernel@vger.kernel.org", "zhenyuw@linux.intel.com",
 "pasic@linux.ibm.com", "jgg@nvidia.com", "kvm@vger.kernel.org",
 "pbonzini@redhat.com", "linux-s390@vger.kernel.org", "borntraeger@linux.ibm.com",
 "Liu, Yi L", "intel-gvt-dev@lists.freedesktop.org", "Wang, Zhi A"
References: <20230114000351.115444-1-mjrosato@linux.ibm.com>
 <20230117142252.70cc85c7.alex.williamson@redhat.com>
From: Matthew Rosato
X-Mailing-List: linux-kernel@vger.kernel.org

On 1/18/23 4:03 AM, Tian, Kevin wrote:
>> From: Alex Williamson
>> Sent: Wednesday, January 18, 2023 5:23 AM
>>
>> On Fri, 13 Jan 2023
>> 19:03:51 -0500
>> Matthew Rosato wrote:
>>
>>>  void vfio_device_group_close(struct vfio_device *device)
>>>  {
>>> +	void (*put_kvm)(struct kvm *kvm);
>>> +	struct kvm *kvm;
>>> +
>>>  	mutex_lock(&device->group->group_lock);
>>> +	kvm = device->kvm;
>>> +	put_kvm = device->put_kvm;
>>>  	vfio_device_close(device, device->group->iommufd);
>>> +	if (kvm == device->kvm)
>>> +		kvm = NULL;
>>
>> Hmm, so we're using whether the device->kvm pointer gets cleared in
>> last_close to detect whether we should put the kvm reference. That's a
>> bit obscure. Our get and put is also asymmetric.
>>
>> Did we decide that we couldn't do this via a schedule_work() from the
>> last_close function, ie. implementing our own version of an async put?
>> It seems like that potentially has a cleaner implementation, symmetric
>> call points, handling all the storing and clearing of kvm related
>> pointers within the get/put wrappers, passing only a vfio_device to the
>> put wrapper, using the "vfio_device_" prefix for both. Potentially
>> we'd just want an unconditional flush outside of lock here for
>> deterministic release.
>>
>> What's the downside? Thanks,
>>
>
> btw I guess this can be also fixed by Yi's work here:
>
> https://lore.kernel.org/kvm/20230117134942.101112-6-yi.l.liu@intel.com/
>
> with set_kvm(NULL) moved to the release callback of kvm_vfio device,
> such circular lock dependency can be avoided too.

Oh, interesting...

It seems to me that this would eliminate the reported call chain altogether:

kvm_put_kvm
 -> kvm_destroy_vm
  -> kvm_destroy_devices
   -> kvm_vfio_destroy (starting here -- this would no longer be executed)
    -> kvm_vfio_file_set_kvm
     -> vfio_file_set_kvm
      -> group->group_lock/group_rwsem

Because kvm_destroy_devices can no longer end up calling kvm_vfio_destroy and friends, it won't try to acquire the group lock a second time, which makes calling kvm_put_kvm while the group lock is held OK.
The vfio_file_set_kvm call will now always come from a separate thread of execution, kvm_vfio_group_add, kvm_vfio_group_del or the release thread:

kvm_device_release (where the group->group_lock would not be held since vfio does not trigger closing of the kvm fd)
 -> kvm_vfio_destroy (or, kvm_vfio_release)
  -> kvm_vfio_file_set_kvm
   -> vfio_file_set_kvm
    -> group->group_lock/group_rwsem