Received: by 2002:a05:6a10:16a7:0:0:0:0 with SMTP id gp39csp1308852pxb; Fri, 13 Nov 2020 09:17:44 -0800 (PST) X-Google-Smtp-Source: ABdhPJzSPiExihIzwWeQ3TcIc5+UpzLAEYaUGTypWZ45jRYJRuOo1sDIbZ1ULg3x+We3qed4FiyT X-Received: by 2002:a17:906:b255:: with SMTP id ce21mr3046468ejb.137.1605287864391; Fri, 13 Nov 2020 09:17:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1605287864; cv=none; d=google.com; s=arc-20160816; b=ZVx/qUopdfTtTpojMh2xBIvcOF6uedN+BBU9Y9Ru60YpkqutTPhhPBTL3LYnhLHRYt hGyJbhe0yvwU7MJbxhosCP3kGFuRaMLJQg54eo+DnGCriWr/YKPVziwCqM61IpKmyGHy XCeAxKEgXHaMBtmEzCaBsI65e6NbIqYi6Rnk5pWimI8l2h+KbJj5Ravgw8bP8xkT3QQX 6PkjmccpzRk96ttWDnzbg+tTPHnrRHM+aGuiEW0E34iHKr8wA3yJ/mWvhrDtns86k9/p 7ZQqE5LKE0omU0gpKmrdQM0smWXGp2lLKXVuFugYcyxO91tJTkx8WVgWdGDib0WBRroc 36Og== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-language:content-transfer-encoding :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=5qGvsEYdJTEe7tzuPXfJCNS7NxvtM6VTCsMNAwtkKUE=; b=YDOttvkB6fmSpuPjCvwF8ZA80NUovhaeAxT1Os6O+zVtIGIrH34bgPhUCknRoM3sHv i3DI6JygKzPgyAo7TK4/Sh4NdJw73JsUlZi+8+jHJIuKwc+D4+BO8Az2zqquvM2slPO3 elF8KG9AeaEjHZGpKNs1PGQZcyhVtMSZCs0Ubp6qc1Yzd5T6UEanR4Q3YWsyvu31cMDa U2ai/5IBBZRNn4gaWQFiVQjxyLrmc5riou8wrt9TayIQ37EIoAGr/9ZIOHg2L4qmh8tB Jrituvwscm51bPr3gkSah7vdJxoSe/FTjXEYlT/6MwKN6lrm5pD6LxjoVLYrl3MZGwOo zfYA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=CuOoh3C5; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id m8si6899534edj.357.2020.11.13.09.17.18; Fri, 13 Nov 2020 09:17:44 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@ibm.com header.s=pp1 header.b=CuOoh3C5; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726315AbgKMROX (ORCPT + 99 others); Fri, 13 Nov 2020 12:14:23 -0500 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:10590 "EHLO mx0a-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-FAIL) by vger.kernel.org with ESMTP id S1726309AbgKMROW (ORCPT ); Fri, 13 Nov 2020 12:14:22 -0500 Received: from pps.filterd (m0098413.ppops.net [127.0.0.1]) by mx0b-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 0ADH0iaC057866; Fri, 13 Nov 2020 12:14:29 -0500 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ibm.com; h=subject : to : cc : references : from : message-id : date : mime-version : in-reply-to : content-type : content-transfer-encoding; s=pp1; bh=5qGvsEYdJTEe7tzuPXfJCNS7NxvtM6VTCsMNAwtkKUE=; b=CuOoh3C5VMyBFc8rRq1053GzrE+k9ANDty9hwmWZhEsowNvKQx76cVlwFAXWSaCKNmE2 tSU512DHSxfR0ghiMF+LzF7wR2HxwPoQtRrLeNk+mis2rpnerOpevO0rrbl14MPol8k/ 7SNaxO0gXCqO6Jzq7JLAoR5JxL6UnhtozAFHx7JK48hY4LD7DrM2v3JKrK3JuUIEfAXF xNlg0n03B3DbSU2VQOhPHSg//Ys4pOw/vgQs1zwsbWRgxJUFSqCagJrB7n+d9HM/U4vX q+Yu1mcVApAbfNGRGjA+nbYPRtXFuThDhRRec2TPkcRxeAkzHfDhMb9A5wwnFkd7y05g nA== Received: from pps.reinject (localhost [127.0.0.1]) by mx0b-001b2d01.pphosted.com with ESMTP id 34sx480fdu-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 13 Nov 2020 12:14:28 -0500 Received: from m0098413.ppops.net (m0098413.ppops.net [127.0.0.1]) by pps.reinject (8.16.0.36/8.16.0.36) with SMTP id 0ADH154b059212; Fri, 13 Nov 2020 12:14:28 -0500 Received: from ppma05wdc.us.ibm.com (1b.90.2fa9.ip4.static.sl-reverse.com [169.47.144.27]) by mx0b-001b2d01.pphosted.com with ESMTP id 34sx480fdh-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 13 Nov 2020 12:14:28 -0500 Received: from pps.filterd (ppma05wdc.us.ibm.com [127.0.0.1]) by ppma05wdc.us.ibm.com (8.16.0.42/8.16.0.42) with SMTP id 0ADHD3C2027758; Fri, 13 Nov 2020 17:14:27 GMT Received: from b03cxnp07029.gho.boulder.ibm.com (b03cxnp07029.gho.boulder.ibm.com [9.17.130.16]) by ppma05wdc.us.ibm.com with ESMTP id 34nk79v7qa-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Fri, 13 Nov 2020 17:14:27 +0000 Received: from b03ledav003.gho.boulder.ibm.com (b03ledav003.gho.boulder.ibm.com [9.17.130.234]) by b03cxnp07029.gho.boulder.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 0ADHEOs030671190 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Fri, 13 Nov 2020 17:14:24 GMT Received: from b03ledav003.gho.boulder.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id B36106A04F; Fri, 13 Nov 2020 17:14:24 +0000 (GMT) Received: from b03ledav003.gho.boulder.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 392EF6A047; Fri, 13 Nov 2020 17:14:23 +0000 (GMT) Received: from cpe-66-24-58-13.stny.res.rr.com (unknown [9.85.152.80]) by b03ledav003.gho.boulder.ibm.com (Postfix) with ESMTP; Fri, 13 Nov 2020 17:14:22 +0000 (GMT) Subject: Re: [PATCH v11 05/14] s390/vfio-ap: implement in-use callback for vfio_ap driver To: Halil Pasic Cc: linux-s390@vger.kernel.org, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, freude@linux.ibm.com, borntraeger@de.ibm.com, cohuck@redhat.com, mjrosato@linux.ibm.com, alex.williamson@redhat.com, kwankhede@nvidia.com, fiuczy@linux.ibm.com, frankja@linux.ibm.com, david@redhat.com, hca@linux.ibm.com, gor@linux.ibm.com References: <20201022171209.19494-1-akrowiak@linux.ibm.com> <20201022171209.19494-6-akrowiak@linux.ibm.com> <20201027142711.1b57825e.pasic@linux.ibm.com> From: Tony Krowiak Message-ID: <6a5feb16-46b5-9dca-7e85-7d344b0ffa24@linux.ibm.com> Date: Fri, 13 Nov 2020 12:14:22 -0500 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.5.0 MIME-Version: 1.0 In-Reply-To: <20201027142711.1b57825e.pasic@linux.ibm.com> Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Content-Language: en-US X-TM-AS-GCONF: 00 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.312,18.0.737 definitions=2020-11-13_10:2020-11-13,2020-11-13 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 impostorscore=0 clxscore=1015 lowpriorityscore=0 mlxlogscore=999 malwarescore=0 mlxscore=0 phishscore=0 suspectscore=3 adultscore=0 bulkscore=0 spamscore=0 priorityscore=1501 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2009150000 definitions=main-2011130107 Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 10/27/20 9:27 AM, Halil Pasic wrote: > On Thu, 22 Oct 2020 13:12:00 -0400 > Tony Krowiak wrote: > >> Let's implement the callback to indicate when an APQN >> is in use by the vfio_ap device driver. The callback is >> invoked whenever a change to the apmask or aqmask would >> result in one or more queue devices being removed from the driver. The >> vfio_ap device driver will indicate a resource is in use >> if the APQN of any of the queue devices to be removed are assigned to >> any of the matrix mdevs under the driver's control. >> >> Signed-off-by: Tony Krowiak >> --- >> drivers/s390/crypto/vfio_ap_drv.c | 1 + >> drivers/s390/crypto/vfio_ap_ops.c | 78 +++++++++++++++++++-------- >> drivers/s390/crypto/vfio_ap_private.h | 2 + >> 3 files changed, 60 insertions(+), 21 deletions(-) >> >> diff --git a/drivers/s390/crypto/vfio_ap_drv.c b/drivers/s390/crypto/vfio_ap_drv.c >> index 73bd073fd5d3..8934471b7944 100644 >> --- a/drivers/s390/crypto/vfio_ap_drv.c >> +++ b/drivers/s390/crypto/vfio_ap_drv.c >> @@ -147,6 +147,7 @@ static int __init vfio_ap_init(void) >> memset(&vfio_ap_drv, 0, sizeof(vfio_ap_drv)); >> vfio_ap_drv.probe = vfio_ap_mdev_probe_queue; >> vfio_ap_drv.remove = vfio_ap_mdev_remove_queue; >> + vfio_ap_drv.in_use = vfio_ap_mdev_resource_in_use; >> vfio_ap_drv.ids = ap_queue_ids; >> >> ret = ap_driver_register(&vfio_ap_drv, THIS_MODULE, VFIO_AP_DRV_NAME); >> diff --git a/drivers/s390/crypto/vfio_ap_ops.c b/drivers/s390/crypto/vfio_ap_ops.c >> index 1357f8f8b7e4..9e9fad560859 100644 >> --- a/drivers/s390/crypto/vfio_ap_ops.c >> +++ b/drivers/s390/crypto/vfio_ap_ops.c >> @@ -522,18 +522,40 @@ vfio_ap_mdev_verify_queues_reserved_for_apid(struct ap_matrix_mdev *matrix_mdev, >> return 0; >> } >> >> +#define MDEV_SHARING_ERR "Userspace may not re-assign queue %02lx.%04lx " \ >> + "already assigned to %s" >> + >> +static void vfio_ap_mdev_log_sharing_err(const char *mdev_name, >> + unsigned long *apm, >> + unsigned long *aqm) >> +{ >> + unsigned long apid, apqi; >> + >> + for_each_set_bit_inv(apid, apm, AP_DEVICES) >> + for_each_set_bit_inv(apqi, aqm, AP_DOMAINS) >> + pr_err(MDEV_SHARING_ERR, apid, apqi, mdev_name); > Isn't error rather severe for this? For my taste even warning would be > severe for this. The user only sees a EADDRINUSE returned from the sysfs interface, so Conny asked if I could log a message to indicate which APQNs are in use by which mdev. I can change this to an info message, but it will be missed if the log level is set higher. Maybe Conny can put in her two cents here since she asked for this. > >> +} >> + >> /** >> * vfio_ap_mdev_verify_no_sharing >> * >> - * Verifies that the APQNs derived from the cross product of the AP adapter IDs >> - * and AP queue indexes comprising the AP matrix are not configured for another >> + * Verifies that each APQN derived from the cross product of the AP adapter IDs >> + * and AP queue indexes comprising an AP matrix is not assigned to a >> * mediated device. AP queue sharing is not allowed. >> * >> - * @matrix_mdev: the mediated matrix device >> + * @matrix_mdev: the mediated matrix device to which the APQNs being verified >> + * are assigned. If the value is not NULL, then verification will >> + * proceed for all other matrix mediated devices; otherwise, all >> + * matrix mediated devices will be verified. >> + * @mdev_apm: mask indicating the APIDs of the APQNs to be verified >> + * @mdev_aqm: mask indicating the APQIs of the APQNs to be verified >> * >> - * Returns 0 if the APQNs are not shared, otherwise; returns -EADDRINUSE. >> + * Returns 0 if no APQNs are not shared, otherwise; returns -EADDRINUSE if one >> + * or more APQNs are shared. >> */ >> -static int vfio_ap_mdev_verify_no_sharing(struct ap_matrix_mdev *matrix_mdev) >> +static int vfio_ap_mdev_verify_no_sharing(struct ap_matrix_mdev *matrix_mdev, >> + unsigned long *mdev_apm, >> + unsigned long *mdev_aqm) >> { >> struct ap_matrix_mdev *lstdev; >> DECLARE_BITMAP(apm, AP_DEVICES); >> @@ -550,14 +572,15 @@ static int vfio_ap_mdev_verify_no_sharing(struct ap_matrix_mdev *matrix_mdev) >> * We work on full longs, as we can only exclude the leftover >> * bits in non-inverse order. The leftover is all zeros. >> */ >> - if (!bitmap_and(apm, matrix_mdev->matrix.apm, >> - lstdev->matrix.apm, AP_DEVICES)) >> + if (!bitmap_and(apm, mdev_apm, lstdev->matrix.apm, AP_DEVICES)) >> continue; >> >> - if (!bitmap_and(aqm, matrix_mdev->matrix.aqm, >> - lstdev->matrix.aqm, AP_DOMAINS)) >> + if (!bitmap_and(aqm, mdev_aqm, lstdev->matrix.aqm, AP_DOMAINS)) >> continue; >> >> + vfio_ap_mdev_log_sharing_err(dev_name(mdev_dev(lstdev->mdev)), >> + apm, aqm); >> + >> return -EADDRINUSE; >> } >> >> @@ -683,6 +706,7 @@ static ssize_t assign_adapter_store(struct device *dev, >> { >> int ret; >> unsigned long apid; >> + DECLARE_BITMAP(apm, AP_DEVICES); >> struct mdev_device *mdev = mdev_from_dev(dev); >> struct ap_matrix_mdev *matrix_mdev = mdev_get_drvdata(mdev); >> >> @@ -708,18 +732,18 @@ static ssize_t assign_adapter_store(struct device *dev, >> if (ret) >> goto done; >> >> - set_bit_inv(apid, matrix_mdev->matrix.apm); >> + memset(apm, 0, sizeof(apm)); >> + set_bit_inv(apid, apm); >> >> - ret = vfio_ap_mdev_verify_no_sharing(matrix_mdev); >> + ret = vfio_ap_mdev_verify_no_sharing(matrix_mdev, apm, >> + matrix_mdev->matrix.aqm); > What is the benefit of using a copy here? I mean we have the vfio_ap lock > so nobody can see the bit we speculatively flipped. The vfio_ap_mdev_verify_no_sharing() function definition was changed so that it can also be re-used by the vfio_ap_mdev_resource_in_use() function rather than duplicating that code for the in_use callback. The in-use callback is invoked by the AP bus which has no concept of a mediated device, so I made this change to accommodate that fact. > > I've also pointed out in the previous patch that in_use() isn't > perfectly reliable (at least in theory) because of a race. We discussed that privately and determined that the sysfs assignment interfaces will use mutex_trylock() to avoid races. > > Otherwise looks good to me! > >> if (ret) >> - goto share_err; >> + goto done; >> >> + set_bit_inv(apid, matrix_mdev->matrix.apm); >> vfio_ap_mdev_link_queues(matrix_mdev, LINK_APID, apid); >> ret = count; >> - goto done; >> >> -share_err: >> - clear_bit_inv(apid, matrix_mdev->matrix.apm); >> done: >> mutex_unlock(&matrix_dev->lock); >> >> @@ -831,6 +855,7 @@ static ssize_t assign_domain_store(struct device *dev, >> { >> int ret; >> unsigned long apqi; >> + DECLARE_BITMAP(aqm, AP_DOMAINS); >> struct mdev_device *mdev = mdev_from_dev(dev); >> struct ap_matrix_mdev *matrix_mdev = mdev_get_drvdata(mdev); >> unsigned long max_apqi = matrix_mdev->matrix.aqm_max; >> @@ -851,18 +876,18 @@ static ssize_t assign_domain_store(struct device *dev, >> if (ret) >> goto done; >> >> - set_bit_inv(apqi, matrix_mdev->matrix.aqm); >> + memset(aqm, 0, sizeof(aqm)); >> + set_bit_inv(apqi, aqm); >> >> - ret = vfio_ap_mdev_verify_no_sharing(matrix_mdev); >> + ret = vfio_ap_mdev_verify_no_sharing(matrix_mdev, >> + matrix_mdev->matrix.apm, aqm); >> if (ret) >> - goto share_err; >> + goto done; >> >> + set_bit_inv(apqi, matrix_mdev->matrix.aqm); >> vfio_ap_mdev_link_queues(matrix_mdev, LINK_APQI, apqi); >> ret = count; >> - goto done; >> >> -share_err: >> - clear_bit_inv(apqi, matrix_mdev->matrix.aqm); >> done: >> mutex_unlock(&matrix_dev->lock); >> >> @@ -1442,3 +1467,14 @@ void vfio_ap_mdev_remove_queue(struct ap_device *apdev) >> kfree(q); >> mutex_unlock(&matrix_dev->lock); >> } >> + >> +bool vfio_ap_mdev_resource_in_use(unsigned long *apm, unsigned long *aqm) >> +{ >> + bool in_use; >> + >> + mutex_lock(&matrix_dev->lock); >> + in_use = !!vfio_ap_mdev_verify_no_sharing(NULL, apm, aqm); >> + mutex_unlock(&matrix_dev->lock); >> + >> + return in_use; >> +} >> diff --git a/drivers/s390/crypto/vfio_ap_private.h b/drivers/s390/crypto/vfio_ap_private.h >> index 4e5cc72fc0db..c1d8b5507610 100644 >> --- a/drivers/s390/crypto/vfio_ap_private.h >> +++ b/drivers/s390/crypto/vfio_ap_private.h >> @@ -105,4 +105,6 @@ struct vfio_ap_queue { >> int vfio_ap_mdev_probe_queue(struct ap_device *queue); >> void vfio_ap_mdev_remove_queue(struct ap_device *queue); >> >> +bool vfio_ap_mdev_resource_in_use(unsigned long *apm, unsigned long *aqm); >> + >> #endif /* _VFIO_AP_PRIVATE_H_ */