Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp342729ybz; Wed, 29 Apr 2020 00:59:59 -0700 (PDT) X-Google-Smtp-Source: APiQypKjTgRtdvB1K0tTVGuX15Wo5hcOA3hf72oAx/L4g6MTJYQqLBp4wzKx/kzKc5N0bSgVBfzC X-Received: by 2002:a05:6402:1c07:: with SMTP id ck7mr1348279edb.202.1588147198841; Wed, 29 Apr 2020 00:59:58 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588147198; cv=none; d=google.com; s=arc-20160816; b=tNj0wFGWHhiWuy/ufqQL2RhBl2r/2nK/ptyYdROuBCGpRj06AicrBoi5F3NCA5Djmp xB2AWBFeqGa7gkvbw+rA5JMmwO5w3HaBUjkbGEMcrgfC6N1lpQM09yYLRSiJB8SgZjxo 1Qz7hytpQBVbnYJYuBK3b9ymgeVh/TAhycjTxxjXXaXEZsy+VpQ/xbLMj8vYJxY97q/d pmg6lPFsKLUBB1Evi7z5h/xz2mu24YtSmmjVHyRQ/JgmdhlpT2G35sTycE6SNCcDXIL0 jYwT3vUlMM8hvTQSpXeuLWgd2zuKjRDdIFlywEjpEVcrGDc9C9xbcaEf1fZLKKo1x2oh NlWA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-language :content-transfer-encoding:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject; bh=MuDKcNbtyrXMqRBqdVIEkkXoV7L06Q+IVbgKv2IMDGU=; b=va7AQ6EXJz0XTU8nxKgEhsIE+heumVYoNwYlRa5rInc7rwRSXPXqKtFBuGB1a4WMSX KF4tOb+UHfqwXMlI0KPdg8U6+9U337l/5SSz49P2N6MPEm4AW2uVE6s+nAkw6xWpZ2vw xv9IQqFQUd1DC0xBdptGwq4mN3unoiSGcbxq9CwEJ2pT2jm94TXIblzxqFCZ5M68q+vG FrbfS+q9DZMvTYfwbQwcblGWxFu9VI+RuTwWxOT6h94vUohlTyGqSEOzUITgrCB2Y6ST 8bnSkYbFRiTYK0sOueJy7fNlOdL0m4ymButd7IjNPfxg06m0Z1bjnsgQ2eqQzuu1T4iP 3g+A== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id l26si3331285eje.378.2020.04.29.00.59.35; Wed, 29 Apr 2020 00:59:58 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ibm.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726526AbgD2H4P (ORCPT + 99 others); Wed, 29 Apr 2020 03:56:15 -0400 Received: from mx0b-001b2d01.pphosted.com ([148.163.158.5]:33360 "EHLO mx0b-001b2d01.pphosted.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726175AbgD2H4P (ORCPT ); Wed, 29 Apr 2020 03:56:15 -0400 Received: from pps.filterd (m0127361.ppops.net [127.0.0.1]) by mx0a-001b2d01.pphosted.com (8.16.0.42/8.16.0.42) with SMTP id 03T7XlOL144536; Wed, 29 Apr 2020 03:56:10 -0400 Received: from pps.reinject (localhost [127.0.0.1]) by mx0a-001b2d01.pphosted.com with ESMTP id 30mg17vwgp-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Wed, 29 Apr 2020 03:56:10 -0400 Received: from m0127361.ppops.net (m0127361.ppops.net [127.0.0.1]) by pps.reinject (8.16.0.36/8.16.0.36) with SMTP id 03T7Xlfr144547; Wed, 29 Apr 2020 03:56:10 -0400 Received: from ppma04ams.nl.ibm.com (63.31.33a9.ip4.static.sl-reverse.com [169.51.49.99]) by mx0a-001b2d01.pphosted.com with ESMTP id 30mg17vwg4-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Wed, 29 Apr 2020 03:56:10 -0400 Received: from pps.filterd (ppma04ams.nl.ibm.com [127.0.0.1]) by ppma04ams.nl.ibm.com (8.16.0.27/8.16.0.27) with SMTP id 03T7okS0017741; Wed, 29 Apr 2020 07:56:08 GMT Received: from b06avi18878370.portsmouth.uk.ibm.com (b06avi18878370.portsmouth.uk.ibm.com [9.149.26.194]) by ppma04ams.nl.ibm.com with ESMTP id 30mcu6yy2j-1 (version=TLSv1.2 cipher=ECDHE-RSA-AES256-GCM-SHA384 bits=256 verify=NOT); Wed, 29 Apr 2020 07:56:08 +0000 Received: from d06av24.portsmouth.uk.ibm.com (d06av24.portsmouth.uk.ibm.com [9.149.105.60]) by b06avi18878370.portsmouth.uk.ibm.com (8.14.9/8.14.9/NCO v10.0) with ESMTP id 03T7u54T65405348 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-GCM-SHA384 bits=256 verify=OK); Wed, 29 Apr 2020 07:56:05 GMT Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 52FF34203F; Wed, 29 Apr 2020 07:56:05 +0000 (GMT) Received: from d06av24.portsmouth.uk.ibm.com (unknown [127.0.0.1]) by IMSVA (Postfix) with ESMTP id 69AC842042; Wed, 29 Apr 2020 07:56:04 +0000 (GMT) Received: from funtu.home (unknown [9.171.57.13]) by d06av24.portsmouth.uk.ibm.com (Postfix) with ESMTP; Wed, 29 Apr 2020 07:56:04 +0000 (GMT) Subject: Re: [PATCH v7 01/15] s390/vfio-ap: store queue struct in hash table for quick access To: Tony Krowiak , Halil Pasic Cc: linux-s390@vger.kernel.org, linux-kernel@vger.kernel.org, kvm@vger.kernel.org, borntraeger@de.ibm.com, cohuck@redhat.com, mjrosato@linux.ibm.com, pmorel@linux.ibm.com, alex.williamson@redhat.com, kwankhede@nvidia.com, jjherne@linux.ibm.com, fiuczy@linux.ibm.com References: <20200407192015.19887-1-akrowiak@linux.ibm.com> <20200407192015.19887-2-akrowiak@linux.ibm.com> <20200424055732.7663896d.pasic@linux.ibm.com> <20200427171739.76291a74.pasic@linux.ibm.com> <6ea12752-d23f-abe4-8d5f-3e7738984576@linux.ibm.com> <20200428120726.3f769ce3.pasic@linux.ibm.com> From: Harald Freudenberger Message-ID: Date: Wed, 29 Apr 2020 09:56:05 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.7.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 8bit Content-Language: en-US X-TM-AS-GCONF: 00 X-Proofpoint-Virus-Version: vendor=fsecure engine=2.50.10434:6.0.138,18.0.676 definitions=2020-04-29_02:2020-04-28,2020-04-29 signatures=0 X-Proofpoint-Spam-Details: rule=outbound_notspam policy=outbound score=0 clxscore=1015 malwarescore=0 suspectscore=0 lowpriorityscore=0 spamscore=0 phishscore=0 priorityscore=1501 adultscore=0 bulkscore=0 mlxlogscore=999 mlxscore=0 impostorscore=0 classifier=spam adjust=0 reason=mlx scancount=1 engine=8.12.0-2003020000 definitions=main-2004290057 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 29.04.20 00:30, Tony Krowiak wrote: > > > On 4/28/20 6:57 AM, Harald Freudenberger wrote: >> On 28.04.20 12:07, Halil Pasic wrote: >>> On Mon, 27 Apr 2020 17:48:58 -0400 >>> Tony Krowiak wrote: >>> >>>> On 4/27/20 11:17 AM, Halil Pasic wrote: >>>>> On Mon, 27 Apr 2020 15:05:23 +0200 >>>>> Harald Freudenberger wrote: >>>>> >>>>>> On 24.04.20 05:57, Halil Pasic wrote: >>>>>>> On Tue,  7 Apr 2020 15:20:01 -0400 >>>>>>> Tony Krowiak wrote: >>>>>>>    >>>>>>>> Rather than looping over potentially 65535 objects, let's store the >>>>>>>> structures for caching information about queue devices bound to the >>>>>>>> vfio_ap device driver in a hash table keyed by APQN. >>>>>>> @Harald: >>>>>>> Would it make sense to make the efficient lookup of an apqueue base >>>>>>> on its APQN core AP functionality instead of each driver figuring it out >>>>>>> on it's own? >>>>>>> >>>>>>> If I'm not wrong the zcrypt device/driver(s) must the problem of >>>>>>> looking up a queue based on its APQN as well. >>>>>>> >>>>>>> For instance struct ep11_cprb has a target_id filed >>>>>>> (arch/s390/include/uapi/asm/zcrypt.h). >>>>>>> >>>>>>> Regards, >>>>>>> Halil >>>>>> Hi Halil >>>>>> >>>>>> no, the zcrypt drivers don't have this problem. They build up their own device object which >>>>>> includes a pointer to the base ap device. >>>>> I'm a bit confused. Doesn't your code loop first trough the ap_card >>>>> objects to find the APID portion of the APQN, and then loop the queue >>>>> list of the matching card to find the right ap_queue object? Or did I >>>>> miss something? Isn't that what _zcrypt_send_ep11_cprb() does? Can you >>>>> point me to the code that avoids the lookup (by apqn) for zcrypt? >>>> The code you reference, _zcrypt_send_ep11_cprb(), does loop through >>>> each queue associated with each card, but it doesn't appear to be >>>> looking for >>>> a queue with a particular APQN. It appears to be looking for a queue >>>> meeting a specific set of conditions. At least that's my take after >>>> taking a very >>>> brief look at the code, so I'm not sure that applies here. >>>> >>> One of the possible conditions is that the APQN is in the targets array. >>> Please have another look at the code below, is_desired_ep11_queue() >>> and is_desired_ep11_card() do APQI and APID part of the check >>> respectively: >>> >>>          for_each_zcrypt_card(zc) { >>>                  /* Check for online EP11 cards */ >>>                  if (!zc->online || !(zc->card->functions & 0x04000000)) >>>                          continue; >>>                  /* Check for user selected EP11 card */ >>>                  if (targets && >>>                      !is_desired_ep11_card(zc->card->id, target_num, targets)) >>>                          continue; >>>                  /* check if device node has admission for this card */ >>>                  if (!zcrypt_check_card(perms, zc->card->id)) >>>                          continue; >>>                  /* get weight index of the card device  */ >>>                  weight = speed_idx_ep11(func_code) * zc->speed_rating[SECKEY]; >>>                  if (zcrypt_card_compare(zc, pref_zc, weight, pref_weight)) >>>                          continue; >>>                  for_each_zcrypt_queue(zq, zc) { >>>                          /* check if device is online and eligible */ >>>                          if (!zq->online || >>>                              !zq->ops->send_ep11_cprb || >>>                              (targets && >>>                               !is_desired_ep11_queue(zq->queue->qid, >>>                                                      target_num, targets))) >>> >>> >>> Yes the size of targets may or may not be 1 (example for size == 1 is >>> the invocation form ep11_cryptsingle()) and the respective costs >>> depend on the usual size of the array. Since the goal of the whole >>> exercise seems to be to pick a single queue, and we settle with the first >>> suitable (first not in the input array, but in our lists) that is >>> suitable, I assumed we wouldn't need many hashtable lookups. >>> >>> Regards, >>> Halil >> again, this is all code related to zcrypt card and queues and has nothing directly to do with ap queue and ap cards. >> If you want to have a look how this works for ap devices, have a look into the scan routines for the ap bus in ap_bus.c >> There you can find a bus_for_each_device() which would fit together with the right matching function for your needs. >> And this is exactly what Tony implemented in the first shot. However, as written I can provide something like that >> for you. >> One note for the improvement via hash list with the argument about the max 65535 objects. >> Think about a real big machine which has currently up to 30 crypto cards (z15 GA1.5) which when CEX7S are >> plugged appear as 60 crypto adapters and have up to 85 domains each. When all these crypto resources >> are assigned to one LPAR we end up in 60x85 = 5100 APQNs. Well, of course with a hash you can improve >> the linear search through an array or list but can you measure the performance gain and then compare this >> to the complexity.  ... just some thoughts about beautifying code ... > > I set up a test case to compare searching using a hashtable verses using a list. > I created both a hashtable and a list of 5100 objects. Each structure had a single > APQN field. I then randomly searched both the hashtable and the list for > each APQN. The following table contains the result of 5 test runs. The elapsed > times are in nanoseconds. > > Test:                              List Search    Hashtable Search > ------                              ----------- ---------------- > Avg. Per APQN:             11651           81 > Total per 5500 APQNs:  60164268     1085368 > > Avg. Per APQN:              10925           78 > Total per 5500 APQNs:   56482780    1084590 > > Avg. Per APQN:              10190           80 > Total per 5500 APQNs:   52714920    1123205 > > Avg. Per APQN:              8431             76 > Total per 5500 APQNs:   43748838    1061414 > > Avg. Per APQN:              9678             75 > Total per 5500 APQNs:   50103437    1044427 > ----------------------------------------------- > Per APQN Search Avg:   10175          78            Hashtable is 130 times faster > Total Search 5500 Avg:  52642848    1079800  Hashtable is 49 times faster > > Note that the list search was just a straight search of an object in a list, not > a device attached to a bus. I don't know if that would add time, but it seems > that the savings using a hashtable are significant. Halil, I did not say that a hashtable is not faster than a linear list. The only thing I wanted to express is that we are adding complexity and performance improving code which is not even integrated somewhere. We are beautifying here. > > So I have two questions: > > 1. Would it make more sense to provide AP bus interfaces to search for >     queue devices by APQN? > > 2. If so, shall we store the queue devices in a hashtable to make the >     searches more efficient? If there is a decision to implement this as a feature function within the AP bus base code, I will use the bus functions provided by the kernel common code. So here this will be a bus_for_each_device() together with a filter function. I don't know how this is implemented within the common bus code. > > >>>>> If you look at the new function of vfio_ap_get_queue(unsigned long apqn) >>>>> it basically about finding the queue based on the apqn, with the >>>>> difference that it is vfio specific. >>>>> >>>>> Regards, >>>>> Halil >>>>> >>>>>> However, this is not a big issue, as the ap_bus holds a list of ap_card objects and within each >>>>>> ap_card object there exists a list of ap_queues. >>>>> >