Received: by 2002:a05:6a10:6d10:0:0:0:0 with SMTP id gq16csp855560pxb; Fri, 22 Apr 2022 12:43:01 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwKsgxLF8TuQ4XuBk25CmXKjNkxNIliXFXE6XXtniTnBDDXvFAhHNg3cuCNiecBZeYg4/Sx X-Received: by 2002:a05:6a00:cd2:b0:50a:7685:8055 with SMTP id b18-20020a056a000cd200b0050a76858055mr6544214pfv.37.1650656581585; Fri, 22 Apr 2022 12:43:01 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1650656581; cv=none; d=google.com; s=arc-20160816; b=kZwoZHAD+ULV9drvNBzJfa2zXcLQBnnAXl52/b6IXsvqwXuAYTy8G0kviVU/g+ICxL GKSedk/SdyPHNbmJFnXBIXjec89KoqNjHJEe2Zb9Ky7reeOU0sz0T7GrSHAAmlKM5Tk4 HaeTYKD6++N4cVUNwy5cJW2bfOm1Xl/uuMUMcbUozJoO/hQIxLHMztEsRwh2SjASh0Lg 36OTlZWiXok3O3uV5kOJHUEcxy6aVBzen0NJdhOSo09ZNPh231VY+UdN8A1SXtkYk00d ibeb2yrMHkmgLO000g/J3cwT4fHzNY+2VmV54PLECw1wjPbimYYO21HlKuS7eEILiGaa 23Xg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature; bh=+wRkA/MgdpGcvHCj6EQvY7xr281zOQma1KCz2azqeQs=; b=Ikw6cYxh6IodvFX2gYXdvjYMTF2AlOVE8WHU1xIBL/KvEYtik8UwkOGhEUMP74WO12 HCSlSMfUBmYi9Q9Asisrp3+B7ofUD8fkng5Dhk/1/M2js5G2yImaO7eBD2Q5CoaQrqP2 Cb2EpFllO8uwwoTFxShDqzxF4QxGQ7P9Ojdsx6D9gBRzjZ4FR/ELpE6c//cFMJqQGYvx 8OlV9FeP2BH4yctVj6TBIu+I63PE1wP/9/pXVcVst6F3rp903iWlCq1Lf4iJ2k9q+U++ +udgO7Jw/Hs4NuUU75WvSWsDW7B4PSmUuqNTbk+6QCnMwl40f+KhGXaiHWep19HG6Yyk qkPw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=jegRWHwu; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id 186-20020a6303c3000000b003a85cf1e1f0si9867288pgd.829.2022.04.22.12.43.01 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 22 Apr 2022 12:43:01 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@bytedance-com.20210112.gappssmtp.com header.s=20210112 header.b=jegRWHwu; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=bytedance.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 9C1B81334C2; Fri, 22 Apr 2022 11:48:17 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1388524AbiDUKrV (ORCPT + 99 others); Thu, 21 Apr 2022 06:47:21 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:34116 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1388505AbiDUKq7 (ORCPT ); Thu, 21 Apr 2022 06:46:59 -0400 Received: from mail-pl1-x62e.google.com (mail-pl1-x62e.google.com [IPv6:2607:f8b0:4864:20::62e]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 462422C13C for ; Thu, 21 Apr 2022 03:44:10 -0700 (PDT) Received: by mail-pl1-x62e.google.com with SMTP id u15so1076365ple.4 for ; Thu, 21 Apr 2022 03:44:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=bytedance-com.20210112.gappssmtp.com; s=20210112; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=+wRkA/MgdpGcvHCj6EQvY7xr281zOQma1KCz2azqeQs=; b=jegRWHwuhvBJz/GaSZkEzZ6hoEymXfQqnPcmwCl69wJvDDeaoqtr1Cgv89p6+Zcojk nqtq+U/nuH3bT2ntZlFkEkovPtp+1wfbLu9/yCpKyD550W3fMB/CAVm3c2UQ8ahpqHBb SCHYt7KMoslj6+KDnJh+IQ4kas56WDt/nv4AWtcqZlzM5p78Nzxzw2qQwvAdOtqhvhrr O36N1xKrj/jUQU1FB0MSDuDfiz2tYYeK7p+y7d3jKZWsa2swIVBfYs3mTV4kAHgMRVJh QN+otgEZFGMeADO0yF8zoVJ9skQhWhoJsU/4ofHlv/e5JUgI8Iua5xrOPMsU3mtEMHwP 5EDg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=+wRkA/MgdpGcvHCj6EQvY7xr281zOQma1KCz2azqeQs=; b=zQBYuVJprY9obzrN4w78PJCk2QGV5LPRnSEwTY/5h9UtECy2yvs59wRX3BSnteRP8o gTTH84H+rdBYTzirH9dIfmnpjwwvQQcQaZDT6tnp4XxTHtuDJC2d/kKhxM2s0EbLylEy XkwW3AAXUAOIKYB/MmLGSUQ12Dxvw3T+nPOJTJYo3ymSg0pBen7yV75AQQC8WwQse8Pc 0wLrzkh+jBTRaLjnG0836AxaCj/nZMk9Uccxx6wHrya+DFFMIk9u2ANFZ8hLRpmk16lV pyPbNGUycBtDkiaDrW70D5KnyDAQO+bUKY23TYPN6csK0qC7u04cO5sknqx/84b+/OHO gp3Q== X-Gm-Message-State: AOAM533c7QYH3NdvEcB9Sm9xFF8YiDfUBJHsbDNuu4aCceDtnpasVs1j n2R3JgTjV84I0796csqhj5pPPA== X-Received: by 2002:a17:902:e8c6:b0:158:f809:311f with SMTP id v6-20020a170902e8c600b00158f809311fmr20858174plg.4.1650537848321; Thu, 21 Apr 2022 03:44:08 -0700 (PDT) Received: from always-x1.bytedance.net ([61.120.150.69]) by smtp.gmail.com with ESMTPSA id w7-20020aa79547000000b0050ad0e82e6dsm3772485pfq.215.2022.04.21.03.44.02 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 21 Apr 2022 03:44:07 -0700 (PDT) From: zhenwei pi To: arei.gonglei@huawei.com, mst@redhat.com Cc: jasowang@redhat.com, herbert@gondor.apana.org.au, linux-kernel@vger.kernel.org, virtualization@lists.linux-foundation.org, linux-crypto@vger.kernel.org, helei.sig11@bytedance.com, davem@davemloft.net, zhenwei pi Subject: [PATCH v3 2/5] virtio-crypto: wait ctrl queue instead of busy polling Date: Thu, 21 Apr 2022 18:40:13 +0800 Message-Id: <20220421104016.453458-3-pizhenwei@bytedance.com> X-Mailer: git-send-email 2.25.1 In-Reply-To: <20220421104016.453458-1-pizhenwei@bytedance.com> References: <20220421104016.453458-1-pizhenwei@bytedance.com> MIME-Version: 1.0 Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,HEADER_FROM_DIFFERENT_DOMAINS,MAILING_LIST_MULTI,RDNS_NONE, SPF_HELO_NONE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Originally, after submitting request into virtio crypto control queue, the guest side polls the result from the virt queue. This works like following: CPU0 CPU1 ... CPUx CPUy | | | | \ \ / / \--------spin_lock(&vcrypto->ctrl_lock)-------/ | virtqueue add & kick | busy poll virtqueue | spin_unlock(&vcrypto->ctrl_lock) ... There are two problems: 1, The queue depth is always 1, the performance of a virtio crypto device gets limited. Multi user processes share a single control queue, and hit spin lock race from control queue. Test on Intel Platinum 8260, a single worker gets ~35K/s create/close session operations, and 8 workers get ~40K/s operations with 800% CPU utilization. 2, The control request is supposed to get handled immediately, but in the current implementation of QEMU(v6.2), the vCPU thread kicks another thread to do this work, the latency also gets unstable. Tracking latency of virtio_crypto_alg_akcipher_close_session in 5s: usecs : count distribution 0 -> 1 : 0 | | 2 -> 3 : 7 | | 4 -> 7 : 72 | | 8 -> 15 : 186485 |************************| 16 -> 31 : 687 | | 32 -> 63 : 5 | | 64 -> 127 : 3 | | 128 -> 255 : 1 | | 256 -> 511 : 0 | | 512 -> 1023 : 0 | | 1024 -> 2047 : 0 | | 2048 -> 4095 : 0 | | 4096 -> 8191 : 0 | | 8192 -> 16383 : 2 | | This means that a CPU may hold vcrypto->ctrl_lock as long as 8192~16383us. To improve the performance of control queue, a request on control queue waits completion instead of busy polling to reduce lock racing, and gets completed by control queue callback. CPU0 CPU1 ... CPUx CPUy | | | | \ \ / / \--------spin_lock(&vcrypto->ctrl_lock)-------/ | virtqueue add & kick | ---------spin_unlock(&vcrypto->ctrl_lock)------ / / \ \ | | | | wait wait wait wait Test this patch, the guest side get ~200K/s operations with 300% CPU utilization. Cc: Michael S. Tsirkin Cc: Jason Wang Cc: Gonglei Signed-off-by: zhenwei pi --- drivers/crypto/virtio/virtio_crypto_common.c | 42 +++++++++++++++----- drivers/crypto/virtio/virtio_crypto_common.h | 8 ++++ drivers/crypto/virtio/virtio_crypto_core.c | 2 +- 3 files changed, 41 insertions(+), 11 deletions(-) diff --git a/drivers/crypto/virtio/virtio_crypto_common.c b/drivers/crypto/virtio/virtio_crypto_common.c index e65125a74db2..93df73c40dd3 100644 --- a/drivers/crypto/virtio/virtio_crypto_common.c +++ b/drivers/crypto/virtio/virtio_crypto_common.c @@ -8,14 +8,21 @@ #include "virtio_crypto_common.h" +static void virtio_crypto_ctrlq_callback(struct virtio_crypto_ctrl_request *vc_ctrl_req) +{ + complete(&vc_ctrl_req->compl); +} + int virtio_crypto_ctrl_vq_request(struct virtio_crypto *vcrypto, struct scatterlist *sgs[], unsigned int out_sgs, unsigned int in_sgs, struct virtio_crypto_ctrl_request *vc_ctrl_req) { int err; - unsigned int inlen; unsigned long flags; + init_completion(&vc_ctrl_req->compl); + vc_ctrl_req->ctrl_cb = virtio_crypto_ctrlq_callback; + spin_lock_irqsave(&vcrypto->ctrl_lock, flags); err = virtqueue_add_sgs(vcrypto->ctrl_vq, sgs, out_sgs, in_sgs, vc_ctrl_req, GFP_ATOMIC); if (err < 0) { @@ -24,16 +31,31 @@ int virtio_crypto_ctrl_vq_request(struct virtio_crypto *vcrypto, struct scatterl } virtqueue_kick(vcrypto->ctrl_vq); - - /* - * Trapping into the hypervisor, so the request should be - * handled immediately. - */ - while (!virtqueue_get_buf(vcrypto->ctrl_vq, &inlen) && - !virtqueue_is_broken(vcrypto->ctrl_vq)) - cpu_relax(); - spin_unlock_irqrestore(&vcrypto->ctrl_lock, flags); + wait_for_completion(&vc_ctrl_req->compl); + return 0; } + +void virtcrypto_ctrlq_callback(struct virtqueue *vq) +{ + struct virtio_crypto *vcrypto = vq->vdev->priv; + struct virtio_crypto_ctrl_request *vc_ctrl_req; + unsigned long flags; + unsigned int len; + + spin_lock_irqsave(&vcrypto->ctrl_lock, flags); + do { + virtqueue_disable_cb(vq); + while ((vc_ctrl_req = virtqueue_get_buf(vq, &len)) != NULL) { + spin_unlock_irqrestore(&vcrypto->ctrl_lock, flags); + if (vc_ctrl_req->ctrl_cb) + vc_ctrl_req->ctrl_cb(vc_ctrl_req); + spin_lock_irqsave(&vcrypto->ctrl_lock, flags); + } + if (unlikely(virtqueue_is_broken(vq))) + break; + } while (!virtqueue_enable_cb(vq)); + spin_unlock_irqrestore(&vcrypto->ctrl_lock, flags); +} diff --git a/drivers/crypto/virtio/virtio_crypto_common.h b/drivers/crypto/virtio/virtio_crypto_common.h index d2a20fe6e13e..25b4f22e8605 100644 --- a/drivers/crypto/virtio/virtio_crypto_common.h +++ b/drivers/crypto/virtio/virtio_crypto_common.h @@ -81,6 +81,10 @@ struct virtio_crypto_sym_session_info { __u64 session_id; }; +struct virtio_crypto_ctrl_request; +typedef void (*virtio_crypto_ctrl_callback) + (struct virtio_crypto_ctrl_request *vc_ctrl_req); + /* * Note: there are padding fields in request, clear them to zero before sending to host, * Ex, virtio_crypto_ctrl_request::ctrl::u::destroy_session::padding[48] @@ -89,6 +93,8 @@ struct virtio_crypto_ctrl_request { struct virtio_crypto_op_ctrl_req ctrl; struct virtio_crypto_session_input input; struct virtio_crypto_inhdr ctrl_status; + virtio_crypto_ctrl_callback ctrl_cb; + struct completion compl; }; struct virtio_crypto_request; @@ -141,7 +147,9 @@ void virtio_crypto_skcipher_algs_unregister(struct virtio_crypto *vcrypto); int virtio_crypto_akcipher_algs_register(struct virtio_crypto *vcrypto); void virtio_crypto_akcipher_algs_unregister(struct virtio_crypto *vcrypto); +void virtcrypto_ctrlq_callback(struct virtqueue *vq); int virtio_crypto_ctrl_vq_request(struct virtio_crypto *vcrypto, struct scatterlist *sgs[], unsigned int out_sgs, unsigned int in_sgs, struct virtio_crypto_ctrl_request *vc_ctrl_req); + #endif /* _VIRTIO_CRYPTO_COMMON_H */ diff --git a/drivers/crypto/virtio/virtio_crypto_core.c b/drivers/crypto/virtio/virtio_crypto_core.c index c6f482db0bc0..e668d4b1bc6a 100644 --- a/drivers/crypto/virtio/virtio_crypto_core.c +++ b/drivers/crypto/virtio/virtio_crypto_core.c @@ -73,7 +73,7 @@ static int virtcrypto_find_vqs(struct virtio_crypto *vi) goto err_names; /* Parameters for control virtqueue */ - callbacks[total_vqs - 1] = NULL; + callbacks[total_vqs - 1] = virtcrypto_ctrlq_callback; names[total_vqs - 1] = "controlq"; /* Allocate/initialize parameters for data virtqueues */ -- 2.20.1