Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp5476227imm; Sun, 22 Jul 2018 23:41:38 -0700 (PDT) X-Google-Smtp-Source: AAOMgpfT/o9wcFdnAQRSqevdj2E9FcWF/8yAu2tX0s43gh/qsw62JIJVH+L6NzC8tKik8TrzRDml X-Received: by 2002:a63:920c:: with SMTP id o12-v6mr11080055pgd.141.1532328097984; Sun, 22 Jul 2018 23:41:37 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1532328097; cv=none; d=google.com; s=arc-20160816; b=Cwk/N7fgyk0KV+tgZjbwe4+jluRGIAbWhShc8AGf+AFuPVmiXsRwjoA0OVnUTvbBrz z9oTxnXWbZWetc8i9Gs/WB8V4O4WnxNlxcmpvVWoXdDN8N0tMinGQHgEtd3t+qipOxOJ ccTe5jgEEEnvfEELEbBg4Ina0NGf1fWLOryOokMI4KY8lhmtEKGWSJubvRnvY2x4xfcD sYdQZ3UNlehYoyDXvbGk/3R6kDJK8nVjAPQwl1r0UpeKLRk33D4R8PSm8ajoKLF+uhp8 GcpxMYrnMJq9H9c2og8QqqOl0YMjoy03t6bzU95L1HePso+h2OCMzCUsaLmek7tT8YqA siBA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:message-id:date:subject:cc:to:from :dkim-signature:arc-authentication-results; bh=07tgC2NhRGJJFeudTQ2D6feQ+qtfTI+yBjXaPz1hor0=; b=wv0Uk07Nvfw5rftqkAo4sd6uauKIAJh1/XevGG8UHV0wS2BfsyJqL8tHZ+vtHR5y+b LDFCdprdcVgo2l+/wgOI4w0GFS1cuunJxPyGiI5j/tspGN0qJkpaWrlmgKuL5NQkrvdJ HAQjJQENO4VYlKi/tTD68++5KY7JQCNkx3Yx8/YhYvyb3v8drBMdQI63BJ0M1GpZNfe0 YfBFEPMrKB+59NWV09gvMhtqLLRkvd3p3JYCmqy7G6QHKfWj1xuRnvIWKG52INNdOqDn agmSMRiI2veqTmMCCnL0CMcwNS0GAzdIpKZOtQ/qNXGgKqld1oVf6XTQEX4ibAi+Hbzy mipA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=a4jAXSIt; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g16-v6si7923416pgk.465.2018.07.22.23.41.23; Sun, 22 Jul 2018 23:41:37 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=a4jAXSIt; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2388073AbeGWHju (ORCPT + 99 others); Mon, 23 Jul 2018 03:39:50 -0400 Received: from mail-pf1-f194.google.com ([209.85.210.194]:41059 "EHLO mail-pf1-f194.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388004AbeGWHjt (ORCPT ); Mon, 23 Jul 2018 03:39:49 -0400 Received: by mail-pf1-f194.google.com with SMTP id y10-v6so645687pfn.8; Sun, 22 Jul 2018 23:40:10 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=from:to:cc:subject:date:message-id:in-reply-to:references :mime-version:content-transfer-encoding; bh=07tgC2NhRGJJFeudTQ2D6feQ+qtfTI+yBjXaPz1hor0=; b=a4jAXSItfPiRPjZlsUanDHzdWnGGBVc8UjCPRtjQdiYL3lZjaPS5WLaifHPGJ5OQNI 2qjOOqYSTXb8QOxDzOtg0ZMRP6LOGrFpbCCTouZmIv4Ser4u2poSb4rL3OZ9u9qTs3m+ ZZT+EpxkSCUZZkHB/dvtnecP6Kkx9LsI1VexNeMLUzsGfQxYdHnFo2yf6/vER0f58rr+ dkWJsticF0MoHEh5S6M/2ktv1EtHxBN73FvEryRpDXkmszCMMv1EFNUQporYXHt/Q68+ +bPOvWUu0ETJiGZ9+LvO+2h0K3fkXPLLHRTtZIYj6JmdP6z9plduBv26P8itKqfVoX7L 6N4Q== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:from:to:cc:subject:date:message-id:in-reply-to :references:mime-version:content-transfer-encoding; bh=07tgC2NhRGJJFeudTQ2D6feQ+qtfTI+yBjXaPz1hor0=; b=B0R4V7rclqLpS6g+loLY0IkWvFYIk5cWpTEziH0YyU3TxJVZJ1erHa/pxyPb0I3cCv p8sWpnCiMRQ0tGB1mMSKEVYm85fXgtwDlHqKf7Ec4NN+t/qDFtTw/5hhfKMM3nHxvZDI /fXJhhk6aUBE+p9XysDEhNzyWYsWTqV7fJHVEZ5ODNb6N9lBQBE24kvDJwHnlLG8U1NY x+PJYAnFlWBeZJa5eILboCfMq49i9GfXg3kkKJRY4X8w+FWJ515YvjNE7vaWaVA4T3CY EwS2XtDTG5+iJ7AE5aFi3v3fJ1a4LYsgzNFwdft0j+O9O43F8vEmDOcU8BBtMzj8tUxn sITg== X-Gm-Message-State: AOUpUlFImiPL/wey4xr6oEueB+veCSpY8wiOfIzA3FXOhmhxE/u6dW64 wXUPcHOk/9KHe/PGD5/tME8PRzrg X-Received: by 2002:a63:9856:: with SMTP id l22-v6mr11343671pgo.208.1532328009609; Sun, 22 Jul 2018 23:40:09 -0700 (PDT) Received: from localhost.localdomain ([203.205.141.123]) by smtp.googlemail.com with ESMTPSA id p17-v6sm8165598pgv.26.2018.07.22.23.40.07 (version=TLS1_2 cipher=ECDHE-RSA-AES128-SHA bits=128/128); Sun, 22 Jul 2018 23:40:09 -0700 (PDT) From: Wanpeng Li X-Google-Original-From: Wanpeng Li To: linux-kernel@vger.kernel.org, kvm@vger.kernel.org Cc: Paolo Bonzini , =?UTF-8?q?Radim=20Kr=C4=8Dm=C3=A1=C5=99?= , Vitaly Kuznetsov Subject: [PATCH v5 4/6] KVM: X86: Implement PV IPIs send hypercall Date: Mon, 23 Jul 2018 14:39:54 +0800 Message-Id: <1532327996-17619-5-git-send-email-wanpengli@tencent.com> X-Mailer: git-send-email 2.7.4 In-Reply-To: <1532327996-17619-1-git-send-email-wanpengli@tencent.com> References: <1532327996-17619-1-git-send-email-wanpengli@tencent.com> MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org From: Wanpeng Li Using hypercall to send IPIs by one vmexit instead of one by one for xAPIC/x2APIC physical mode and one vmexit per-cluster for x2APIC cluster mode. Intel guest can enter x2apic cluster mode when interrupt remmaping is enabled in qemu, however, latest AMD EPYC still just supports xapic mode which can get great improvement by Exit-less IPIs. This patchset lets a guest send multicast IPIs, with at most 128 destinations per hypercall in 64-bit mode and 64 vCPUs per hypercall in 32-bit mode. Hardware: Xeon Skylake 2.5GHz, 2 sockets, 40 cores, 80 threads, the VM is 80 vCPUs, IPI microbenchmark(https://lkml.org/lkml/2017/12/19/141): x2apic cluster mode, vanilla Dry-run: 0, 2392199 ns Self-IPI: 6907514, 15027589 ns Normal IPI: 223910476, 251301666 ns Broadcast IPI: 0, 9282161150 ns Broadcast lock: 0, 8812934104 ns x2apic cluster mode, pv-ipi Dry-run: 0, 2449341 ns Self-IPI: 6720360, 15028732 ns Normal IPI: 228643307, 255708477 ns Broadcast IPI: 0, 7572293590 ns => 22% performance boost Broadcast lock: 0, 8316124651 ns x2apic physical mode, vanilla Dry-run: 0, 3135933 ns Self-IPI: 8572670, 17901757 ns Normal IPI: 226444334, 255421709 ns Broadcast IPI: 0, 19845070887 ns Broadcast lock: 0, 19827383656 ns x2apic physical mode, pv-ipi Dry-run: 0, 2446381 ns Self-IPI: 6788217, 15021056 ns Normal IPI: 219454441, 249583458 ns Broadcast IPI: 0, 7806540019 ns => 154% performance boost Broadcast lock: 0, 9143618799 ns Cc: Paolo Bonzini Cc: Radim Krčmář Cc: Vitaly Kuznetsov Signed-off-by: Wanpeng Li --- Documentation/virtual/kvm/hypercalls.txt | 20 +++++++++++++++++ arch/x86/kvm/x86.c | 37 ++++++++++++++++++++++++++++++++ 2 files changed, 57 insertions(+) diff --git a/Documentation/virtual/kvm/hypercalls.txt b/Documentation/virtual/kvm/hypercalls.txt index a890529..9895123 100644 --- a/Documentation/virtual/kvm/hypercalls.txt +++ b/Documentation/virtual/kvm/hypercalls.txt @@ -121,3 +121,23 @@ compute the CLOCK_REALTIME for its clock, at the same instant. Returns KVM_EOPNOTSUPP if the host does not use TSC clocksource, or if clock type is different than KVM_CLOCK_PAIRING_WALLCLOCK. + +6. KVM_HC_SEND_IPI +------------------------ +Architecture: x86 +Status: active +Purpose: Hypercall used to send IPIs. + +a0: lower part of the bitmap of destination APIC IDs +a1: higher part of the bitmap of destination APIC IDs +a2: the lowest APIC ID in bitmap +a3: APIC ICR + +The hypercall lets a guest send multicast IPIs, with at most 128 +128 destinations per hypercall in 64-bit mode and 64 vCPUs per +hypercall in 32-bit mode. The destinations are represented by a +bitmap contained in the first two arguments (a0 and a1). Bit 0 of +a0 corresponds to the APIC ID in the third argument (a2), bit 1 +corresponds to the APIC ID a2+1, and so on. + +Returns 0 if successfully delivery the IPIs and 1 if discarded. diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index 2b812b3..a43a29f 100644 --- a/arch/x86/kvm/x86.c +++ b/arch/x86/kvm/x86.c @@ -6691,6 +6691,40 @@ static void kvm_pv_kick_cpu_op(struct kvm *kvm, unsigned long flags, int apicid) kvm_irq_delivery_to_apic(kvm, NULL, &lapic_irq, NULL); } +/* + * Return 0 if successfully added and 1 if discarded. + */ +static int kvm_pv_send_ipi(struct kvm *kvm, unsigned long ipi_bitmap_low, + unsigned long ipi_bitmap_high, int min, int vector, int op_64_bit) +{ + int i; + struct kvm_apic_map *map; + struct kvm_vcpu *vcpu; + struct kvm_lapic_irq irq = { + .delivery_mode = APIC_DM_FIXED, + .vector = vector, + }; + int cluster_size = op_64_bit ? 64 : 32; + + rcu_read_lock(); + map = rcu_dereference(kvm->arch.apic_map); + + for_each_set_bit(i, &ipi_bitmap_low, cluster_size) { + vcpu = map->phys_map[min + i]->vcpu; + if (!kvm_apic_set_irq(vcpu, &irq, NULL)) + return 1; + } + + for_each_set_bit(i, &ipi_bitmap_high, cluster_size) { + vcpu = map->phys_map[min + i + cluster_size]->vcpu; + if (!kvm_apic_set_irq(vcpu, &irq, NULL)) + return 1; + } + + rcu_read_unlock(); + return 0; +} + void kvm_vcpu_deactivate_apicv(struct kvm_vcpu *vcpu) { vcpu->arch.apicv_active = false; @@ -6739,6 +6773,9 @@ int kvm_emulate_hypercall(struct kvm_vcpu *vcpu) case KVM_HC_CLOCK_PAIRING: ret = kvm_pv_clock_pairing(vcpu, a0, a1); break; + case KVM_HC_SEND_IPI: + ret = kvm_pv_send_ipi(vcpu->kvm, a0, a1, a2, a3, op_64_bit); + break; #endif default: ret = -KVM_ENOSYS; -- 2.7.4