Received: by 2002:a05:6a10:17d3:0:0:0:0 with SMTP id hz19csp3112284pxb; Tue, 20 Apr 2021 00:24:14 -0700 (PDT) X-Google-Smtp-Source: ABdhPJwiDe+ZlERb3TjGkYx5UhOLsbO5nCPZg4pOzLw5mlY/8nsquK4rNj629e8sStKph7SNW2IC X-Received: by 2002:a05:6402:c7:: with SMTP id i7mr30657475edu.33.1618903454419; Tue, 20 Apr 2021 00:24:14 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1618903454; cv=none; d=google.com; s=arc-20160816; b=cjICMwdTf+rNFJtsDglKm5kUTx/TiGND48jRKeQE7U1HAdlA2nJ602am3JN6XX3klE 2I30+VlinPVtwUT5wT80qcxQ4BX7dOc1k+SZ+81jLwBWZfuO3NO4FS3Ufq8C0pZSjcF1 nisAyDDZkdc7G04XiAM+0I9E46gQQlbCKT3FVl/5XZRdyuQFx4t1rD6FG6laVnQfGcGS jjrmQ6tV3G3EAqedjmS3KTZtnO/2GZaFN23CNt9Jtw8OSAJWJVKc3AeNUgzzG/aNyG4r tT8G860VrRVf0tjElMQ3BCfU8CcjrZQcZkPRI7PwEri1cDWpHz3RA5jwPUbrhIReo5iY aUIw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:dkim-signature; bh=iOjPfwXb4OBqjnrtphdyCIjJaLUOZwAgc5HsS5Qcaz4=; b=d6RnBTwkRIn/i4dSe5lzLxJOAhUYh4S1iOJB6fHBFK/gh8HTEL65bG98TH9Pp9WNpi pU1JEs34ABOLxAZOqHQRpglE61lnqRAhJbOpEx+BfEx5u2ikf1RdwSpTfynsF7TG7fr4 CkJQ6rM7Bh8czXPXlFDwY29ukNw12M6UFNuMe4Bm6CEzEP2vXgtwPiZOqWxG3LMFdfvz A9b4YuSLnIrfmhBEyIT15BYiPWqQIFy6P1vwO653tjOQSFD5VqNmIqZ0CbY0NVE1PPGL cxO5oz8ek8VxFEA/Rufh/Sat8otuJkiUX6An3uXNSXSstE68HBHN1ob7x/etZxLL5EpB KmEQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b="RXPO/zWm"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id a8si6733882eds.138.2021.04.20.00.23.50; Tue, 20 Apr 2021 00:24:14 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b="RXPO/zWm"; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229559AbhDTHX2 (ORCPT + 99 others); Tue, 20 Apr 2021 03:23:28 -0400 Received: from us-smtp-delivery-124.mimecast.com ([170.10.133.124]:43246 "EHLO us-smtp-delivery-124.mimecast.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229471AbhDTHX2 (ORCPT ); Tue, 20 Apr 2021 03:23:28 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1618903376; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=iOjPfwXb4OBqjnrtphdyCIjJaLUOZwAgc5HsS5Qcaz4=; b=RXPO/zWmQvcXEL1Aob8gMY9ohcoZXIM6JTQn+tDka+axltg93wf3Hz5sg5Wqtf8uTXiO/i GSCiTQSmgpCrIuEnBRRR+wK3b9Ujia77jY4T4fWX+Mw5RHwPt9WhE+Gk9u1ZWLi8KSBfg2 vggzYdksWeJzDtS05Yz8B+VspryW8+U= Received: from mail-ej1-f71.google.com (mail-ej1-f71.google.com [209.85.218.71]) (Using TLS) by relay.mimecast.com with ESMTP id us-mta-444-SGZ9kiXQMryl9xOEIJBbOA-1; Tue, 20 Apr 2021 03:22:55 -0400 X-MC-Unique: SGZ9kiXQMryl9xOEIJBbOA-1 Received: by mail-ej1-f71.google.com with SMTP id r17-20020a1709069591b029037cf6a4a56dso4454127ejx.12 for ; Tue, 20 Apr 2021 00:22:55 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:subject:to:cc:references:from:message-id:date :user-agent:mime-version:in-reply-to:content-language :content-transfer-encoding; bh=iOjPfwXb4OBqjnrtphdyCIjJaLUOZwAgc5HsS5Qcaz4=; b=V1lcH6cphCihXhpiuKH4ikmxAwSghbaMKesmm+HyDsLCDQQKCsZ8U1aIXXRcUQwdmg VGyqbU5Mz7q+i8yL0PoKnzS+BO1wqt295R7Dclj9PVs8FUcCaS2RQaYo23wuHRmPMQcy nuQMIU2BXd1f4ZkncLv9Vmbd1PjmnqxtDe1gN8yAhrRC7KK8d65Xd8/4gQOhABluHkoh DvxZvJYTp8jnMNZ4J9pXeD/ZWyci8tq+NFrvDlLCU3TUoCdASxEgK66JrWFvpS4HK4mT 3ytmisYWbFLio5tl8wx5W93pb94IfQ3SeZzV0uzNrzI+NjPen6KWphhxZx8mlMAHSRjW poLA== X-Gm-Message-State: AOAM533N1oKSWsQxD8MyFctymI0rsgkmNNwdIYDG7qaxj+KafJo0FC+f Zc24/vb0aos9MdAkDHqfNDOtIIM/L5k4EoEIJGOQQpX9lX1E9rng36yhWeNdF3dOmqGdDCUM66Y 4fhAJWTM065YXsC5ravbOCyNc X-Received: by 2002:a17:906:c1c5:: with SMTP id bw5mr4745272ejb.510.1618903374226; Tue, 20 Apr 2021 00:22:54 -0700 (PDT) X-Received: by 2002:a17:906:c1c5:: with SMTP id bw5mr4745246ejb.510.1618903373975; Tue, 20 Apr 2021 00:22:53 -0700 (PDT) Received: from ?IPv6:2001:b07:6468:f312:c8dd:75d4:99ab:290a? ([2001:b07:6468:f312:c8dd:75d4:99ab:290a]) by smtp.gmail.com with ESMTPSA id p4sm14795453edr.43.2021.04.20.00.22.52 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Tue, 20 Apr 2021 00:22:53 -0700 (PDT) Subject: Re: [PATCH] KVM: Boost vCPU candidiate in user mode which is delivering interrupt To: Wanpeng Li Cc: Sean Christopherson , LKML , kvm , Vitaly Kuznetsov , Wanpeng Li , Jim Mattson , Joerg Roedel References: <1618542490-14756-1-git-send-email-wanpengli@tencent.com> <9c49c6ff-d896-e6a5-c051-b6707f6ec58a@redhat.com> From: Paolo Bonzini Message-ID: Date: Tue, 20 Apr 2021 09:22:51 +0200 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:78.0) Gecko/20100101 Thunderbird/78.7.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 20/04/21 08:08, Wanpeng Li wrote: > On Tue, 20 Apr 2021 at 14:02, Wanpeng Li wrote: >> >> On Tue, 20 Apr 2021 at 00:59, Paolo Bonzini wrote: >>> >>> On 19/04/21 18:32, Sean Christopherson wrote: >>>> If false positives are a big concern, what about adding another pass to the loop >>>> and only yielding to usermode vCPUs with interrupts in the second full pass? >>>> I.e. give vCPUs that are already in kernel mode priority, and only yield to >>>> handle an interrupt if there are no vCPUs in kernel mode. >>>> >>>> kvm_arch_dy_runnable() pulls in pv_unhalted, which seems like a good thing. >>> >>> pv_unhalted won't help if you're waiting for a kernel spinlock though, >>> would it? Doing two passes (or looking for a "best" candidate that >>> prefers kernel mode vCPUs to user mode vCPUs waiting for an interrupt) >>> seems like the best choice overall. >> >> How about something like this: I was thinking of something simpler: diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c index 9b8e30dd5b9b..455c648f9adc 100644 --- a/virt/kvm/kvm_main.c +++ b/virt/kvm/kvm_main.c @@ -3198,10 +3198,9 @@ void kvm_vcpu_on_spin(struct kvm_vcpu *me, bool yield_to_kernel_mode) { struct kvm *kvm = me->kvm; struct kvm_vcpu *vcpu; - int last_boosted_vcpu = me->kvm->last_boosted_vcpu; int yielded = 0; int try = 3; - int pass; + int pass, num_passes = 1; int i; kvm_vcpu_set_in_spin_loop(me, true); @@ -3212,13 +3211,14 @@ void kvm_vcpu_on_spin(struct kvm_vcpu *me, bool yield_to_kernel_mode) * VCPU is holding the lock that we need and will release it. * We approximate round-robin by starting at the last boosted VCPU. */ - for (pass = 0; pass < 2 && !yielded && try; pass++) { - kvm_for_each_vcpu(i, vcpu, kvm) { - if (!pass && i <= last_boosted_vcpu) { - i = last_boosted_vcpu; - continue; - } else if (pass && i > last_boosted_vcpu) - break; + for (pass = 0; pass < num_passes; pass++) { + int idx = me->kvm->last_boosted_vcpu; + int n = atomic_read(&kvm->online_vcpus); + for (i = 0; i < n; i++, idx++) { + if (idx == n) + idx = 0; + + vcpu = kvm_get_vcpu(kvm, idx); if (!READ_ONCE(vcpu->ready)) continue; if (vcpu == me) @@ -3226,23 +3226,36 @@ void kvm_vcpu_on_spin(struct kvm_vcpu *me, bool yield_to_kernel_mode) if (rcuwait_active(&vcpu->wait) && !vcpu_dy_runnable(vcpu)) continue; - if (READ_ONCE(vcpu->preempted) && yield_to_kernel_mode && - !kvm_arch_vcpu_in_kernel(vcpu)) - continue; if (!kvm_vcpu_eligible_for_directed_yield(vcpu)) continue; + if (READ_ONCE(vcpu->preempted) && yield_to_kernel_mode && + !kvm_arch_vcpu_in_kernel(vcpu)) { + /* + * A vCPU running in userspace can get to kernel mode via + * an interrupt. That's a worse choice than a CPU already + * in kernel mode so only do it on a second pass. + */ + if (!vcpu_dy_runnable(vcpu)) + continue; + if (pass == 0) { + num_passes = 2; + continue; + } + } + yielded = kvm_vcpu_yield_to(vcpu); if (yielded > 0) { kvm->last_boosted_vcpu = i; - break; + goto done; } else if (yielded < 0) { try--; if (!try) - break; + goto done; } } } +done: kvm_vcpu_set_in_spin_loop(me, false); /* Ensure vcpu is not eligible during next spinloop */ Paolo