Received: by 2002:a6b:500f:0:0:0:0:0 with SMTP id e15csp180948iob; Mon, 2 May 2022 16:27:58 -0700 (PDT) X-Google-Smtp-Source: ABdhPJz6p2DzU4tiD2itnZBdk/MKhMEekcjW4RttI8Ft+kGOWV3lFMAJXgf6cbKfzUzgDipITSPv X-Received: by 2002:a65:6e8b:0:b0:3ab:a3fb:e95a with SMTP id bm11-20020a656e8b000000b003aba3fbe95amr11557557pgb.433.1651534077979; Mon, 02 May 2022 16:27:57 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1651534077; cv=none; d=google.com; s=arc-20160816; b=rXHfo0WIGkSNBACrSwIrEKDY7PPvh0UXfucPCexkUqlhyC8nupg9vOw2SE5IA7OjzK wSM2gri0bbXry/VFLru+pLlPCsuhVzTb3aww41uvhz2s12IjaCkv2TlUeQJvoMlQqMvv 39gS3NOWn3JXhPi0vrj6lqJkLOIZ9Yoai/JhNuu9gnmMjM0GdY+MpCEYXbK5wEc2s+Di uNQJtoo57ibIwtBLiepMpXMir/Cdx9vJAoYf5s701sJCO9lBAGVMn4a/ao3OqZflIKXO VxiLCATLg1vHCrO6xs1iUwhS3Et1cajcL5sepbWWGlW7OB2J822HaLLklXBc/ppJAU63 7z9Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:content-language :in-reply-to:mime-version:user-agent:date:message-id:from:references :cc:to:subject:reply-to:dkim-signature; bh=Dj6lOIpl1/fVHNjIgwNOiF+RPKGdSdTc7hK+FjT4aNs=; b=r56waW/Nx8FcccEy541soGWWiND0s2E7IoZg421NEPfKqYE8gjxlB8bKPWjG2Kicya 3Xc6BsZWGVYPj1G0+liwIC8fq5P1N5cx0jQ1S2rsom4JvGTdjx9iMwl5Tq32PEFOJXXd OOOA0yqlHxS4QPQzQ639pvMB8ZuP1GmeGsTsocnq/HPdB8sSEkaoCzRJcjKXALYTuzRA Xmz7hyPwZjo/sFbjQmRc+NNTV99T0wDowtNGWVpAW7Qeoza5k69uXIWFbYWaZvn2KrAA zoxw3IbNOf6kvv8GOyyyMtobNcB5tpXKCN/BolNw3bnIS7u9fO6Udbq2HgAjPCLgCeKs AmKA== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=fn19C6iT; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [23.128.96.19]) by mx.google.com with ESMTPS id gi18-20020a17090b111200b001d950ab2cfdsi560010pjb.75.2022.05.02.16.27.57 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 02 May 2022 16:27:57 -0700 (PDT) Received-SPF: softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) client-ip=23.128.96.19; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=fn19C6iT; spf=softfail (google.com: domain of transitioning linux-kernel-owner@vger.kernel.org does not designate 23.128.96.19 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id E49ECB7E7; Mon, 2 May 2022 16:27:46 -0700 (PDT) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1356982AbiEBCjB (ORCPT + 99 others); Sun, 1 May 2022 22:39:01 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36942 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229931AbiEBCi7 (ORCPT ); Sun, 1 May 2022 22:38:59 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id BFF9B1A82A for ; Sun, 1 May 2022 19:35:30 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1651458929; h=from:from:reply-to:reply-to:subject:subject:date:date: message-id:message-id:to:to:cc:cc:mime-version:mime-version: content-type:content-type: content-transfer-encoding:content-transfer-encoding: in-reply-to:in-reply-to:references:references; bh=Dj6lOIpl1/fVHNjIgwNOiF+RPKGdSdTc7hK+FjT4aNs=; b=fn19C6iTCRbQhcnzp43RtXruscp3gvO92YXD+kH/xaOCG7Qg7bt6nNeBo7uiizounhDNFV gBzP92T9iNFFwriCyXaeUno1fGGC4iZxFgztj4nenGwyXkIhvcGbth64s+OwxMuMBQQ4Am 7TnzLc9eXQHMLQlleyxEJ79yH3Jgngs= Received: from mimecast-mx02.redhat.com (mimecast-mx02.redhat.com [66.187.233.88]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-350-ilWeI8ciPjGVNZKyEVIsgw-1; Sun, 01 May 2022 22:35:18 -0400 X-MC-Unique: ilWeI8ciPjGVNZKyEVIsgw-1 Received: from smtp.corp.redhat.com (int-mx08.intmail.prod.int.rdu2.redhat.com [10.11.54.8]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mimecast-mx02.redhat.com (Postfix) with ESMTPS id 1192A1814501; Mon, 2 May 2022 02:35:18 +0000 (UTC) Received: from [10.72.12.86] (ovpn-12-86.pek2.redhat.com [10.72.12.86]) by smtp.corp.redhat.com (Postfix) with ESMTPS id 5DF15C27E97; Mon, 2 May 2022 02:35:10 +0000 (UTC) Reply-To: Gavin Shan Subject: Re: [PATCH v6 03/18] KVM: arm64: Add SDEI virtualization infrastructure To: Oliver Upton Cc: kvmarm@lists.cs.columbia.edu, linux-kernel@vger.kernel.org, eauger@redhat.com, Jonathan.Cameron@huawei.com, vkuznets@redhat.com, will@kernel.org, shannon.zhaosl@gmail.com, james.morse@arm.com, mark.rutland@arm.com, maz@kernel.org, pbonzini@redhat.com, shan.gavin@gmail.com References: <20220403153911.12332-1-gshan@redhat.com> <20220403153911.12332-4-gshan@redhat.com> <36899ea9-e8bd-27b2-8dfb-75b76eab50d7@redhat.com> <0e26da1a-00bb-3d63-a8bf-6cd3271b0a38@redhat.com> <96711526-c4f3-3b50-c015-beba8cc9fcc9@redhat.com> From: Gavin Shan Message-ID: <62f06a03-d6fc-3803-a2d2-7a85cf733459@redhat.com> Date: Mon, 2 May 2022 10:35:08 +0800 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.2.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Scanned-By: MIMEDefang 2.85 on 10.11.54.8 X-Spam-Status: No, score=-4.2 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE, T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Oliver, On 4/30/22 10:16 PM, Oliver Upton wrote: > On Sat, Apr 30, 2022 at 07:38:29PM +0800, Gavin Shan wrote: >> Thank you for the comments and details. It should work by using bitmaps >> to represent event's states. I will adopt your proposed structs in next >> respin. However, there are more states needed. So I would adjust >> "struct kvm_sdei_vcpu" like below in next respin. >> >> struct kvm_sdei_vcpu { >> unsigned long registered; /* the event is registered or not */ >> unsigned long enabled; /* the event is enabled or not */ >> unsigned long unregistering; /* the event is pending for unregistration */ > > I'm not following why we need to keep track of the 'pending unregister' > state directly. Is it not possible to infer from (active && !registered)? > The event can be unregistered and reseted through hypercalls when it's being handled. In this case, the unregistration for the event can't be done immediately and has to be delayed until the handling is finished. The unregistration pending state is used in this case. Yes, it's correct we also can use (active & !registered) to represent the state. >> unsigned long pending; /* the event is pending for delivery and handling */ >> unsigned long active; /* the event is currently being handled */ >> >> : >> >> }; >> >> I rename @pending to @unregister. Besides, there are two states added: >> >> @pending: Indicate there has one event has been injected. The next step >> for the event is to deliver it for handling. For one particular >> event, we allow one pending event in the maximum. > > Right, if an event retriggers when it is pending we still dispatch a > single event to the guest. And since we're only doing normal priority > events, it is entirely implementation defined which gets dispatched > first. > Yep, we will simply rely on find_first_bit() for the priority. It means the software signaled event, whose number is zero, will have the highest priority. >> @active: Indicate the event is currently being handled. The information >> stored in 'struct kvm_sdei_event_context' instance can be >> correlated with the event. > > Does this need to be a bitmap though? We can't ever have more than one > SDEI event active at a time since this is private to a vCPU. > Yes, one event is active at most on one particular vCPU. So tt don't have to be a bitmap necessarily. The reason I proposed to use bitmap for this state is to having all (event) states represented by bitmaps. In this way, all states are managed in a unified fashion. The alternative way is to have "unsigned long active_event", which traces the active event number. It also consumes 8-bytes when live migration is concerned. So I prefer a bitmap :) >> Furthermore, it's fair enough to put the (vcpu) mask state into 'flags' >> field of struct kvm_vcpu_arch :) > > I think you can get away with putting active in there too, I don't see > why we need more than a single bit for this info. > Not really. We just need one single bit for vCPU's mask state. We need multiple bits for event's active state, depending on how many events are supported. We need to know which event is currently active at least. For now, there are only two supported events (0/1), but one single bit is still not enough because there are 3 states: (1) software signaled event is active. (2) async pf event is active. (3) none of them is active. Lets use a bitmap for the event active state as I said above, if you don't strongly object :) >>>>>>> Do we need this if we disallow nesting events? >>>>>>> >>>>>> >>>>>> Yes, we need this. "event == NULL" is used as indication of invalid >>>>>> context. @event is the associated SDEI event when the context is >>>>>> valid. >>>>> >>>>> What if we use some other plumbing to indicate the state of the vCPU? MP >>>>> state comes to mind, for example. >>>>> >>>> >>>> Even the indication is done by another state, kvm_sdei_vcpu_context still >>>> need to be linked (associated) with the event. After the vCPU context becomes >>>> valid after the event is delivered, we still need to know the associated >>>> event when some of hypercalls are triggered. SDEI_1_0_FN_SDEI_EVENT_COMPLETE >>>> is one of the examples, we need to decrease struct kvm_sdei_event::event_count >>>> for the hypercall. >>> >>> Why do we need to keep track of how many times an event has been >>> signaled? Nothing in SDEI seems to suggest that the number of event >>> signals corresponds to the number of times the handler is invoked. In >>> fact, the documentation on SDEI_EVENT_SIGNAL corroborates this: >>> >>> """ >>> The event has edgetriggered semantics and the number of event signals >>> may not correspond to the number of times the handler is invoked in the >>> target PE. >>> """ >>> >>> DEN0054C 5.1.16.1 >>> >>> So perhaps we queue at most 1 pending event for the guest. >>> >>> I'd also like to see if anyone else has thoughts on the topic, as I'd >>> hate for you to go back to the whiteboard again in the next spin. >>> >> >> Agreed. In next respin, we will have one pending event at most. Error >> can be returned if user attempts to inject event whose pending state >> (struct kvm_sdei_vcpu::pending) has been set. > > I don't believe we can do that. The SDEI_EVENT_SIGNAL call should succeed, > even if the event was already pending. > I rethinking it a bit. Yes, you're correct. In this specific case, the event handler is running for multiple events. >> Indeed, the hardest part is to determine the data structures and >> functions we need. Oliver, your valuable comments are helping to >> bring this series to the right track. However, I do think it's >> helpful if somebody else can confirm the outcomes from the previous >> discussions. I'm not sure if Marc has time for a quick scan and provide >> comments. >> >> I would summarize the outcomes from our discussions, to help Marc >> or others to confirm: > > Going to take a look at some of your later patches as well, just a heads > up. > Yep, thanks again for your valuable comments :) >> - Drop support for the shared event. >> - Dropsupport for the critical event. >> - The events in the implementations are all private and can be signaled >> (raised) by software. >> - Drop migration support for now, and we will consider it using >> pseudo firmware registers. So add-on patches are expected to support >> the migration in future. > > Migration will be supported in a future spin of this series, not a > subsequent one right? :) I had just made the suggestion because there was > a lot of renovations that we were discussing. > I prefer a separate series to support migration after this series gets merged. There are couple of reasons to do so: (1) The migration depends on Raghavendra's series to support for hypercall services selection. The series is close to be merged, but not happen yet. The SDEI is one of the hypercall services at least. SDEI's pseudo firmware registers for migration will be managed by the infrastructure. (2) I would focus on the core functinality for now. In this way, we give migration space. For example, the data structures needs sorts of adjustments for migration, just in case. >> - Drop locking mechanism. All the functions are executed in vcpu context. > > Well, not entirely. Just need to make sure atomics are used to post > events to another vCPU in the case of SDEI_EVENT_SIGNAL. > > set_bit() fits the bill here, as we've discussed. > Yes, I meant to remove struct kvm_sdei_vcpu::lock by dropping the locking mechanism :) >> - To use the data struct as you suggested. Besides, the vcpu's mask >> state is put to struct kvm_arch_vcpu::flags. >> enum kvm_sdei_event >> struct kvm_sdei_event_handler >> struct kvm_sdei_event_context >> struct kvm_sdei_vcpu >> Thanks, Gavin