Received: by 2002:a25:8b12:0:0:0:0:0 with SMTP id i18csp1300585ybl; Thu, 22 Aug 2019 12:19:31 -0700 (PDT) X-Google-Smtp-Source: APXvYqz3FDmvUf7/CKWp35PWySxDrIQRiNdlAg7neDjFUgvExI1vkW5KQjLEN6odGnpbCerQ/MBi X-Received: by 2002:a17:902:9f8e:: with SMTP id g14mr415220plq.67.1566501571213; Thu, 22 Aug 2019 12:19:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1566501571; cv=none; d=google.com; s=arc-20160816; b=SvC+jcwiV/VOFf2QEkXG364FAbqylUg3p5hh1sH9pNdqOPrbuwsz74UKVz4UacbGDM yExpHu/B0MQIdgZuHWFIcyFZ3nC8oUqk1QsPcmhIUFGS4/oM/UbZ03FscunmGNyxk5LS 5Q5E7Uph/GQx1EO2g24VyElNux/KFaxaimivMEfBAifyO1hZ5L8KYIZGP8uGb89cLniu b/V5/wvatOkMTMzH9tHON3VHJN4LUPFAVAbqqfVX75lZovkHWV1sL/KMu51zUJ7M2jV0 Ht9qhsW5641EVOA2s/Rc5Macp0pNBh8PS/Mk7tQCUPYODp9TqYxNDKgMEcRYzqsJOQxA ssJg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding :content-language:in-reply-to:mime-version:user-agent:date :message-id:from:references:cc:to:subject:dkim-signature; bh=8ioN/JPy48toyNhc9jaRO5V8hI5LIKQUVL1VCL1aBl8=; b=PA9lTc3/SQxrozKOPuxkp8o2ziEMLsuJ/w+cn26X2A9U/gqpeTuoTtWT3GmnFiWYte zCH3m1Wco8+1ClgcS1yUB9Wp7aahJ18rGBBOl/EWF80BBlzrVszr7etW0/gAaO8sELdT x+dbXQBnqPzG/4FVTDZVxCrPiGT/qPWJyjw7uxtba+W2Z0z7bpDjxYR6cJoHNlWbaMwN tuK+0Cd0LRyELtnHLLCPuKHH26ipaxXruPOYt6AXoW3Fl3E9M0P3L//MWvhE0ZL154Bi G1lgIDEcMb74viYe4kmEQdAUISszE7U3jd1OecCqU91aMn6JBZpXeAHJPNOcssUunj9E Qumg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=AqnGzCDU; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id f17si135290pgh.552.2019.08.22.12.19.15; Thu, 22 Aug 2019 12:19:31 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=AqnGzCDU; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2387492AbfHVOMk (ORCPT + 99 others); Thu, 22 Aug 2019 10:12:40 -0400 Received: from smtp-fw-9102.amazon.com ([207.171.184.29]:15285 "EHLO smtp-fw-9102.amazon.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1732331AbfHVOMk (ORCPT ); Thu, 22 Aug 2019 10:12:40 -0400 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amazon.com; i=@amazon.com; q=dns/txt; s=amazon201209; t=1566483158; x=1598019158; h=subject:to:cc:references:from:message-id:date: mime-version:in-reply-to:content-transfer-encoding; bh=8ioN/JPy48toyNhc9jaRO5V8hI5LIKQUVL1VCL1aBl8=; b=AqnGzCDU3WVeuBlI8CdISQJO34yeO5jl/LGO6hkfQRvrtTBQ2AMzmjQ9 MdqtTin1BVN0R4WCI/QSARnhUctzkgtHmVxFp7LRj+cyD7l2zTo1XolmX HSOnjqkMUufuVV5nEAEcZfE87SNfokjFAENGrz/R61g7SrgHtnh+gHbQ5 g=; X-IronPort-AV: E=Sophos;i="5.64,416,1559520000"; d="scan'208";a="696479885" Received: from sea3-co-svc-lb6-vlan3.sea.amazon.com (HELO email-inbound-relay-1e-97fdccfd.us-east-1.amazon.com) ([10.47.22.38]) by smtp-border-fw-out-9102.sea19.amazon.com with ESMTP; 22 Aug 2019 14:12:33 +0000 Received: from EX13MTAUWC001.ant.amazon.com (iad55-ws-svc-p15-lb9-vlan3.iad.amazon.com [10.40.159.166]) by email-inbound-relay-1e-97fdccfd.us-east-1.amazon.com (Postfix) with ESMTPS id A7360A26E4; Thu, 22 Aug 2019 14:12:28 +0000 (UTC) Received: from EX13D20UWC001.ant.amazon.com (10.43.162.244) by EX13MTAUWC001.ant.amazon.com (10.43.162.135) with Microsoft SMTP Server (TLS) id 15.0.1367.3; Thu, 22 Aug 2019 14:12:27 +0000 Received: from 38f9d3867b82.ant.amazon.com (10.43.161.67) by EX13D20UWC001.ant.amazon.com (10.43.162.244) with Microsoft SMTP Server (TLS) id 15.0.1367.3; Thu, 22 Aug 2019 14:12:24 +0000 Subject: Re: [PATCH v5 08/20] RISC-V: KVM: Implement KVM_GET_ONE_REG/KVM_SET_ONE_REG ioctls To: Anup Patel CC: Anup Patel , Palmer Dabbelt , "Paul Walmsley" , Paolo Bonzini , Radim K , Daniel Lezcano , Thomas Gleixner , Atish Patra , Alistair Francis , Damien Le Moal , Christoph Hellwig , "kvm@vger.kernel.org" , "linux-riscv@lists.infradead.org" , "linux-kernel@vger.kernel.org" References: <20190822084131.114764-1-anup.patel@wdc.com> <20190822084131.114764-9-anup.patel@wdc.com> From: Alexander Graf Message-ID: <2871ee6a-ae7c-6937-e8ef-38a8c318638a@amazon.com> Date: Thu, 22 Aug 2019 16:12:21 +0200 User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10.14; rv:60.0) Gecko/20100101 Thunderbird/60.8.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset="utf-8"; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-Originating-IP: [10.43.161.67] X-ClientProxiedBy: EX13D24UWB002.ant.amazon.com (10.43.161.159) To EX13D20UWC001.ant.amazon.com (10.43.162.244) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 22.08.19 16:00, Anup Patel wrote: > On Thu, Aug 22, 2019 at 5:31 PM Alexander Graf wrote: >> >> On 22.08.19 10:44, Anup Patel wrote: >>> For KVM RISC-V, we use KVM_GET_ONE_REG/KVM_SET_ONE_REG ioctls to access >>> VCPU config and registers from user-space. >>> >>> We have three types of VCPU registers: >>> 1. CONFIG - these are VCPU config and capabilities >>> 2. CORE - these are VCPU general purpose registers >>> 3. CSR - these are VCPU control and status registers >>> >>> The CONFIG registers available to user-space are ISA and TIMEBASE. Out >>> of these, TIMEBASE is a read-only register which inform user-space about >>> VCPU timer base frequency. The ISA register is a read and write register >>> where user-space can only write the desired VCPU ISA capabilities before >>> running the VCPU. >>> >>> The CORE registers available to user-space are PC, RA, SP, GP, TP, A0-A7, >>> T0-T6, S0-S11 and MODE. Most of these are RISC-V general registers except >>> PC and MODE. The PC register represents program counter whereas the MODE >>> register represent VCPU privilege mode (i.e. S/U-mode). >>> >>> The CSRs available to user-space are SSTATUS, SIE, STVEC, SSCRATCH, SEPC, >>> SCAUSE, STVAL, SIP, and SATP. All of these are read/write registers. >>> >>> In future, more VCPU register types will be added (such as FP) for the >>> KVM_GET_ONE_REG/KVM_SET_ONE_REG ioctls. >>> >>> Signed-off-by: Anup Patel >>> Acked-by: Paolo Bonzini >>> Reviewed-by: Paolo Bonzini >>> --- >>> arch/riscv/include/uapi/asm/kvm.h | 40 ++++- >>> arch/riscv/kvm/vcpu.c | 235 +++++++++++++++++++++++++++++- >>> 2 files changed, 272 insertions(+), 3 deletions(-) >>> >>> diff --git a/arch/riscv/include/uapi/asm/kvm.h b/arch/riscv/include/uapi/asm/kvm.h >>> index 6dbc056d58ba..024f220eb17e 100644 >>> --- a/arch/riscv/include/uapi/asm/kvm.h >>> +++ b/arch/riscv/include/uapi/asm/kvm.h >>> @@ -23,8 +23,15 @@ >>> >>> /* for KVM_GET_REGS and KVM_SET_REGS */ >>> struct kvm_regs { >>> + /* out (KVM_GET_REGS) / in (KVM_SET_REGS) */ >>> + struct user_regs_struct regs; >>> + unsigned long mode; >> >> Is there any particular reason you're reusing kvm_regs and don't invent >> your own struct? kvm_regs is explicitly meant for the get_regs and >> set_regs ioctls. > > We are implementing only ONE_REG interface so most of these > structs are unused hence we tried to reuse these struct instead > of introducing new structs. (Similar to KVM ARM64) > >> >>> }; >>> >>> +/* Possible privilege modes for kvm_regs */ >>> +#define KVM_RISCV_MODE_S 1 >>> +#define KVM_RISCV_MODE_U 0 >>> + >>> /* for KVM_GET_FPU and KVM_SET_FPU */ >>> struct kvm_fpu { >>> }; >>> @@ -41,10 +48,41 @@ struct kvm_guest_debug_arch { >>> struct kvm_sync_regs { >>> }; >>> >>> -/* dummy definition */ >>> +/* for KVM_GET_SREGS and KVM_SET_SREGS */ >>> struct kvm_sregs { >>> + unsigned long sstatus; >>> + unsigned long sie; >>> + unsigned long stvec; >>> + unsigned long sscratch; >>> + unsigned long sepc; >>> + unsigned long scause; >>> + unsigned long stval; >>> + unsigned long sip; >>> + unsigned long satp; >> >> Same comment here. > > Same as above, we are trying to use unused struct. > >> >>> }; >>> >>> +#define KVM_REG_SIZE(id) \ >>> + (1U << (((id) & KVM_REG_SIZE_MASK) >> KVM_REG_SIZE_SHIFT)) >>> + >>> +/* If you need to interpret the index values, here is the key: */ >>> +#define KVM_REG_RISCV_TYPE_MASK 0x00000000FF000000 >>> +#define KVM_REG_RISCV_TYPE_SHIFT 24 >>> + >>> +/* Config registers are mapped as type 1 */ >>> +#define KVM_REG_RISCV_CONFIG (0x01 << KVM_REG_RISCV_TYPE_SHIFT) >>> +#define KVM_REG_RISCV_CONFIG_ISA 0x0 >>> +#define KVM_REG_RISCV_CONFIG_TIMEBASE 0x1 >>> + >>> +/* Core registers are mapped as type 2 */ >>> +#define KVM_REG_RISCV_CORE (0x02 << KVM_REG_RISCV_TYPE_SHIFT) >>> +#define KVM_REG_RISCV_CORE_REG(name) \ >>> + (offsetof(struct kvm_regs, name) / sizeof(unsigned long)) >> >> I see, you're trying to implicitly use the struct offsets as index. >> >> I'm not a really big fan of it, but I can't pinpoint exactly why just >> yet. It just seems too magical (read: potentially breaking down the >> road) for me. >> >>> + >>> +/* Control and status registers are mapped as type 3 */ >>> +#define KVM_REG_RISCV_CSR (0x03 << KVM_REG_RISCV_TYPE_SHIFT) >>> +#define KVM_REG_RISCV_CSR_REG(name) \ >>> + (offsetof(struct kvm_sregs, name) / sizeof(unsigned long)) >>> + >>> #endif >>> >>> #endif /* __LINUX_KVM_RISCV_H */ >>> diff --git a/arch/riscv/kvm/vcpu.c b/arch/riscv/kvm/vcpu.c >>> index 7f59e85c6af8..9396a83c0611 100644 >>> --- a/arch/riscv/kvm/vcpu.c >>> +++ b/arch/riscv/kvm/vcpu.c >>> @@ -164,6 +164,215 @@ vm_fault_t kvm_arch_vcpu_fault(struct kvm_vcpu *vcpu, struct vm_fault *vmf) >>> return VM_FAULT_SIGBUS; >>> } >>> >>> +static int kvm_riscv_vcpu_get_reg_config(struct kvm_vcpu *vcpu, >>> + const struct kvm_one_reg *reg) >>> +{ >>> + unsigned long __user *uaddr = >>> + (unsigned long __user *)(unsigned long)reg->addr; >>> + unsigned long reg_num = reg->id & ~(KVM_REG_ARCH_MASK | >>> + KVM_REG_SIZE_MASK | >>> + KVM_REG_RISCV_CONFIG); >>> + unsigned long reg_val; >>> + >>> + if (KVM_REG_SIZE(reg->id) != sizeof(unsigned long)) >>> + return -EINVAL; >>> + >>> + switch (reg_num) { >>> + case KVM_REG_RISCV_CONFIG_ISA: >>> + reg_val = vcpu->arch.isa; >>> + break; >>> + case KVM_REG_RISCV_CONFIG_TIMEBASE: >>> + reg_val = riscv_timebase; >> >> What does this reflect? The current guest time hopefully not? An offset? >> Related to what? > > riscv_timebase is the frequency in HZ of the system timer. > > The name "timebase" is not appropriate but we have been > carrying it since quite some time now. What do you mean by "some time"? So far I only see a kernel internal variable named after it. That's dramatically different from something exposed via uapi. Just name it tbfreq. So if this is the frequency, where is the offset? You will need it on save/restore. If you're saying that's out of scope for now, that's fine with me too :). Alex