Received: by 2002:ac0:a5a7:0:0:0:0:0 with SMTP id m36-v6csp1924684imm; Tue, 10 Jul 2018 10:04:38 -0700 (PDT) X-Google-Smtp-Source: AAOMgpd7q3/CyeKgAcH0ihZVhzA7hvDP8wJlXpr85b0ybmqnFrBZmcHuSlkgJIkw0UULbb/wCnKu X-Received: by 2002:a65:5686:: with SMTP id v6-v6mr23586166pgs.141.1531242278106; Tue, 10 Jul 2018 10:04:38 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1531242278; cv=none; d=google.com; s=arc-20160816; b=DN+0bXOe9nzNbf5svfoSmwLAe8Y758VYXk5CNF1v5yICdp8jpOydcn79QEq9CCu+X8 zF5LtW75U68LQUWdpouH/hIcnh3sJLvcSbyXjUCEhcMi4RxyD0pNzKCenCGkjptHOTu/ u4X3wLVvxFa+FYYapsmMJBX89ZviAXnonnaOiVYjlnAALt/4Rq5mxqNDWTJ77WY/THW3 46Wdec0cfBca+rJ1Ewgl7xwdPTBSvzjwC5xFLf3kLNhBiA/hPZsNvL1T9Bis6YE6I0Jg skLf6QeRqe+I81pLClwTAZDV6JiPBSz6mYA8PMwt3o24PxOeQwPyNkuZNHo0oCLb7Gw4 3gfQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date:arc-authentication-results; bh=FbbaBqX2GBsygk6MmpwHxpnCHL5UXb3SSrwHQVY6OSM=; b=Dn1yFCKw5F+W0dCLaJrThu6Vbt5+xkOhMGcHfGj419nGjhdsOUiGCNaN2/QQXrKSo0 T6y+aXYL2ArT1Ow8AaqyULAbAxLS96YTgZJ4YkH7+bK2CjdOprawal7l2t825YxXKLDM 5Zea0NHJHA6PiFgX+dpc9wqIG21XcfA0B7jWpJBoVJ/lvekunxTrq6rh5N4ZbOBrfdgG RJQez58que7sSW1j1D/KjUgLyHkT/KgPps68oV5kqcwO1cVuE8x1Smwd0ul7FtAqQDvL +MN+jUSMEgwGxxzANxTL8Ex5uC4pnK8XszB9pcvxoI+FsMDi1YCLyj2I0b5S7Q7ricVr HHSA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id 91-v6si17448323ple.308.2018.07.10.10.04.21; Tue, 10 Jul 2018 10:04:38 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S933354AbeGJRDg (ORCPT + 99 others); Tue, 10 Jul 2018 13:03:36 -0400 Received: from usa-sjc-mx-foss1.foss.arm.com ([217.140.101.70]:50538 "EHLO foss.arm.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1754260AbeGJRDf (ORCPT ); Tue, 10 Jul 2018 13:03:35 -0400 Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.72.51.249]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 2A18580D; Tue, 10 Jul 2018 10:03:35 -0700 (PDT) Received: from e103592.cambridge.arm.com (usa-sjc-imap-foss1.foss.arm.com [10.72.51.249]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id 187B03F589; Tue, 10 Jul 2018 10:03:32 -0700 (PDT) Date: Tue, 10 Jul 2018 18:03:30 +0100 From: Dave Martin To: Suzuki K Poulose Cc: Marc Zyngier , cdall@kernel.org, kvm@vger.kernel.org, catalin.marinas@arm.com, punit.agrawal@arm.com, Will Deacon , linux-kernel@vger.kernel.org, qemu-devel@nongnu.org, Paolo Bonzini , kvmarm@lists.cs.columbia.edu, linux-arm-kernel@lists.infradead.org Subject: Re: [PATCH v3 15/20] kvm: arm/arm64: Allow tuning the physical address size for VM Message-ID: <20180710170330.GJ9486@e103592.cambridge.arm.com> References: <1530270944-11351-16-git-send-email-suzuki.poulose@arm.com> <20180704155104.GN4828@arm.com> <12d1832a-1a13-7dd4-662b-addf58400789@arm.com> <9f1af26e-2913-2b0b-1352-63160096f78f@arm.com> <20180709112326.GD9486@e103592.cambridge.arm.com> <17f8d585-3489-ab6f-6ee1-4d8d337dcf9c@arm.com> <20180709133750.GE9486@e103592.cambridge.arm.com> <377420ce-97a8-4359-6224-273d91f37247@arm.com> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <377420ce-97a8-4359-6224-273d91f37247@arm.com> User-Agent: Mutt/1.5.23 (2014-03-12) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Jul 10, 2018 at 05:38:39PM +0100, Suzuki K Poulose wrote: > On 09/07/18 14:37, Dave Martin wrote: > >On Mon, Jul 09, 2018 at 01:29:42PM +0100, Marc Zyngier wrote: > >>On 09/07/18 12:23, Dave Martin wrote: [...] > >>>Wedging arguments into a few bits in the type argument feels awkward, > >>>and may be regretted later if we run out of bits, or something can't be > >>>represented in the chosen encoding. > >> > >>I think that's a pretty convincing argument for a "better" CREATE_VM, > >>one that would have a clearly defined, structured (and potentially > >>extensible) argument. > >> > >>I've quickly hacked the following: > >> > >>diff --git a/include/uapi/linux/kvm.h b/include/uapi/linux/kvm.h > >>index b6270a3b38e9..3e76214034c2 100644 > >>--- a/include/uapi/linux/kvm.h > >>+++ b/include/uapi/linux/kvm.h > >>@@ -735,6 +735,20 @@ struct kvm_ppc_resize_hpt { > >> __u32 pad; > >> }; > >> > >>+struct kvm_create_vm2 { > >>+ __u64 version; /* Or maybe not */ > >>+ union { > >>+ struct { > >>+#define KVM_ARM_SVE_CAPABLE (1 << 0) > >>+#define KVM_ARM_SELECT_IPA {1 << 1) > >>+ __u64 capabilities; > >>+ __u16 sve_vlen; > >>+ __u8 ipa_size; > >>+ } arm64; > >>+ __u64 dummy[15]; > >>+ }; > >>+}; > >>+ > >> #define KVMIO 0xAE > >> > >> /* machine type bits, to be used as argument to KVM_CREATE_VM */ > >> > >>Other architectures could fill in their own bits if they need to. > >> > >>Thoughts? > > > >This kind of thing should work, but it may still get messy when we > >add additional fields. > > > Marc, Dave, > > I like Dave's approach. Some comments below. > > > > >It we want this to work cross-arch, would it make sense to go > >for a more generic approach, say > > > >struct kvm_create_vm_attr_any { > > __u32 type; > >}; > > > >#define KVM_CREATE_VM_ATTR_ARCH_CAPABILITIES 1 > >struct kvm_create_vm_attr_arch_capabilities { > > __u32 type; > > __u16 size; /* support future expansion of capabilities[] */ > > __u16 reserved; > > __u64 capabilities[1]; > >}; > > We also need to advertise which attributes are supported by the host, > so that the user can tune the available ones. That would make a bit mask > like the above trickier, unless we return the supported values back > in the argument ptr for the "probe" call. And this scheme in general > can be useful for passing back a non-boolean result specific to the > attribute, without having a per-attribute ioctl. (e.g, maximum limit > for IPA). Maybe, but this could quickly become bloated. (My approach already feels a bit bloated...) I'm not sure that arbitrarily complex negotiation will really be needed, but userspace might want to change its mind if setting a particular propertiy fails. An alternative might be to have a bunch of per-VM ioctls to configure different things, like x86 has. There's at least precedent for that. For arm, we currently only have a few. That allows for easy extension, at the cost of adding ioctls. There may be some ioctls we can reuse, like KVM_ENABLE_CAP for per- vm capability flags. [...] > >union kvm_create_vm_attr { > > struct kvm_create_vm_attr_any; > > struct kvm_create_vm_attr_arch_capabilities; > > struct kvm_create_vm_attr_arm64_physaddr_size; > > /* ... */ > >}; > > nit: Could we simply do s/kvm_create_vm_attr/kvm_vm_attr/ everywhere ? > While I agree that the kvm_create_vm_attr makes it implicit that the attributes > are valid only "create" ioctl, the lack of an ioctl to set the VM attribute > should be sufficient to indicate the same. I just randomly came up with some names. The precise naming scheme isn't that important, so long as it unlikely to result in name collisions and so long as it's reasonablu clear (or compiler-checkable, or preferably both) which things can be used where. I wouldn't have a problem with something a bit terser. > > > > >struct kvm_create_vm2 { > > __u32 version; /* harmless, even if not useful */ > > __u16 nr_attrs; /* or could just terminate attrs with a > > NULL entry */ > > union kvm_create_vm_attr __user *__user *attrs; > >}; > > > > > >This is quite flexible, but obviously a bit heavy. > > > >However, if we're adding a new interface due to lack of extensibility, > >it may be worth going for something that's freely extensible. > > True. I could hack something up along the lines above and send it here. Sure, but best to keep it fairly rough for now. Cheers ---Dave