Date: Fri, 15 Apr 2022 15:00:34 +0000
From: Sean Christopherson
To: Zeng Guang
Cc: Paolo Bonzini, Vitaly Kuznetsov, Wanpeng Li, Jim Mattson, Joerg Roedel,
    kvm@vger.kernel.org, Dave Hansen, Tony Luck, Kan Liang, Thomas Gleixner,
    Ingo Molnar, Borislav Petkov, "H. Peter Anvin", Kim Phillips,
    Jarkko Sakkinen, Jethro Beekman, Kai Huang, x86@kernel.org,
    linux-kernel@vger.kernel.org, Robert Hu, Gao Chao, Marc Zyngier,
    James Morse, Alexandru Elisei, Suzuki K Poulose, Christian Borntraeger,
    Janosch Frank, Claudio Imbrenda, David Hildenbrand
Subject: Re: [PATCH v8 7/9] KVM: Move kvm_arch_vcpu_precreate() under kvm->lock
References: <20220411090447.5928-1-guang.zeng@intel.com>
    <20220411090447.5928-8-guang.zeng@intel.com>
In-Reply-To: <20220411090447.5928-8-guang.zeng@intel.com>

Heh, lots of people cc'd, but none of the people whose code this affects.

+s390 and arm folks

On Mon, Apr 11, 2022, Zeng Guang wrote:
> Arch specific KVM common data may require pre-allocation or other
> preprocess ready before vCPU creation at runtime.

Please provide the specific motivation for the move, i.e. explain the desire
to do per-VM allocations based on max_vcpu_ids at the first vCPU creation.

> It's safe to invoke kvm_arch_vcpu_precreate() within the protection of
> kvm->lock directly rather than take into account in the implementation for
> each architecture.

This absolutely needs to explain _why_ it's safe, e.g. only arm64, x86, and
s390 have non-nop implementations and they're all simple and short with no
tendrils into other code that might take kvm->lock.

And as before, I suspect arm64 needs this protection, the vgic_initialized()
check looks racy.
Though it's hard to tell if doing the check under kvm->lock actually fixes
anything.

> Suggested-by: Sean Christopherson
> Signed-off-by: Zeng Guang
> ---
>  arch/s390/kvm/kvm-s390.c | 2 --
>  virt/kvm/kvm_main.c      | 2 +-

I think it's also worth changing x86's implementation to check created_vcpus
instead of online_vcpus.  That'll fix a race where userspace could never see
the pr_warn() (which is arguably useless, but whatever), e.g. if it creates a
VM with 2 vCPUs and both simultaneously go through kvm_arch_vcpu_precreate().

>  2 files changed, 1 insertion(+), 3 deletions(-)
>
> diff --git a/arch/s390/kvm/kvm-s390.c b/arch/s390/kvm/kvm-s390.c
> index 156d1c25a3c1..5c795bbcf1ea 100644
> --- a/arch/s390/kvm/kvm-s390.c
> +++ b/arch/s390/kvm/kvm-s390.c
> @@ -3042,9 +3042,7 @@ static int sca_can_add_vcpu(struct kvm *kvm, unsigned int id)
>  	if (!sclp.has_esca || !sclp.has_64bscao)
>  		return false;
>
> -	mutex_lock(&kvm->lock);
>  	rc = kvm->arch.use_esca ? 0 : sca_switch_to_extended(kvm);
> -	mutex_unlock(&kvm->lock);
>
>  	return rc == 0 && id < KVM_S390_ESCA_CPU_SLOTS;
>  }
> diff --git a/virt/kvm/kvm_main.c b/virt/kvm/kvm_main.c
> index 70e05af5ebea..a452e678a015 100644
> --- a/virt/kvm/kvm_main.c
> +++ b/virt/kvm/kvm_main.c
> @@ -3732,9 +3732,9 @@ static int kvm_vm_ioctl_create_vcpu(struct kvm *kvm, u32 id)
>  	}
>
>  	kvm->created_vcpus++;
> +	r = kvm_arch_vcpu_precreate(kvm, id);

Hmm, so I think I'd prefer this to be invoked before bumping created_vcpus.
The existing implementations don't reference created_vcpus, so there's no
change needed to existing code.  Logically, a pre-create helper should not see
a non-zero count, as the "pre" part strongly implies it's being called _before_
creating the first vCPU.

Then switching from online_vcpus to created_vcpus in the x86 implementation
doesn't need to have a weird change from "> 0" => "> 1".

Ah, and then it also wouldn't have goofy behavior where it drops and reacquires
kvm->lock on failure just to decrement created_vcpus.
>  	mutex_unlock(&kvm->lock);
>
> -	r = kvm_arch_vcpu_precreate(kvm, id);
>  	if (r)
>  		goto vcpu_decrement;
>
> --
> 2.27.0
>
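
To make the ordering suggested above concrete, i.e. invoke the pre-create hook
under kvm->lock but before bumping created_vcpus, here is a rough userspace
model.  It is only a sketch and not kernel code: struct vm_model,
model_vcpu_precreate() and model_create_vcpu() are hypothetical stand-ins, the
max_vcpus limit check is an assumption about the surrounding code, and
kvm->lock is modeled as a plain flag rather than a real mutex.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>

/* Hypothetical userspace stand-in for struct kvm; not the kernel's layout. */
struct vm_model {
	bool lock_held;		/* models kvm->lock being held */
	int created_vcpus;	/* models kvm->created_vcpus */
	int max_vcpus;		/* assumed per-VM limit */
	int precreate_err;	/* value the arch hook returns, 0 on success */
	int seen_created;	/* created_vcpus as observed by the hook */
};

/* Stand-in for kvm_arch_vcpu_precreate(); expects the lock to be held. */
static int model_vcpu_precreate(struct vm_model *vm)
{
	assert(vm->lock_held);
	vm->seen_created = vm->created_vcpus;
	return vm->precreate_err;
}

/* Stand-in for the locked portion of kvm_vm_ioctl_create_vcpu(). */
static int model_create_vcpu(struct vm_model *vm)
{
	int r;

	vm->lock_held = true;			/* mutex_lock(&kvm->lock) */
	if (vm->created_vcpus >= vm->max_vcpus) {
		vm->lock_held = false;
		return -EINVAL;
	}

	/* Call the hook first, so it sees the count before this vCPU. */
	r = model_vcpu_precreate(vm);
	if (r) {
		/* Nothing was bumped, so there is nothing to undo. */
		vm->lock_held = false;
		return r;
	}

	vm->created_vcpus++;
	vm->lock_held = false;			/* mutex_unlock(&kvm->lock) */
	return 0;
}
```

In this model the hook sees a count of zero for the first vCPU, and a failing
hook returns with created_vcpus untouched, so the error path never has to
retake the lock just to decrement the count.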