Received: by 2002:a5d:9c59:0:0:0:0:0 with SMTP id 25csp7501iof; Wed, 8 Jun 2022 13:55:09 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxE9Q/000kglu0dHNATfRqy8aqzV2NyeQslQpMOJdr9FVdThm+WRFIxXKIqgdPSi27xjTSG X-Received: by 2002:a17:906:358a:b0:6f4:2903:417e with SMTP id o10-20020a170906358a00b006f42903417emr34056730ejb.592.1654721709081; Wed, 08 Jun 2022 13:55:09 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1654721709; cv=none; d=google.com; s=arc-20160816; b=VVXvAeGQXb+oI4eF2c367nsaSljpjy9gCqaGgqZkyCf0bLIeswqcc4u41F0ErXtI6t MHq291hs1E09hxZvWa1jzz4Taq/sBws9C1MO6ofm4SMV98MFzw6GMn4u6//m/rwlU8Fk qYG2zOcyds45nafHo1/uBi/UUp5L68QFCligDYDD0ftuWx8iM/YTxUlNGoMnP/1YE9E8 T10AiOJ3ncd3SUYtojX6rwEgIJ5mY+4gYgF1K6bpPhIbmU/9fMrmTt2vCEWDNJrA4Gcx JaG8AOgNdUxvqHiiPHOvfIj/glKFnYxRTmIzEr+hRk2+adzlYp34k4gw8UhvGIIlnVs/ hSQQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:cc:to:subject:message-id:date:from:in-reply-to :references:mime-version:dkim-signature; bh=XRSme8wOBaM90PvfwVzl0l/m5nMDDm6ImTO1PuXLlu0=; b=BBblFd7GFBNWMKnVcYdHtJFh8cIcBQsgGqxyE7S93PgxQZw9Q4YWLkbv2RfhkXUhzh FzEp8Y6xHruKxEM6dVvaI0BW/R3bTWGzgn4P9mrn2hC1fy9d93tIoZsHYR+jG3uo8/r/ IC+kU0bQ5ndProK2x6xccQO1BLKNYTyZiFgUiLX21OqIEFedL2Ty55qMFiGcSxxuep55 PYgsXJtRApNy1eKt9ralJH5btIPvJjLhgfxf9zcjeE/ln5nCy1quAOvPKcPmhjOYBOkA M6/Ysk3RCMb0mJ3Gz9zyL+aQHfpS87uP1jttAd3El35J68/uJ129GM0P1+8ii4On5z1d de/g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ddRHzWiu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id x12-20020a05640225cc00b0043187c8b46esi2163583edb.575.2022.06.08.13.54.42; Wed, 08 Jun 2022 13:55:09 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@redhat.com header.s=mimecast20190719 header.b=ddRHzWiu; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232114AbiFHUej (ORCPT + 99 others); Wed, 8 Jun 2022 16:34:39 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52588 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229720AbiFHUei (ORCPT ); Wed, 8 Jun 2022 16:34:38 -0400 Received: from us-smtp-delivery-124.mimecast.com (us-smtp-delivery-124.mimecast.com [170.10.133.124]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id D471564CF for ; Wed, 8 Jun 2022 13:34:35 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=redhat.com; s=mimecast20190719; t=1654720474; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=XRSme8wOBaM90PvfwVzl0l/m5nMDDm6ImTO1PuXLlu0=; b=ddRHzWiuQgmKgsLzwmjOXyM3p9ZwLVuUoQKs43f3J9CpLm61MuAHkLLqKvtklOLxU6mBpW iuRCCug2D7Gb7CYdZhBGadAj7Pc2nWQ45tED/blEKlFjX7XVlJDhNe76TOe/GywQpZPcwE vq1PrGqapopO+2YcM+xtTgDZam+uLT4= Received: from mail-ej1-f72.google.com (mail-ej1-f72.google.com [209.85.218.72]) by relay.mimecast.com with ESMTP with STARTTLS (version=TLSv1.2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id us-mta-584-hKKOcRqaN4qEPe0liffftw-1; Wed, 08 Jun 2022 16:34:31 -0400 X-MC-Unique: hKKOcRqaN4qEPe0liffftw-1 Received: by mail-ej1-f72.google.com with SMTP id q5-20020a17090676c500b00704ffb95131so9987384ejn.8 for ; Wed, 08 Jun 2022 13:34:30 -0700 (PDT) X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:mime-version:references:in-reply-to:from:date :message-id:subject:to:cc; bh=XRSme8wOBaM90PvfwVzl0l/m5nMDDm6ImTO1PuXLlu0=; b=KirRHkerhe1srgb+Q5iqbL8iWvNsUtLKVomNcP/P3QuFY1Zr0CJaUDoWakUX4yrxrt iYae1aMFEq942f5M8HjW5Pt3aOwzywe87joMOpXU11Iyf0qZ25VfCLk9haHzQ8XUVuku HPhgOUIVbo8sth3NQxPwj2lCiSfPkLnqXkrUxHhW2OpOnTdK9Cttt7L6gBkQjQ6Zcs6P s8qnZY+ZYS8Ghll/F4P2raAb7AODFVbP/EmMBq/C584hby7fuSCmhhIKsp+LO9naSmb+ saXkEfwjvQtAO109bUKFQXTtYytnLtjAcvyVqdkoAbyw+A508QrKl428ZfBiYLmlHl5K Z9iQ== X-Gm-Message-State: AOAM531MlK6Qa3Eq+ks8Bz5r9CgVxra3RWcIb9P2els53BOitugIRXtI oqc6jFLB9sLbSB9FW7DpfjFTCAd9fH1axCWtd2aMYhfzNQ/31REP6oqT9GGarYLBC8UhnrrW0yZ Dtd6PJLTNOa+nUt5ju0IbM27N5sEQtrHyw7ACCSTF X-Received: by 2002:a17:906:824a:b0:70f:4c58:6ec6 with SMTP id f10-20020a170906824a00b0070f4c586ec6mr25916966ejx.648.1654720469948; Wed, 08 Jun 2022 13:34:29 -0700 (PDT) X-Received: by 2002:a17:906:824a:b0:70f:4c58:6ec6 with SMTP id f10-20020a170906824a00b0070f4c586ec6mr25916954ejx.648.1654720469668; Wed, 08 Jun 2022 13:34:29 -0700 (PDT) MIME-Version: 1.0 References: <20220301201344.18191-1-sashal@kernel.org> <20220301201344.18191-7-sashal@kernel.org> <5f2b7b93-d4c9-1d59-14df-6e8b2366ca8a@redhat.com> <2d9ba70b-ac18-a461-7a57-22df2c0165c6@redhat.com> <9d336622-6964-454a-605f-1ca90b902836@redhat.com> In-Reply-To: From: Leonardo Bras Soares Passos Date: Wed, 8 Jun 2022 17:34:18 -0300 Message-ID: Subject: Re: [PATCH AUTOSEL 5.16 07/28] x86/kvm/fpu: Limit guest user_xfeatures to supported bits of XCR0 To: Peter Xu Cc: Sean Christopherson , Paolo Bonzini , Sasha Levin , linux-kernel@vger.kernel.org, stable@vger.kernel.org, Thomas Gleixner , Ingo Molnar , Borislav Petkov , Dave Hansen , x86@kernel.org, "Chang S. Bae" , Andy Lutomirski , kvm@vger.kernel.org Content-Type: text/plain; charset="UTF-8" X-Spam-Status: No, score=-3.3 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_NONE, SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hello Peter, On Tue, Jun 7, 2022 at 5:07 PM Peter Xu wrote: > > On Tue, Jun 07, 2022 at 02:17:54PM -0400, Peter Xu wrote: > > On Tue, Jun 07, 2022 at 03:04:27PM +0000, Sean Christopherson wrote: > > > On Tue, Jun 07, 2022, Paolo Bonzini wrote: > > > > On 6/6/22 23:27, Peter Xu wrote: > > > > > On Mon, Jun 06, 2022 at 06:18:12PM +0200, Paolo Bonzini wrote: > > > > > > > However there seems to be something missing at least to me, on why it'll > > > > > > > fail a migration from 5.15 (without this patch) to 5.18 (with this patch). > > > > > > > In my test case, user_xfeatures will be 0x7 (FP|SSE|YMM) if without this > > > > > > > patch, but 0x0 if with it. > > > > > > > > > > > > What CPU model are you using for the VM? > > > > > > > > > > I didn't specify it, assuming it's qemu64 with no extra parameters. > > > > > > > > Ok, so indeed it lacks AVX and this patch can have an effect. > > > > > > > > > > For example, if the source lacks this patch but the destination has it, > > > > > > the source will transmit YMM registers, but the destination will fail to > > > > > > set them if they are not available for the selected CPU model. > > > > > > > > > > > > See the commit message: "As a bonus, it will also fail if userspace tries to > > > > > > set fpu features (with the KVM_SET_XSAVE ioctl) that are not compatible to > > > > > > the guest configuration. Such features will never be returned by > > > > > > KVM_GET_XSAVE or KVM_GET_XSAVE2." > > > > > > > > > > IIUC you meant we should have failed KVM_SET_XSAVE when they're not aligned > > > > > (probably by failing validate_user_xstate_header when checking against the > > > > > user_xfeatures on dest host). But that's probably not my case, because here > > > > > KVM_SET_XSAVE succeeded, it's just that the guest gets a double fault after > > > > > the precopy migration completes (or for postcopy when the switchover is > > > > > done). > > > > > > > > Difficult to say what's happening without seeing at least the guest code > > > > around the double fault (above you said "fail a migration" and I thought > > > > that was a different scenario than the double fault), and possibly which was > > > > the first exception that contributed to the double fault. > > > > > > Regardless of why the guest explodes in the way it does, is someone planning on > > > bisecting this (if necessary?) and sending a backport to v5.15? There's another > > > bug report that is more than likely hitting the same bug. > > > > What's the bisection you mentioned? I actually did a bisection and I also > > checked reverting Leo's change can also fix this issue. Or do you mean > > something else? > > Ah, I forgot to mention on the "stable tree decisions": IIUC it also means > we should apply Leo's patch to all the stable trees if possible, then > migrations between them won't trigger the misterous faults anymore, > including when migrating to the latest Linux versions. > > However there's the delimma that other kernels (any kernel that does not > have Leo's patch) will start to fail migrations to the stable branches that > apply Leo's patch too.. IIUC, you commented before that the migration issue should be solved with a QEMU fix, is that correct? That would mean something like 'QEMU is relying on a kernel bug to work', and should be no blocker for fixing the kernel. If that's the case, I think we should apply the fix to every supported stable branch that have the fpku issue, and in parallel come with a qemu fix for that. What do you think about it? Best regards, Leo > So that's kind of a slight pity. It's just IIUC > the stable trees are more important, because it should have a broader > audience (most Linux distros)? > > > > > > > > > https://lore.kernel.org/all/48353e0d-e771-8a97-21d4-c65ff3bc4192@sentex.net > > > > That is kvm64, and I agree it could be the same problem since both qemu64 > > and kvm64 models do not have any xsave feature bit declared in cpuid 0xd, > > so potentially we could be migrating some fpu states to it even with > > user_xfeatures==0 on dest host. > > > > So today I continued the investigation, and I think what's really missing > > is qemu seems to be ignoring the user_xfeatures check for KVM_SET_XSAVE and > > continues even if it returns -EINVAL. IOW, I'm wondering whether we should > > fail properly and start to check kvm_arch_put_registers() retcode. But > > that'll be a QEMU fix, and it'll at least not causing random faults > > (e.g. double faults) in guest but we should fail the migration gracefully. > > > > Sean: a side note is that I can also easily trigger one WARN_ON_ONCE() in > > your commit 98c25ead5eda5 in kvm_arch_vcpu_ioctl_run(): > > > > WARN_ON_ONCE(kvm_lapic_hv_timer_in_use(vcpu)); > > > > It'll be great if you'd like to check that up. > > > > Thanks, > > > > -- > > Peter Xu > > -- > Peter Xu >