Received: by 10.223.164.202 with SMTP id h10csp291122wrb; Wed, 8 Nov 2017 16:44:45 -0800 (PST) X-Google-Smtp-Source: ABhQp+Q78LFKWmc0kCCnVN+z4ud8vFOIRlw2qo5MQHgwIy2+60sA0wcrXuAUaVKt1Kra/nBJJHyS X-Received: by 10.99.119.199 with SMTP id s190mr2146963pgc.110.1510188284898; Wed, 08 Nov 2017 16:44:44 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1510188284; cv=none; d=google.com; s=arc-20160816; b=NbVQpxxaZPj01oUzJme7EcsGHnqYJ5q72mbFHvtO5kvM3MbPilKXzWwQkayAwYxewx /003A/XRIvjDQStwocVZ/FaJ9IKzDZm3SMSOZ7l8ilBHe2DZNbSdg/BraQC0RzvHQRez SKITRM8nGLay04PI8xSNP0SOx1k9R0g/A+0vWvpsZvT6B+NIx94/K8Wm6lkfi4wzfVKc Eo4a3M7ESssIuwF579eefXY13wTOnpo9NRBhDwrtEaE9Gwj0qnG01anbTE4zVpC9iTGo RAVCVPQQRmJinqsBwk21i1tIfNGOzUPyyLAOenr2Ei65X1q+27FoHPDw9jTUpB3lWxwI abxw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:cc:to:subject :message-id:date:from:references:in-reply-to:mime-version :dkim-signature:arc-authentication-results; bh=a1sr2mArvoBwP0v1qHGbtomZbPqQJJO4dwtyeVyKipM=; b=Hc4dzzvHoh4lDQ9/U3iRrC19eVr4k5M81ngNIBvrVihc6T7VAm4GMtfVTWu++nQQDp hbisXH+AkyHAgAfFHeP5C0SgoMIrAIteiFzGzIalPyiZR3m3iKZP52r22ujGHEffOMsM FZTdt8v5u5y22ghfTTFFFAjeMygy1ZjPDbERcDjcNfxcO34OKPmNuI3WHankBfiC6BaM lcyR4Q0fYHc2dnnZRkr5NiTgGeGJlGtgQ5BELjGs0wLWaXgqG7ySo4xTZhgLp3AEIQ9f m5YCpGIzc2zvnw+TGfLGr7EiDHjJjSE18O+s4CGp9WFiUHn1ME6HhXpL6WtFzeSpV0zs NgIg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=jVol3+T9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id v74si5282590pfd.77.2017.11.08.16.44.33; Wed, 08 Nov 2017 16:44:44 -0800 (PST) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20161025 header.b=jVol3+T9; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=gmail.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753153AbdKIAnx (ORCPT + 84 others); Wed, 8 Nov 2017 19:43:53 -0500 Received: from mail-oi0-f65.google.com ([209.85.218.65]:53562 "EHLO mail-oi0-f65.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752716AbdKIAnv (ORCPT ); Wed, 8 Nov 2017 19:43:51 -0500 Received: by mail-oi0-f65.google.com with SMTP id h6so3289685oia.10; Wed, 08 Nov 2017 16:43:51 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:in-reply-to:references:from:date:message-id:subject:to :cc:content-transfer-encoding; bh=a1sr2mArvoBwP0v1qHGbtomZbPqQJJO4dwtyeVyKipM=; b=jVol3+T9y+k141FjIjuN41bAl0BxU+o5lrdss18jkfXuwYvjOuxs7UNrDCVgdnaOPN g5DjTjGP0MrY+eMwRRjh9Tn/zMCnZ9j/+legD7sLjaBbyuRa/6Xr6nA4/6/pasUzKA9G qqMzo+k+lYDwU0kcFiOtcK1n/8YIYbo9gmqIe/EksJnQibRDDKzzlMaF2Aa8w1R2PBr9 4W/sxJOKGAMfRmZ2Rt/arcnWYAHBYKWgWgQYFwKF/l0SCinISQ69Siz7itRhSuuwkWWp Y7JzOp3uSEb8OwAWC0WgbYIZ1sT2ia2vlzbpaPowlW4fwIpY6fDRhOq69RNWI3jYFoZU u4Gg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:in-reply-to:references:from:date :message-id:subject:to:cc:content-transfer-encoding; bh=a1sr2mArvoBwP0v1qHGbtomZbPqQJJO4dwtyeVyKipM=; b=Ut1c1nWGIFWdtmOqpcdsxuS4sB9dqDmC1pNDkzoFJa/g9yJa59Jyj54JUxhVHMINXA /igWCh0kN+L5BkQ+HeK1qAUTfD2g+lTOKKtvZkhZe3VbM1vAt43M6OqAFNzlDxbgNskv Bo1coIk7S7ZuELByB/FHPr1PzlT7A5K/oWeFiHQVOJAC+Zc6l9Uzxq3ppNGnZGNDS0OF G8rvolljKHbsPVipRkGh+FWRKNkfazNcOweG6HGQlIRqNdoape4RYZlYZNwagAYrHI55 Y9dMBKRE4qfSbYpyCK6m0pMEITohFdXrNL3/qWfbLPjDktEMOD9l4tYsbqEfszT/rQbn sswA== X-Gm-Message-State: AJaThX4m0sqaLBFKXVpLSUMfwPNWN7YKwM0rV61cGVRbz8BCoLPLaJwg mgaIJkcTrpOyyrL9i3dX/uPDOJfosaTNMGmqp1c= X-Received: by 10.202.234.2 with SMTP id i2mr1422684oih.348.1510188230605; Wed, 08 Nov 2017 16:43:50 -0800 (PST) MIME-Version: 1.0 Received: by 10.74.53.27 with HTTP; Wed, 8 Nov 2017 16:43:50 -0800 (PST) In-Reply-To: <20171108162630.GA3099@flask> References: <1509970640-18679-1-git-send-email-wanpeng.li@hotmail.com> <20171108162630.GA3099@flask> From: Wanpeng Li Date: Thu, 9 Nov 2017 08:43:50 +0800 Message-ID: Subject: Re: [PATCH v2] KVM: X86: Fix softlockup when get the current kvmclock timestamp To: =?UTF-8?B?UmFkaW0gS3LEjW3DocWZ?= Cc: "linux-kernel@vger.kernel.org" , kvm , Paolo Bonzini , Wanpeng Li Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org 2017-11-09 0:26 GMT+08:00 Radim Kr=C4=8Dm=C3=A1=C5=99 : > 2017-11-06 04:17-0800, Wanpeng Li: >> From: Wanpeng Li >> >> watchdog: BUG: soft lockup - CPU#6 stuck for 22s! [qemu-system-x86:1018= 5] >> CPU: 6 PID: 10185 Comm: qemu-system-x86 Tainted: G OE 4.14.= 0-rc4+ #4 >> RIP: 0010:kvm_get_time_scale+0x4e/0xa0 [kvm] >> Call Trace: >> ? get_kvmclock_ns+0xa3/0x140 [kvm] >> get_time_ref_counter+0x5a/0x80 [kvm] >> kvm_hv_process_stimers+0x120/0x5f0 [kvm] >> ? kvm_hv_process_stimers+0x120/0x5f0 [kvm] >> ? preempt_schedule+0x27/0x30 >> ? ___preempt_schedule+0x16/0x18 >> kvm_arch_vcpu_ioctl_run+0x4b4/0x1690 [kvm] >> ? kvm_arch_vcpu_load+0x47/0x230 [kvm] >> kvm_vcpu_ioctl+0x33a/0x620 [kvm] >> ? kvm_vcpu_ioctl+0x33a/0x620 [kvm] >> ? kvm_vm_ioctl_check_extension_generic+0x3b/0x40 [kvm] >> ? kvm_dev_ioctl+0x279/0x6c0 [kvm] >> do_vfs_ioctl+0xa1/0x5d0 >> ? __fget+0x73/0xa0 >> SyS_ioctl+0x79/0x90 >> entry_SYSCALL_64_fastpath+0x1e/0xa9 >> >> This can be reproduced when running kvm-unit-tests/hyperv_stimer.flat an= d >> cpu-hotplug stress simultaneously. __this_cpu_read(cpu_tsc_khz) returns = 0 >> (set in kvmclock_cpu_down_prep()) when the pCPU is unhotplug which resul= ts >> in kvm_get_time_scale() gets into an infinite loop. >> >> This patch fixes it by skipping to fill the hv_clock when the pCPU is of= fline. >> >> Cc: Paolo Bonzini >> Cc: Radim Kr=C4=8Dm=C3=A1=C5=99 >> Signed-off-by: Wanpeng Li >> --- >> v1 -> v2: >> * avoid infinite loop >> >> arch/x86/kvm/x86.c | 3 +++ >> 1 file changed, 3 insertions(+) >> >> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c >> index 03869eb..d2507c6 100644 >> --- a/arch/x86/kvm/x86.c >> +++ b/arch/x86/kvm/x86.c >> @@ -1259,6 +1259,9 @@ static void kvm_get_time_scale(uint64_t scaled_hz,= uint64_t base_hz, >> uint64_t tps64; >> uint32_t tps32; >> >> + if (unlikely(base_hz =3D=3D 0)) >> + return; > > This is a sensible thing to do and will prevent the loop, but KVM will > still have a minor bug: get_kvmclock_ns() passes uninitialized stack > values with the expectation that kvm_get_time_scale() will set them, but > returning here would result in __pvclock_read_cycles() with random data > and inject timer interrupts early (if not worse). > > I think it would be best if kvm_get_time_scale() wasn't executing when > cpu_tsc_khz is 0, by clearing cpu_tsc_khz later and setting earlier; > do you see any problems with moving the CPUHP_AP_X86_KVM_CLK_ONLINE > before CPUHP_AP_ONLINE? I think this will break Thomas's hotplug state machine, and I'm not the hotplug expert. How about something like below to avoid the random data: diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c index 34c85aa..954f510 100644 --- a/arch/x86/kvm/x86.c +++ b/arch/x86/kvm/x86.c @@ -1795,10 +1795,13 @@ u64 get_kvmclock_ns(struct kvm *kvm) /* both __this_cpu_read() and rdtsc() should be on the same cpu */ get_cpu(); - kvm_get_time_scale(NSEC_PER_SEC, __this_cpu_read(cpu_tsc_khz) * 100= 0LL, - &hv_clock.tsc_shift, - &hv_clock.tsc_to_system_mul); - ret =3D __pvclock_read_cycles(&hv_clock, rdtsc()); + if (__this_cpu_read(cpu_tsc_khz)) { + kvm_get_time_scale(NSEC_PER_SEC, __this_cpu_read(cpu_tsc_khz) * 1000LL, + &hv_clock.tsc_shift, + &hv_clock.tsc_to_system_mul); + ret =3D __pvclock_read_cycles(&hv_clock, rdtsc()); + } else + ret =3D ktime_get_boot_ns() + ka->kvmclock_offset; put_cpu(); From 1583515916682845638@xxx Wed Nov 08 16:27:39 +0000 2017 X-GM-THRID: 1583276151146767154 X-Gmail-Labels: Inbox,Category Forums,HistoricalUnread