Received: by 2002:ac0:a582:0:0:0:0:0 with SMTP id m2-v6csp62660imm; Thu, 11 Oct 2018 15:28:54 -0700 (PDT) X-Google-Smtp-Source: ACcGV623sr/jozl74AK6aXuszDEj4AJZp3b/f5OVHz+qkDH2s+kzDwErTbx5id96bRG3rDjHd+OS X-Received: by 2002:a63:b4b:: with SMTP id a11-v6mr2901867pgl.97.1539296934635; Thu, 11 Oct 2018 15:28:54 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1539296934; cv=none; d=google.com; s=arc-20160816; b=LhzhyFcrrLbnos+CKwX9c2G1UN9O6HkvFB9NBLsrG+9lR8tkYaqZN/kVsOFbRxvm2W IvU1TYHWR17PZmY6XzelZJ26xMZpKagh7sXLgtSh9kPmiKQvvIrwJVLOPjPAymr5a26E rAn8Yzv/Sfp1Nrz7WlCFEObii17xIUW2kJzhrRstbwgj1UsF0VGWs7g9JbIHrIjom8Gw fAcQZCSG5UzrAer143BkN1z34FaADVzJ/gDFDO91iNrEuVa4oMPAGHMnKK4jlPUAvc2P X/+RSlz3Vv6jrU02jNsYoLygHDmy0C9BPMh3ydTVDRIgb7Oo6L6+sexYt2FR/c5W00A7 v/4Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:user-agent:in-reply-to :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=meO/VwrTM8wK1o8lP3NJmcS4fnZmHmDA3Sf1YdkLHLU=; b=nu/4X09q7NQ9jVbvJwPZOApzC1InxhxX89iL1wqWJYnHaCGOcfNQRhz9TS33DB+wJJ dxI9v6abL0za5h379/46a9rSvvaa2LVzYVykB7N1j9st9y2gqvr6RCow7TCp2OQaHxZT crIgxCgnsKWzCNCwTpTprgpppG+dJ7BgzkSV6LfrUIszfYXBa47VrlEz1Ax1/iXyp4Jj ln1z/fy1KEeZIWDkpp1JiyfP173JzrQockqi3RrswYkFQUdMAeAivSW7ERp9UU665rPf d8GtCKXGRru2mJvovxF9VlnjoH3xsdEyyXyR28gq7fI0NOAnVOxNHNc+1gCH5vWW3CIH zXbg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id g10-v6si4101963pll.337.2018.10.11.15.28.35; Thu, 11 Oct 2018 15:28:54 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=redhat.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726568AbeJLF5b (ORCPT + 99 others); Fri, 12 Oct 2018 01:57:31 -0400 Received: from mx1.redhat.com ([209.132.183.28]:55316 "EHLO mx1.redhat.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725840AbeJLF5b (ORCPT ); Fri, 12 Oct 2018 01:57:31 -0400 Received: from smtp.corp.redhat.com (int-mx01.intmail.prod.int.phx2.redhat.com [10.5.11.11]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.redhat.com (Postfix) with ESMTPS id 98A9830001D5; Thu, 11 Oct 2018 22:28:11 +0000 (UTC) Received: from amt.cnet (ovpn-112-2.gru2.redhat.com [10.97.112.2]) by smtp.corp.redhat.com (Postfix) with ESMTP id 11CA06048C; Thu, 11 Oct 2018 22:28:10 +0000 (UTC) Received: from amt.cnet (localhost [127.0.0.1]) by amt.cnet (Postfix) with ESMTP id A21AC105144; Thu, 11 Oct 2018 19:27:55 -0300 (BRT) Received: (from marcelo@localhost) by amt.cnet (8.14.7/8.14.7/Submit) id w9BMRj1O018303; Thu, 11 Oct 2018 19:27:45 -0300 Date: Thu, 11 Oct 2018 19:27:44 -0300 From: Marcelo Tosatti To: Andy Lutomirski Cc: Peter Zijlstra , Vitaly Kuznetsov , Thomas Gleixner , Paolo Bonzini , Radim Krcmar , Wanpeng Li , LKML , X86 ML , Matt Rickard , Stephen Boyd , John Stultz , Florian Weimer , KY Srinivasan , devel@linuxdriverproject.org, Linux Virtualization , Arnd Bergmann , Juergen Gross Subject: Re: [patch 00/11] x86/vdso: Cleanups, simmplifications and CLOCK_TAI support Message-ID: <20181011222744.GA17955@amt.cnet> References: <20181004193150.GQ19272@hirez.programming.kicks-ass.net> <499807AB-E779-40C3-AA3F-E8C77A7770EC@amacapital.net> <20181006202731.GC7129@amt.cnet> <20181008152650.GB27822@amt.cnet> <20181008193632.GA31729@amt.cnet> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) X-Scanned-By: MIMEDefang 2.79 on 10.5.11.11 X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.5.16 (mx1.redhat.com [10.5.110.42]); Thu, 11 Oct 2018 22:28:12 +0000 (UTC) Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Oct 09, 2018 at 01:09:42PM -0700, Andy Lutomirski wrote: > On Tue, Oct 9, 2018 at 8:28 AM Marcelo Tosatti wrote: > > > > On Mon, Oct 08, 2018 at 10:38:22AM -0700, Andy Lutomirski wrote: > > > On Mon, Oct 8, 2018 at 8:27 AM Marcelo Tosatti wrote: > > > > I read the comment three more times and even dug through the git > > > history. It seems like what you're saying is that, under certain > > > conditions (which arguably would be bugs in the core Linux timing > > > code), > > > > I don't see that as a bug. Its just a side effect of reading two > > different clocks (one is CLOCK_MONOTONIC and the other is TSC), > > and using those two clocks to as a "base + offset". > > > > As the comment explains, if you do that, can't guarantee monotonicity. > > > > > actually calling ktime_get_boot_ns() could be non-monotonic > > > with respect to the kvmclock timing. But get_kvmclock_ns() isn't used > > > for VM timing as such -- it's used for the IOCTL interfaces for > > > updating the time offset. So can you explain how my patch is > > > incorrect? > > > > ktime_get_boot_ns() has frequency correction applied, while > > reading masterclock + TSC offset does not. > > > > So the clock reads differ. > > > > Ah, okay, I finally think I see what's going on. In the kvmclock data > exposed to the guest, tsc_shift and tsc_to_system_mul come from > tgt_tsc_khz, whereas master_kernel_ns and master_cycle_now come from > CLOCK_BOOTTIME. So the kvmclock and kernel clock drift apart at a > rate given by the frequency shift and then suddenly agree again every > time the pvclock data is updated. Yes. > Is there a reason to do it this way? Since pvclock updates which update system_timestamp are expensive (must stop all vcpus), they should be avoided. So only HW TSC counts, and used as offset against vcpu's tsc_timestamp.