Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753198AbdI0Npg (ORCPT ); Wed, 27 Sep 2017 09:45:36 -0400 Received: from Galois.linutronix.de ([146.0.238.70]:53122 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1753155AbdI0Npe (ORCPT ); Wed, 27 Sep 2017 09:45:34 -0400 Date: Wed, 27 Sep 2017 15:45:19 +0200 (CEST) From: Thomas Gleixner To: Paolo Bonzini cc: Denis Plotnikov , rkrcmar@redhat.com, kvm@vger.kernel.org, john.stultz@linaro.org, mingo@redhat.com, hpa@zytor.com, linux-kernel@vger.kernel.org, x86@kernel.org, rkagan@virtuozzo.com, den@virtuozzo.com Subject: Re: [PATCH v5 1/6] timekeeper: introduce extended clocksource reading callback In-Reply-To: <52d6cc44-b065-93d8-a284-9e372033ba9c@redhat.com> Message-ID: References: <1504106628-172372-1-git-send-email-dplotnikov@virtuozzo.com> <1504106628-172372-2-git-send-email-dplotnikov@virtuozzo.com> <7b886c9f-70a7-951b-ff6f-f77f263754e7@redhat.com> <7f73ec16-7262-329c-d2c3-d5481142c370@redhat.com> <52d6cc44-b065-93d8-a284-9e372033ba9c@redhat.com> User-Agent: Alpine 2.20 (DEB 67 2015-01-07) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2088 Lines: 70 On Wed, 27 Sep 2017, Paolo Bonzini wrote: > On 27/09/2017 13:53, Thomas Gleixner wrote: > >> I think the hook should be specific to x86. For example it could be an > >> array of function pointers, indexed by vclock_mode, with the same > >> semantics as read_with_stamp. > > I don't think you need that. > > > > The get_time_fn() which is handed in to get_device_system_crossstamp() can > > convey that information: > > > > /* > > * Try to synchronously capture device time and a system > > * counter value calling back into the device driver > > */ > > ret = get_time_fn(&xtstamp->device, &system_counterval, ctx); > > if (ret) > > return ret; > > > > So in your case get_time_fn() would be kvmclock or hyperv clock specific > > and the actual hypercall implementation can return a failure code if the > > requirements are not met: > > > > 1) host clock source is TSC > > 2) capturing of host time and TSC is atomic > > So you are suggesting reusing the cross-timestamp hypercall to implement > nested pvclock. There are advantages and disadvantages to that. > > With read_with_stamp-like callbacks: > > + running on old KVM or on Hyper-V is supported > - pvclock_gtod_copy does not go away > > With hypercall-based callbacks on the contrary: > > + KVM can use ktime_get_snapshot for the bare metal case > - only very new KVM is supported I don't think that it's an either or decision. get_device_system_crossstamp(get_time_fn, ......) So you can have specific get_time_fn() implementations for your situation: old_kvm_fn() retrieve data from pvclock_gtod copy new_kvm_fn() use hypercall hyperv_fn() do what must be done All implementations need a way to tell you: 1) Host time 2) Host TSC timestamp which corresponds to #1 3) Validity For old_kvm_fn() pvclock_gtod_data.clock.vclock_mode == VCLOCK_TSC For new_kvm_fn() hypercall result For hyperv_fn() whatever it takes Thanks, tglx