From: Pavel Tatashin
To: steven.sistare@oracle.com, daniel.m.jordan@oracle.com,
    linux@armlinux.org.uk, schwidefsky@de.ibm.com,
    heiko.carstens@de.ibm.com, john.stultz@linaro.org,
    sboyd@codeaurora.org, x86@kernel.org, linux-kernel@vger.kernel.org,
    mingo@redhat.com, tglx@linutronix.de, hpa@zytor.com,
    douly.fnst@cn.fujitsu.com, peterz@infradead.org, prarit@redhat.com,
    feng.tang@intel.com, pmladek@suse.com, gnomes@lxorguk.ukuu.org.uk,
    linux-s390@vger.kernel.org
Subject: [PATCH v12 06/11] time: replace read_boot_clock64() with read_persistent_wall_and_boot_offset()
Date: Thu, 21 Jun 2018 17:25:13 -0400
Message-Id: <20180621212518.19914-7-pasha.tatashin@oracle.com>
X-Mailer: git-send-email 2.17.1
In-Reply-To: <20180621212518.19914-1-pasha.tatashin@oracle.com>
References: <20180621212518.19914-1-pasha.tatashin@oracle.com>

If an architecture does not support exact boot time, it is challenging to
estimate boot time without having a reference to the current persistent
clock value.
Yet, we cannot read the persistent clock time again, because this may
lead to math discrepancies with the caller of read_boot_clock64(), who
has read the persistent clock at a different time.

This is why it is better to provide two values simultaneously: the
persistent clock value, and the boot time.

Thus, we replace read_boot_clock64() with:

  read_persistent_wall_and_boot_offset(wall_time, boot_offset)

where wall_time is the value returned by read_persistent_clock64(), and
boot_offset is wall_time - boot time.

We calculate boot_offset using the current value of local_clock(), so
architectures that do not have a dedicated boot clock but have an early
sched_clock(), such as SPARCv9, x86, and possibly more, will benefit from
this change by getting a better and more consistent estimate of the boot
time without the need for an arch-specific implementation.

Signed-off-by: Pavel Tatashin
---
 include/linux/timekeeping.h |  3 +-
 kernel/time/timekeeping.c   | 61 +++++++++++++++++++------------------
 2 files changed, 34 insertions(+), 30 deletions(-)

diff --git a/include/linux/timekeeping.h b/include/linux/timekeeping.h
index 86bc2026efce..686bc27acef0 100644
--- a/include/linux/timekeeping.h
+++ b/include/linux/timekeeping.h
@@ -243,7 +243,8 @@ extern void ktime_get_snapshot(struct system_time_snapshot *systime_snapshot);
 extern int persistent_clock_is_local;
 
 extern void read_persistent_clock64(struct timespec64 *ts);
-extern void read_boot_clock64(struct timespec64 *ts);
+void read_persistent_wall_and_boot_offset(struct timespec64 *wall_clock,
+					  struct timespec64 *boot_offset);
 extern int update_persistent_clock64(struct timespec64 now);
 
 /*
diff --git a/kernel/time/timekeeping.c b/kernel/time/timekeeping.c
index 4786df904c22..aface5c13e7d 100644
--- a/kernel/time/timekeeping.c
+++ b/kernel/time/timekeeping.c
@@ -17,6 +17,7 @@
 #include
 #include
 #include
+#include
 #include
 #include
 #include
@@ -1496,18 +1497,23 @@ void __weak read_persistent_clock64(struct timespec64 *ts64)
 }
 
 /**
- * read_boot_clock64 - Return time of the system start.
+ * read_persistent_wall_and_boot_offset - Read persistent clock, and also offset
+ * from the boot.
  *
  * Weak dummy function for arches that do not yet support it.
- * Function to read the exact time the system has been started.
- * Returns a timespec64 with tv_sec=0 and tv_nsec=0 if unsupported.
- *
- * XXX - Do be sure to remove it once all arches implement it.
+ * wall_time - current time as returned by persistent clock
+ * boot_offset - offset that is defined as wall_time - boot_time
+ * The default function calculates offset based on the current value of
+ * local_clock(). This way architectures that support sched_clock() but don't
+ * support dedicated boot time clock will provide the best estimate of the
+ * boot time.
  */
-void __weak read_boot_clock64(struct timespec64 *ts)
+void __weak __init
+read_persistent_wall_and_boot_offset(struct timespec64 *wall_time,
+				     struct timespec64 *boot_offset)
 {
-	ts->tv_sec = 0;
-	ts->tv_nsec = 0;
+	read_persistent_clock64(wall_time);
+	*boot_offset = ns_to_timespec64(local_clock());
 }
 
 /* Flag for if timekeeping_resume() has injected sleeptime */
@@ -1521,28 +1527,28 @@ static bool persistent_clock_exists;
  */
 void __init timekeeping_init(void)
 {
+	struct timespec64 wall_time, boot_offset, wall_to_mono;
 	struct timekeeper *tk = &tk_core.timekeeper;
 	struct clocksource *clock;
 	unsigned long flags;
-	struct timespec64 now, boot, tmp;
-
-	read_persistent_clock64(&now);
-	if (!timespec64_valid_strict(&now)) {
-		pr_warn("WARNING: Persistent clock returned invalid value!\n"
-			" Check your CMOS/BIOS settings.\n");
-		now.tv_sec = 0;
-		now.tv_nsec = 0;
-	} else if (now.tv_sec || now.tv_nsec)
-		persistent_clock_exists = true;
 
-	read_boot_clock64(&boot);
-	if (!timespec64_valid_strict(&boot)) {
-		pr_warn("WARNING: Boot clock returned invalid value!\n"
-			" Check your CMOS/BIOS settings.\n");
-		boot.tv_sec = 0;
-		boot.tv_nsec = 0;
+	read_persistent_wall_and_boot_offset(&wall_time, &boot_offset);
+	if (timespec64_valid_strict(&wall_time) &&
+	    timespec64_to_ns(&wall_time)) {
+		persistent_clock_exists = true;
+	} else {
+		pr_warn("Persistent clock returned invalid value\n");
+		wall_time = (struct timespec64){0};
 	}
 
+	if (timespec64_compare(&wall_time, &boot_offset) < 0)
+		boot_offset = (struct timespec64){0};
+
+	/* We want to set wall_to_mono, so the following is true:
+	 * wall time + wall_to_mono = boot time
+	 */
+	wall_to_mono = timespec64_sub(boot_offset, wall_time);
+
 	raw_spin_lock_irqsave(&timekeeper_lock, flags);
 	write_seqcount_begin(&tk_core.seq);
 	ntp_init();
@@ -1552,13 +1558,10 @@ void __init timekeeping_init(void)
 		clock->enable(clock);
 	tk_setup_internals(tk, clock);
 
-	tk_set_xtime(tk, &now);
+	tk_set_xtime(tk, &wall_time);
 	tk->raw_sec = 0;
-	if (boot.tv_sec == 0 && boot.tv_nsec == 0)
-		boot = tk_xtime(tk);
 
-	set_normalized_timespec64(&tmp, -boot.tv_sec, -boot.tv_nsec);
-	tk_set_wall_to_mono(tk, tmp);
+	tk_set_wall_to_mono(tk, wall_to_mono);
 
 	timekeeping_update(tk, TK_MIRROR | TK_CLOCK_WAS_SET);
 
-- 
2.17.1
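
For readers who want to check the arithmetic in timekeeping_init(), here is
a minimal user-space sketch of it. It is not kernel code, and everything in
it is an assumption made for illustration: struct ts64 and the helpers
ns_to_ts64(), ts64_compare(), ts64_sub() and model_wall_to_mono() are
hypothetical stand-ins for struct timespec64, ns_to_timespec64(),
timespec64_compare(), timespec64_sub() and the wall_to_mono computation,
and ns_since_boot stands in for the value local_clock() would return.

```c
#include <assert.h>
#include <stdint.h>

#define NSEC_PER_SEC 1000000000LL

/* Hypothetical user-space stand-in for the kernel's struct timespec64. */
struct ts64 {
	int64_t tv_sec;
	long tv_nsec;
};

/* Split a non-negative nanosecond count into seconds and nanoseconds. */
static struct ts64 ns_to_ts64(int64_t ns)
{
	struct ts64 t;

	assert(ns >= 0);
	t.tv_sec = ns / NSEC_PER_SEC;
	t.tv_nsec = (long)(ns % NSEC_PER_SEC);
	return t;
}

/* Return <0, 0 or >0 if a is earlier than, equal to or later than b. */
static int ts64_compare(struct ts64 a, struct ts64 b)
{
	if (a.tv_sec != b.tv_sec)
		return a.tv_sec < b.tv_sec ? -1 : 1;
	if (a.tv_nsec != b.tv_nsec)
		return a.tv_nsec < b.tv_nsec ? -1 : 1;
	return 0;
}

/* Compute a - b, normalized so that 0 <= tv_nsec < NSEC_PER_SEC. */
static struct ts64 ts64_sub(struct ts64 a, struct ts64 b)
{
	struct ts64 d;

	d.tv_sec = a.tv_sec - b.tv_sec;
	d.tv_nsec = a.tv_nsec - b.tv_nsec;
	if (d.tv_nsec < 0) {
		d.tv_sec--;
		d.tv_nsec += (long)NSEC_PER_SEC;
	}
	return d;
}

/*
 * Model of the wall_to_mono computation: boot_offset is the time elapsed
 * since boot. If the wall clock reads less than that (for example, there
 * is no usable persistent clock), boot_offset is dropped to zero. The
 * result satisfies wall_time + wall_to_mono == boot_offset.
 */
static struct ts64 model_wall_to_mono(struct ts64 wall_time,
				      int64_t ns_since_boot)
{
	struct ts64 boot_offset = ns_to_ts64(ns_since_boot);

	if (ts64_compare(wall_time, boot_offset) < 0)
		boot_offset = (struct ts64){0};

	return ts64_sub(boot_offset, wall_time);
}
```

With a wall time of 1529616313.5 s and 2.25 s elapsed since boot, the model
gives wall_to_mono = { -1529616312, 750000000 }, and adding that to the wall
time yields 2.25 s again. With a zero wall time and a non-zero elapsed time,
boot_offset is discarded and wall_to_mono is { 0, 0 }.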