Received: by 2002:a05:6a10:a841:0:0:0:0 with SMTP id d1csp417003pxy; Wed, 28 Apr 2021 06:57:43 -0700 (PDT) X-Google-Smtp-Source: ABdhPJxawSBoWv6+7UugjeVdNGDzG1/luUCHOlRE4gYOtnoegHLU2o2aJC1pRva6GZ0rmSKhC1XL X-Received: by 2002:aa7:db95:: with SMTP id u21mr11343044edt.152.1619618263083; Wed, 28 Apr 2021 06:57:43 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1619618263; cv=none; d=google.com; s=arc-20160816; b=rsHzr6Q87iK9YWBpovnjFtl3mPA4aLYvdQQzH+RqUDt9KC0wmIElSSCZIGzuUkwFwv uiaLBgRkT4VnuWJbHqg62L4fjyp4Ugc1luVdQBGtt9BQcOIF2DVbt/5ktJSG9rwcE9yR UWU746JOTTeOvtRRr20Qd85+MR5wKoFTITfEhVOnwjPzjaNvZ85qCtrLLL0SID2nSfkP QIz7mvOe/dy4GZPCa74meFz4Bf0Fd00USUSyjLXJR4Kre2xAfGBPUdGzIlPRki8aFMfc 9y/VBmss2lrd3Ii30IJXGKvMHHEX21Ev3sex1HubS/bGyB2GucRfW8MSbjIhSoPBO6Lj rKfQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:references :in-reply-to:subject:cc:to:dkim-signature:dkim-signature:from; bh=FkABMUrVTIed7Pud/pAOG8Jsv0xIjFbTnk9egvEwBMI=; b=H1NNwqOdPMILur6VSfyMYDG4JrfOy7xRW5Nq6xuuufK/7zpyddRuOqIc5xkZtIAsm7 h/xwVPp59uqdtBYG+jMCPtW+CT3Htfz3qGSAOfJ5IkXK7eFa9EDVotU35Wfa6AViLLbS zuWPiHT8HkSEBu9GXKRqPF+ilLhC5TqmJWx3jsiNDscq6Gyh/w6WVD8V1KPdkRu/8R7t gEo+1kRo+FROszNOguxGsub2PEnxQnig9aV/feCOLiroKIET010lacBscURjHVaWkGy1 Y8PKRzBt+jrjf3ng069RLwcnX541jKevSANLArwg8hNwIjQ2EifqpDyYmdStpymzqOEm Fw1g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=KpQv1InN; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id oq26si79284ejb.28.2021.04.28.06.57.12; Wed, 28 Apr 2021 06:57:43 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=KpQv1InN; dkim=neutral (no key) header.i=@linutronix.de; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S238907AbhD1KPk (ORCPT + 99 others); Wed, 28 Apr 2021 06:15:40 -0400 Received: from Galois.linutronix.de ([193.142.43.55]:47124 "EHLO galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S238964AbhD1KPg (ORCPT ); Wed, 28 Apr 2021 06:15:36 -0400 From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1619604889; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=FkABMUrVTIed7Pud/pAOG8Jsv0xIjFbTnk9egvEwBMI=; b=KpQv1InN6faOHKAL2CchKOUwht+Fha5LEV8LrjBreikUq9MiIR631sEGs1QKb6/ivK2Gqm niCHF878oI4rpNKo7heekExFFyG42bwtnQug3/LmO95RYdnBk5TIpyKkM6Xvc5/R3WOaIh V9JD0l6xW3V9wvpPJ6jI+91qmBUb1eJ6DOWK+Kgb9jMSfA5Ngy62Ar1/9Plr808HIwWzer aumStgiM2krWtt2Aw8YXDyseMnlIX+Mvnufb7ghOslIbLJxABcpwkGKYbU0LoqpIzhINCa smGJ/cCQz7knPQ7ndQ9O1ahmu6hFOrhtNKxTgQXhF7hRoMa07zkg2+spW+DYnw== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1619604889; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=FkABMUrVTIed7Pud/pAOG8Jsv0xIjFbTnk9egvEwBMI=; b=Jiguxh1vcFNG8UMGL0S4K6oJHuaUIpNTvsL3X6FIfeBtw8j5D9I0uYFXrPPzL+/YSKNRfC GMlWQmo5LuFougAw== To: paulmck@kernel.org Cc: Feng Tang , kernel test robot , 0day robot , John Stultz , Stephen Boyd , Jonathan Corbet , Mark Rutland , Marc Zyngier , Andi Kleen , Xing Zhengjun , LKML , lkp@lists.01.org, kernel-team@fb.com, neeraju@codeaurora.org, zhengjun.xing@intel.com Subject: Re: [clocksource] 8c30ace35d: WARNING:at_kernel/time/clocksource.c:#clocksource_watchdog In-Reply-To: <20210428014819.GA965980@paulmck-ThinkPad-P17-Gen-1> References: <20210427175027.GA945820@paulmck-ThinkPad-P17-Gen-1> <87y2d3mo2q.ffs@nanos.tec.linutronix.de> <20210428014819.GA965980@paulmck-ThinkPad-P17-Gen-1> Date: Wed, 28 Apr 2021 12:14:49 +0200 Message-ID: <87czuen2au.ffs@nanos.tec.linutronix.de> MIME-Version: 1.0 Content-Type: text/plain Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Apr 27 2021 at 18:48, Paul E. McKenney wrote: > On Tue, Apr 27, 2021 at 11:09:49PM +0200, Thomas Gleixner wrote: >> Paul, >> >> On Tue, Apr 27 2021 at 10:50, Paul E. McKenney wrote: >> > On Tue, Apr 27, 2021 at 06:37:46AM -0700, Paul E. McKenney wrote: >> >> I suppose that I give it (say) 120 seconds instead of the current 60, >> >> which might be the right thing to do, but it does feel like papering >> >> over a very real initramfs problem. Alternatively, I could provide a >> >> boot parameter allowing those with slow systems to adjust as needed. >> > >> > OK, it turns out that there are systems for which boot times in excess >> > of one minute are expected behavior. They are a bit rare, though. >> > So what I will do is keep the 60-second default, add a boot parameter, >> > and also add a comment by the warning pointing out the boot parameter. >> >> Oh, no. This starts to become yet another duct tape horror show. >> >> I'm not at all against a more robust and resilent watchdog mechanism, >> but having a dozen knobs to tune and heuristics which are doomed to fail >> is not a solution at all. > > One problem is that I did the .max_drift patch backwards. I tightened > the skew requirements on all clocks except those specially marked, and > I should have done the reverse. With that change, all of the clocks > except for clocksource_tsc would work (or as the case might be, fail to > work) in exactly the same way that they do today, but still rejecting > false-positive skew events due to NMIs, SMIs, vCPU preemption, and so on. > > Then patch v10 7/7 can go away completely, and patch 6/7 becomes much > smaller (and gets renamed), for example, as shown below. > > Does that help? No. Because the problem is on both ends. We have TSC early which has inaccurate frequency and we have watchdogs which are inaccurate, i.e. refined jiffies. So the threshold has to take both into account. Thanks, tglx