Received: by 2002:a05:6359:6284:b0:131:369:b2a3 with SMTP id se4csp2780363rwb; Mon, 7 Aug 2023 03:29:19 -0700 (PDT) X-Google-Smtp-Source: AGHT+IGYW4RAzBikIYKka9+McIWsEvQLYjorXhzVP58qREfaJzFtlA0HNToWm2KZPHFgJB0o6+Gx X-Received: by 2002:a17:903:124f:b0:1b8:b41e:66b4 with SMTP id u15-20020a170903124f00b001b8b41e66b4mr10130359plh.67.1691404158742; Mon, 07 Aug 2023 03:29:18 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1691404158; cv=none; d=google.com; s=arc-20160816; b=mixU2nd6XonGNRAU3U73tL+PCkCBJ12OyLrZB7xy+vY6zgTZgEhroRB5edinMZYCkN xiGSX7q3+zUclFVGvOCywTJEubNRfiwapE6QFy/kSeiYMZ2F1zH38OPvtTMI8vrjl3EX x+9+JIgAKl/oGI+eRrlqsDcsVab+dQ2vbbFRbjY3N1rMFaZaDQPhT7cAugTFE72Do9x7 8bUgeTL0wT48mmUizM9t86NmkR3gl1G5+qxZ0+kEWrZL+gyIZvwRWh0DR1sDvVn+3PRp JaFSqQuTT/ptFMkwHQWPzM2dMymLXY0iKFiFf8s71O9bpglZnxK6DoOAhNggYR8EltYr 1QgA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:message-id:date:subject:cc:to :dkim-signature:dkim-signature:from; bh=s/zPaS9Z5FLXbNNBcGX5RsYNKU80L6L1pU+cuGQC75s=; fh=apWchzV4oMgbpwiaDmJAfiSaIT1o03VWxwj95y/rE5k=; b=Ta7iOQeUS4h5AhEkoVih55tbjWXxdFoi7iJL/AqpYj091a6Og31l+TKe/MOlRwZrlp +VqYAgGK1Rwbb8VIiu2xBfBMmbVg6b3Pn5uLLxL+dkR6LZ/BfWTzwfgl7P/i88287foK gbusQtehMmpuKPEmPpvsxIfGv6KbdPZNWB1ta6oq0zdwl1ZJbYikzoH7kaAFJy4LC5A7 gx1vF6YuhKJUcT5AX1O3MsdV2A5gc2TOtBwO8ym+CI61N+gpxQBf4qs3IOamcoWOBukY qm5ahWGLlNgYcCWl1yhHHEBjQj4dAiiL63P6WbNWx68CqnA8dODUdvBlg5cjse6fu+9u DMMg== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b="0Co9hz0/"; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e header.b=zulpOQEH; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id p18-20020a170902c71200b001b7fa1a9a36si3199949plp.67.2023.08.07.03.29.07; Mon, 07 Aug 2023 03:29:18 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b="0Co9hz0/"; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e header.b=zulpOQEH; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231452AbjHGJTp (ORCPT + 99 others); Mon, 7 Aug 2023 05:19:45 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:52022 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231422AbjHGJTm (ORCPT ); Mon, 7 Aug 2023 05:19:42 -0400 Received: from galois.linutronix.de (Galois.linutronix.de [193.142.43.55]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 4C885E7B for ; Mon, 7 Aug 2023 02:19:41 -0700 (PDT) From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1691399978; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type; bh=s/zPaS9Z5FLXbNNBcGX5RsYNKU80L6L1pU+cuGQC75s=; b=0Co9hz0/oJEiK/ZGH5xWX1TtqD/N4t54kEoeMNL+adKYgg8HzSahcuynNMEB2GwGqCCIiP aHf5jJYvzrHVTSBsE9KwqEwkF2MjdW3+KFZmr1Bi/+5rHDTXrmwGx8fHMbKVd1SjCXhBuw AiGLI8x09fIUb4pjuAM5n/4xCeKhTCSu8c8i8nseqtoPJdKInNDo7W5mjWCN5gUKAkDT8v TryXxDsHI0bUhRJGaKjjFz7g7xbR6wfEM1dlsAMqwgJvg4RiMo+YXWy2aQ89YXEElKWI2n jsNFhepV8IJ62fhI94dQsEPJ1+USAMEfIoM5lsMLLFlmQFzZaemNfvUuhzVlJg== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1691399978; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type; bh=s/zPaS9Z5FLXbNNBcGX5RsYNKU80L6L1pU+cuGQC75s=; b=zulpOQEHSPTDDri5elyM+eEkPFHigOFF3+RZX6aKSCfaaxAiIlefgpsGOQ5y+g/lJmBdWV v+FcBS/Eqri69EBQ== To: Juergen Gross Cc: LKML , Andrew Cooper , Jan Beulich , xen-devel@lists.xenproject.org, "Paul E. McKenney" , x86@kernel.org Subject: [BUG] XEN/PV dom0 time management Date: Mon, 07 Aug 2023 11:19:38 +0200 Message-ID: <87a5v3us45.ffs@tglx> MIME-Version: 1.0 Content-Type: text/plain X-Spam-Status: No, score=-4.4 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,RCVD_IN_DNSWL_MED,SPF_HELO_NONE, SPF_PASS autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi! Something in XEN/PV time management seems to be seriously broken: timekeeping watchdog on CPU9: Marking clocksource 'tsc' as unstable because the skew is too large: [ 152.557154] clocksource: 'xen' wd_nsec: 511979417 wd_now: 24e4d7625e wd_last: 24c65332c5 mask: ffffffffffffffff [ 152.566197] clocksource: 'tsc' cs_nsec: 512468734 cs_now: 9a306c9b808c cs_last: 9a302c9e30ba mask: ffffffffffffffff [ 152.572319] clocksource: Clocksource 'tsc' skewed 489317 ns (0 ms) over watchdog 'xen' interval of 511979417 ns (511 ms) [ 152.578067] clocksource: 'tsc' is current clocksource. [ 152.581023] tsc: Marking TSC unstable due to clocksource watchdog [ 152.583751] clocksource: Checking clocksource tsc synchronization from CPU 5 to CPUs 0,3,8,10,12,15. [ 152.590860] clocksource: CPUs 8 ahead of CPU 5 for clocksource tsc. [ 152.597196] clocksource: CPU 5 check durations 14197ns - 124761ns for clocksource tsc. [ 152.602675] clocksource: Switched to clocksource xen This is fully reproducible with variations of the failure report in the following setup: - VM running on KVM on a SKLX machine - Debian bookworm install with XEN 4.17 - Happens with the off the shelf debian 6.1 kernel and with current upstream (6.5-rc4) Why am I convinced that this is a XENPV issue? Simply because the same kernels booted w/o XEN on the same VM and the same hardware do not have any issue with using TSC as clocksource. The TSC on that machine is stable and fully synchronized. The clocksource watchdog uses kvm-clock to monitor TSC and it never had any complaints. But with XEN underneath its a matter of minutes after boot to happen. I tried to make sense out of it, but ran out of steam and patience, so I decided to report this to the XEN wizards. Thanks, tglx