From: Thomas Gleixner
To: Florian Fainelli, Boqun Feng
Cc: Frederic Weisbecker, "Russell King (Oracle)", Joel Fernandes,
 Anna-Maria Behnsen, Linus Torvalds, linux-kernel@vger.kernel.org,
 kernel-team@meta.com, paulmck@kernel.org, mingo@kernel.org,
 rcu@vger.kernel.org, neeraj.upadhyay@amd.com, urezki@gmail.com,
 qiang.zhang1211@gmail.com, bigeasy@linutronix.de,
 chenzhongjin@huawei.com, yangjihong1@huawei.com, rostedt@goodmis.org,
 Justin Chen
Subject: Re: Unexplained long boot delays [Was Re: [GIT PULL] RCU changes for v6.9]
In-Reply-To: <8f977bbb-d949-4e90-b3d2-b9815189b842@gmail.com>
References: <2fb110ed-ba04-4320-9ef0-8766c9df5578@gmail.com>
 <533151c9-afb5-453b-8014-9fbe7c3b26c2@gmail.com>
 <87v85olez3.ffs@tglx> <87sf0sldbi.ffs@tglx>
 <8f977bbb-d949-4e90-b3d2-b9815189b842@gmail.com>
Date: Thu, 14 Mar 2024 23:51:50 +0100
Message-ID: <87msr0l94p.ffs@tglx>

On Thu, Mar 14 2024 at 14:53, Florian Fainelli wrote:
> On 3/14/24 14:21, Thomas Gleixner wrote:
>> 8ca1836769d758e4fbf5851bb81e181c52193f5d is related, but does not fully
>> explain the fail. I haven't yet spotted where this goes into lala land.
>
> It was a lot harder to generate the same issue on cold boot against
> 8ca1836769d758e4fbf5851bb81e181c52193f5d,

That's good as it points in exactly the right direction as far as I can
tell from the data we have, but I might be wrong in the end.

> however, just like against 36e40df35d2c1891fe58241640c7c95de4aa739b,
> it would happen resuming from suspend to DRAM whereby the CPU core(s)
> lost their power and had to be re-initialized. Eventually I got a cold
> boot log:
>
> https://gist.github.com/ffainelli/b5684585c78518a5492cbbf1c7dce16e

The picture is similar to the one before:

  <idle>-0  2d.... 3016627us : tmigr_update_events: child=00000000 group=6b74d49d group_lvl=0 child_active=0 group_active=1 nextevt=3023000000 next_expiry=3023000000 child_evt_expiry=0 child_evtcpu=0
  <idle>-0  2d.... 3016628us : tmigr_group_set_cpu_inactive: group=6b74d49d lvl=0 numa=0 active=1 migrator=1 parent=00000000 childmask=4
  <idle>-0  2d.... 3016629us : tmigr_cpu_idle: cpu=2 parent=6b74d49d nextevt=3023000000 wakeup=9223372036854775807
  <idle>-0  0d.... 3016684us : tmigr_group_set_cpu_inactive: group=6b74d49d lvl=0 numa=0 active=0 migrator=ff parent=00000000 childmask=1
  <idle>-0  0d.... 3016685us : tmigr_cpu_idle: cpu=0 parent=6b74d49d nextevt=9223372036854775807 wakeup=9223372036854775807
  <idle>-0  0d.... 3024623us : tmigr_cpu_new_timer_idle: cpu=0 parent=6b74d49d nextevt=9223372036854775807 wakeup=9223372036854775807
  <idle>-0  1d.s.. 162080532us : timer_cancel: timer=2e281df7

Just a different CPU this time.

The first expiring timer:

  kcompact-42  1d.... 2552891us : timer_start: timer=2e281df7 function=process_timeout expires=4294670348 [timeout=500] bucket_expiry=4294670352 cpu=1 idx=66 flags=

Last expiry before going south:

  <idle>-0  1..s.. 3006620us : timer_expire_entry: timer=6f47b280 function=process_timeout now=4294670302 baseclk=4294670302

  4294670352 - 4294670302 = 50
  3006620us + 50*1000us = 3056620us

So the previous observation of hitting the exact point of the last CPU
going idle does not matter.

What really bothers me is:

  <idle>-0  2d.... 3016629us : tmigr_cpu_idle: cpu=2 parent=6b74d49d nextevt=3023000000 wakeup=9223372036854775807

which has an event between these events:

  <idle>-0  0d.... 3016685us : tmigr_cpu_idle: cpu=0 parent=6b74d49d nextevt=9223372036854775807 wakeup=9223372036854775807
  <idle>-0  0d.... 3024623us : tmigr_cpu_new_timer_idle: cpu=0 parent=6b74d49d nextevt=9223372036854775807 wakeup=9223372036854775807

But that event is before the next expiring timer.

Confused, but Frederic just told me on IRC he's on to something.

> Does the consistent ~159s mean anything?

I don't think so. It might be the limitation of the clockevent device,
the maximum sleep time restriction or some other unrelated event (device
interrupt, hrtimer) which happens to pop up after this time for some
reason.

But it's definitely not relevant to the problem itself. It's just the
thing which brings the machine back to life. Otherwise it might sit
there forever.

Thanks,

        tglx
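
For reference, the expiry arithmetic in the mail can be replayed as a small sketch. It assumes HZ=1000 (one jiffy is 1000us, as the 50*1000us step implies) and that the timer_cancel line at 162080532us marks the point where timer 2e281df7 was finally handled. The constant and function names are illustrative only and do not come from the kernel.

```python
# Values copied from the trace lines quoted in the mail.
LAST_EXPIRY_US = 3_006_620      # timestamp of the last timer_expire_entry
NOW_JIFFIES = 4_294_670_302     # now= in that timer_expire_entry
BUCKET_EXPIRY = 4_294_670_352   # bucket_expiry= of timer 2e281df7
TIMER_CANCEL_US = 162_080_532   # timestamp of timer_cancel for 2e281df7

# Assumption: HZ=1000, so one jiffy corresponds to 1000us.
US_PER_JIFFY = 1000


def expected_expiry_us(last_us, now_jiffies, bucket_expiry,
                       us_per_jiffy=US_PER_JIFFY):
    """Project the trace timestamp at which bucket_expiry should be reached."""
    return last_us + (bucket_expiry - now_jiffies) * us_per_jiffy


expected = expected_expiry_us(LAST_EXPIRY_US, NOW_JIFFIES, BUCKET_EXPIRY)
# Assumption: timer_cancel is the moment the overdue timer was handled.
late_by_us = TIMER_CANCEL_US - expected

print(expected)     # 3056620
print(late_by_us)   # 159023912
```

Under those assumptions the timer is handled roughly 159s after it was due, which lines up with the ~159s delay Florian reports.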