Date: Tue, 12 Mar 2024 14:15:41 -0700
From: "Paul E. McKenney"
To: Frederic Weisbecker
Cc: Florian Fainelli, Boqun Feng, torvalds@linux-foundation.org,
 linux-kernel@vger.kernel.org, kernel-team@meta.com, mingo@kernel.org,
 tglx@linutronix.de, rcu@vger.kernel.org, joel@joelfernandes.org,
 neeraj.upadhyay@amd.com, urezki@gmail.com, qiang.zhang1211@gmail.com,
 bigeasy@linutronix.de, anna-maria@linutronix.de, chenzhongjin@huawei.com,
 yangjihong1@huawei.com, rostedt@goodmis.org
Subject: Re: Unexplained long boot delays [Was Re: [GIT PULL] RCU changes for v6.9]
Reply-To: paulmck@kernel.org
References: <4274be61-60bd-4e1e-9c16-26e6e5e06f65@gmail.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Mar 12, 2024 at 10:01:00PM +0100, Frederic Weisbecker wrote:
> On Tue, Mar 12, 2024 at 01:32:03PM -0700, Florian Fainelli wrote:
> > Hi Boqun,
> >
> > On 3/8/24 09:15, Boqun Feng wrote:
> > > Hi Linus,
> > >
> > > Please pull this for the RCU changes of v6.9:
> > >
> > > The following changes since commit 41bccc98fb7931d63d03f326a746ac4d429c1dd3:
> > >
> > >   Linux 6.8-rc2 (2024-01-28 17:01:12 -0800)
> > >
> > > are available in the Git repository at:
> > >
> > >   git://git.kernel.org/pub/scm/linux/kernel/git/boqun/linux.git tags/rcu.next.v6.9
> > >
> > > for you to fetch changes up to 3add00be5fe5810d7aa5ec3af8b6a245ef33144b:
> > >
> > >   Merge branches 'rcu-doc.2024.02.14a', 'rcu-nocb.2024.02.14a', 'rcu-exp.2024.02.14a', 'rcu-tasks.2024.02.26a' and 'rcu-misc.2024.02.14a' into rcu.2024.02.26a (2024-02-26 17:37:25 -0800)
> > >
> > >
> > > Two merge conflicts were detected by linux-next:
> > >
> > > * https://lore.kernel.org/lkml/20240226135745.12ac854d@canb.auug.org.au/
> > > * https://lore.kernel.org/lkml/20240227125522.2bdbe6be@canb.auug.org.au/
> > >
> > > These conflict resolutions from linux-next look good to me, plus I made
> > > my own resolutions at branch merge/rcu.2024.02.27a for your reference.
> > >
> > >
> > > Some highlights of the changes:
> > >
> > > * Eliminates deadlocks involving do_exit() and RCU tasks, by Paul:
> > >   Instead of SRCU read side critical sections, now a percpu list is used
> > >   in do_exit() for scanning yet-to-exit tasks.
> > >
> > > * Fixes a deadlock due to the dependency between workqueue and RCU
> > >   expedited grace period, reported by Anna-Maria Behnsen and Thomas
> > >   Gleixner and fixed by Frederic: Now RCU expedited always uses its own
> > >   kthread worker instead of a workqueue.
> >
> > At least one device in my test farm (ARM 32-bit) has consistently shown a
> > very long boot, and some others are intermittently affected. This
> > consistently looks like this on most of my devices:
> >
> > [    2.450351] bcmgenet f0480000.ethernet: GENET 5.0 EPHY: 0x0000
> > [    2.547562] ata1: SATA link down (SStatus 0 SControl 300)
> > [  162.107264] unimac-mdio unimac-mdio.0: Broadcom UniMAC MDIO bus
> >
> > this gets flagged by my boot script as a boot failure since we exceeded the
> > 30 seconds timeout given to boot a kernel to a prompt.
> >
> > It has been somewhat difficult to get a reliable bisection going on, but
> > what I am sure of is that e5a3878c947ceef7b6ab68fdc093f3848059842c~1 does
> > not expose the problem for 10 consecutive boots, while I *might* see it at
> > e5a3878c947ceef7b6ab68fdc093f3848059842c and beyond.
> >
> > Any clues what is going on here?
>
> I understand it doesn't reliably reproduce. But is there any chance you could
> test that alone?
>
> 0bb11a372fc8 (rcu-tasks: Maintain real-time response in rcu_tasks_postscan())
>
> And if it reproduces there try a bisection?
>
> git bisect bad 0bb11a372fc8d7006b4d0f42a2882939747bdbff
> git bisect good 41bccc98fb7931d63d03f326a746ac4d429c1dd3

Good point!

Is this kernel built with CONFIG_PREEMPT_RT=y and CONFIG_TASKS_RCU=y?
Either way, is there also a large number of tasks exiting at this point
during boot?

							Thanx, Paul