Received: by 2002:a05:7412:b10a:b0:f3:1519:9f41 with SMTP id az10csp2853945rdb; Mon, 4 Dec 2023 09:11:22 -0800 (PST) X-Google-Smtp-Source: AGHT+IFfXxtOnOZtkSu3qrhx2+jzpHBWmqYZx9h8iCgPww7Z+aqmyc9/FrLi74G3l6yGffZv1360 X-Received: by 2002:a17:902:724c:b0:1d0:6ffd:e2e6 with SMTP id c12-20020a170902724c00b001d06ffde2e6mr4400854pll.128.1701709882402; Mon, 04 Dec 2023 09:11:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1701709882; cv=none; d=google.com; s=arc-20160816; b=iJNBw/o8kO3rTWEDmjHx09ovrji/zNnaMNaNptyA6tCUjUXXPuyEJzPq1FPAVkOVQd BiEXzolxz8/dfHP1kR/DS904GehpKp1jiXOV/37fLvr6YwKh9tkcQJ6Fagjhq1JImabU PPe1iEw7h8MwK0ML/JzjQV6GXPJW4WGaB1gY6C0luEX2xDqTemFfKQRTqlv6V78xLQep Qx3Zb2FWzU41g68DM5GFepWT9io7XOdD+lgQGRZweyLIvf6t3HbrVPw4ZXKmjEyq6v+c vTEHFN3ApWgVx7Z2urNQgtkt0C08TS6S2HQY++iaswlc+V0fgxNG1KdKZDak0UlxevfS ARCg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:in-reply-to:content-disposition:mime-version :references:reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=N7OSmWvNx0rOwAi696zRyP7etai5R9U9M+H1NHfZodg=; fh=nQTlVV4kJjpP584zKHYStF6iFbqrh3yHhptmviEWUrw=; b=Fa/9AEq1wZdsxL5z0VptCsf5IlhGp0aoM1/T8uhJrPLBmuCWfuQ7MouCPwwyMBWggY sQCvl2LMgd85QufxC7rgXvR4TsTsFl/q52OC2Jit8DYfHA1l53PVANXNdHBFRvFzvknY uLLjVUEQdSU2kQ2YSQfeS4jJTs3xLWR8kTNyVarn1hmLSERKAghLypk1sVJOrLCVJoRW HzYKFG7QjmM66tiF5KyxpO4qg9Mr2j8+zMelBF2GCdVcIZQWMzO83FURcJDPHFFDJXmp eVllAk5FNxuEB6TZop/G9jZytWWtHEKvXMPfkVdZPw61UEBAE7P3HM40fLbAValvUSFm 6btQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=B83T4568; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from howler.vger.email (howler.vger.email. [23.128.96.34]) by mx.google.com with ESMTPS id x22-20020a170902821600b001cfd52a2266si5171877pln.403.2023.12.04.09.11.21 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Mon, 04 Dec 2023 09:11:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) client-ip=23.128.96.34; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=B83T4568; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id 7BD2D8083A92; Mon, 4 Dec 2023 09:11:19 -0800 (PST) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.11 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S231430AbjLDRK4 (ORCPT + 99 others); Mon, 4 Dec 2023 12:10:56 -0500 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:56886 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S234869AbjLDRKw (ORCPT ); Mon, 4 Dec 2023 12:10:52 -0500 Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 9E795CD for ; Mon, 4 Dec 2023 09:10:58 -0800 (PST) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 44926C433C9; Mon, 4 Dec 2023 17:10:58 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1701709858; bh=eYlfVLjV3hhcYavJ60dgUIHZwtpyUj6mNwY7SPULvfM=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=B83T4568LPa9narG26Op803qArGPHwWv2OwSIGkYCDpoVlBa2OHjqmUZLe3rcco3n yIYQEhAzJVk5ZdGsV3PX5JlhkWEuS3RUFzeZ5d0U7R5mFvaqNKcpXq5E+KgGwkTAvm ad48ikdKZpQGMC9vu3msxUWiZnFTNMICvgfaneoW4v0mHl4c+B7klcqYbS6jPH7vYB qyjcv+dN+X8aShmr9ai7RiHYfn6Pi1/uz7Kmb2PjlsZF3Wi8FpNr0bJfn1MHqeYUIG 7Pi6loKtLCdjqUUfQyNL5zZIOA8WQHIX7svm3dtOKHb9XQVy/+iVHxr/69QRVLFHk0 MuCE2tj1haSSQ== Received: by paulmck-ThinkPad-P17-Gen-1.home (Postfix, from userid 1000) id CD3F8CE0CC3; Mon, 4 Dec 2023 09:10:57 -0800 (PST) Date: Mon, 4 Dec 2023 09:10:57 -0800 From: "Paul E. McKenney" To: sfr@canb.auug.org.au, peterz@infradead.org, tglx@linutronix.de Cc: linux-kernel@vger.kernel.org Subject: Re: [BUG] RCU CPU stall warning (bisected) Message-ID: <995d8fb2-6901-489a-9191-360ad074dad3@paulmck-laptop> Reply-To: paulmck@kernel.org References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: X-Spam-Status: No, score=-1.2 required=5.0 tests=DKIMWL_WL_HIGH,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,MAILING_LIST_MULTI, SPF_HELO_NONE,SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=unavailable autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on howler.vger.email Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Mon, 04 Dec 2023 09:11:19 -0800 (PST) On Wed, Nov 22, 2023 at 10:43:52AM -0800, Paul E. McKenney wrote: > Hello! > > Just FYI for the moment. > > I hit the following three times out of 15 ten-hour TREE03 rcutorture > runs on next-20231121, which suggests an MTBF of about 50 hours. This is > new over the past week or so. > > The symptom is that the RCU grace-period kthread is marked as running > ("R"), but remains stuck in schedule() for the remainder of the run. > > My next steps will be to retry on today's -next, and if that reproduces > the bug, attempt to bisect. > > But I figured that I should send this out on the off-chance that it is > a known problem. And the bisection fingered this commit, maybe even rightfully so: 5c0930ccaad5 ("hrtimers: Push pending hrtimers away from outgoing CPU earlier") Next step is to revert this on top of v6.7-rc3 and retest. I will let you know what happened when the test completes. And the error rate did drop off during the bisection, so it is possible that there are multiple causes of this bug. :-/ In the meantime, is this a known problem? ("Did I just waste a week bisecting something that has already been fixed?") Thanx, Paul