Received: by 2002:a25:1985:0:0:0:0:0 with SMTP id 127csp3240121ybz; Mon, 27 Apr 2020 12:29:35 -0700 (PDT) X-Google-Smtp-Source: APiQypJyXzabNGMQre4xtUFZVw7B1vpftJG9oKJKpJaOFyV/D/SpKRLgNfLGcr8TzucV/huf1an0 X-Received: by 2002:a50:bb07:: with SMTP id y7mr19646859ede.358.1588015775614; Mon, 27 Apr 2020 12:29:35 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1588015775; cv=none; d=google.com; s=arc-20160816; b=OAR2eNEZqrywH1NizqnZGMcgkSVeP66GzaM5e4ng4CMyAJhz1BjFOkchvCsN6bapA0 8yp6KBFMUln6Lkem0+zvn5M/bdXmRVNS0oSFm9J/yKrI1X2XWREwRQPBizlH1smLIjTB dBch8FLCsuU8+lbifhkYNWWkttxYxF9zkrajaScctaShD9YCvQBdZZlegwBkriyjp++O br29QY62TluZEhpLzIQMHbqAdksdgd40XYYTo5cMvMUTwb0HIQKj9MNFgCP+s1iXQytF 0ZvP6Rnlt4TQ76b4GybTOVseEfjqLjBgRZlRLY9l36IrqxTByAyKPne9t2F56h95SlLx 7yuw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:content-transfer-encoding:mime-version :references:in-reply-to:date:cc:to:from:subject:message-id :dkim-signature; bh=61fptsV5DLKpmlNchEBjtkM0zcul7LMw/TsciRLcZlM=; b=uorc6BCmVmuNS1sHpy1Aytb7dgzR6q+BZQWBWnHaDfE26SQKpzYViRvFARUd96jd6T uzSjhvtW5PqlOu71cC/XfkTrkPyjJ72TRh4x5BbJ9uTRfMxjcrQun3PxR1mOM3Z2dmv9 WfBbUEwkygulxw6i8igID53SKQhYf70QyHwRh211LWMnsRUyE98W72rJDI9glUnCVDTM XsPKgY6R0nc0hL8s4GUFuG+rJXBcWjrtOFjWN9K6mwkUQXxyq0iW2J/nf1Dps3FefQq+ iaSMuIzDF0GVy8TUQIrAACzIHmg1iAfNZGMUL67c2q5shTYcYfLLUUGPzQZrXbWuoOEg kyuw== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=iDvgkZAb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id b7si364634eju.8.2020.04.27.12.29.11; Mon, 27 Apr 2020 12:29:35 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=iDvgkZAb; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1726698AbgD0T04 (ORCPT + 99 others); Mon, 27 Apr 2020 15:26:56 -0400 Received: from mail.kernel.org ([198.145.29.99]:43958 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726315AbgD0T0z (ORCPT ); Mon, 27 Apr 2020 15:26:55 -0400 Received: from tzanussi-mobl (c-98-220-238-81.hsd1.il.comcast.net [98.220.238.81]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 1D01220775; Mon, 27 Apr 2020 19:26:54 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1588015615; bh=+6ORraxcq0bFebqoTSSgJj2lGpfwZ/siWQ6vQG2K/Ek=; h=Subject:From:To:Cc:Date:In-Reply-To:References:From; b=iDvgkZAbTXBJmLrnTrp5dbaKID+/ZmDkyzHXhFmHDlJn7HtVvQO2LUoiqilYn0AES RBN698cxUIZHagHu47Wla60SruRknJobEmXOnxVBLpeRg3PXDhbqpMANtwsQuviQaw zu5MruNuJDi2STH/bn4ZGdnc1ZHBSs6S3tLZpQ4U= Message-ID: <2535a046bc15f28478c5289742e617a468becbd6.camel@kernel.org> Subject: Re: [PATCH RT 10/30] hrtimer: Prevent using hrtimer_grab_expiry_lock() on migration_base From: Tom Zanussi To: Steven Rostedt , Rasmus Villemoes Cc: linux-kernel@vger.kernel.org, linux-rt-users , Thomas Gleixner , Carsten Emde , Sebastian Andrzej Siewior , John Kacur , Julia Cartwright , Daniel Wagner , Julien Grall Date: Mon, 27 Apr 2020 14:26:52 -0500 In-Reply-To: <20200427150626.4f217c82@gandalf.local.home> References: <20200123203930.646725253@goodmis.org> <20200123203943.749508731@goodmis.org> <20200427150626.4f217c82@gandalf.local.home> Content-Type: text/plain; charset="UTF-8" X-Mailer: Evolution 3.28.5-0ubuntu0.18.04.1 Mime-Version: 1.0 Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 2020-04-27 at 15:06 -0400, Steven Rostedt wrote: > On Mon, 27 Apr 2020 15:10:00 +0200 > Rasmus Villemoes wrote: > > > However, building with their rescue initrd and booting that, the > > board > > would consistently hang during reboot. Sometimes I would get lines > > like > > > > [ 72.956630] sched: RT throttling activated > > [ 72.973769] lanx: port 1(lan1) entered disabled state > > [ 73.000401] lanx: port 2(lan2) entered disabled state > > [ 73.974951] lanx: port 3(lan3) entered disabled state > > [ 73.997473] lanx: port 4(lan4) entered disabled state > > [ 74.968006] lanx: port 5(lan5) entered disabled state > > > > other times there would be no output, but the board was still > > hanging. > > Reverting > > > > b1a471ec4df1 - hrtimer: Prevent using hrtimer_grab_expiry_lock() on > > migration_base > > 40aae5708e7a - hrtimer: Add a missing bracket and hide > > `migration_base' > > on !SMP > > > > on top of v4.19.94-rt39 makes that problem go away, i.e. the board > > reboots as expected. > > > > The board is a 32 bit powerpc (mpc8309) !SMP. Any ideas what I can > > do to > > debug this further? > > Thanks Rasmus for looking into this. Tom now maintains 4.19-rt. > > Tom, care to pull in these patches on top of 4.19-rt? > Those patches are already in 4.19-rt - he's saying that reverting them fixes the problem. I'm guessing that the assumption of base or base->cpu_base always being non-NULL in those patches might be wrong. If so, the below patch should fix the problem: Subject: [PATCH] hrtimer: Add back base and base->cpu_base checks in hrtimer_grab_expiry_lock() 4.19 commit b1a471ec4df1 [hrtimer: Prevent using hrtimer_grab_expiry_lock() on migration_base] removed the NULL checks for timer->base and timer->base->cpu_base on the assumption that they're always non-NULL. That assumption is apparently not to be true, so add the checks back. Signed-off-by: Tom Zanussi --- kernel/time/hrtimer.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/kernel/time/hrtimer.c b/kernel/time/hrtimer.c index e54a95de8b79..6f20cf23008b 100644 --- a/kernel/time/hrtimer.c +++ b/kernel/time/hrtimer.c @@ -953,7 +953,7 @@ void hrtimer_grab_expiry_lock(const struct hrtimer *timer) { struct hrtimer_clock_base *base = READ_ONCE(timer->base); - if (timer->is_soft && is_migration_base(base)) { + if (timer->is_soft && base && base->cpu_base && is_migration_base(base)) { spin_lock(&base->cpu_base->softirq_expiry_lock); spin_unlock(&base->cpu_base->softirq_expiry_lock); } -- 2.17.1 > -- Steve