Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp4678777pxu; Mon, 21 Dec 2020 20:22:23 -0800 (PST) X-Google-Smtp-Source: ABdhPJw1cqQX9fcdvm7LJj5g/FuBIplAP/0VwA/JSZy0MAbO85ab5LEbh7wuActxsJ0lKi1aqBny X-Received: by 2002:a50:d484:: with SMTP id s4mr18609016edi.13.1608610942870; Mon, 21 Dec 2020 20:22:22 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1608610942; cv=none; d=google.com; s=arc-20160816; b=wkm0zUuw4HpF4tf9atlMVCMEqDVo4QSbNVl/IaLzj/N7HuiOXLuMYP2otjnMURGpL9 dHb5Vh99jqOpcT/zGVdfEvLEXLZoW5r9BueJsqH46bwXFFf0Lm1YFWpG5ZkeBo49ooHv RTLAbddbCg+u2NGtQwKXH2PBZiIraltrYHW0hy52Y+dw6SNgMW98gALpu33GkI4MrmlP pA4jRigWP1t6O58Z1H7hKbwfANvPHvzu0hsHE9ldDqaBLUKN7qvS5wKpnM9STIfeByOc 6jhnae3g2LV5j+gP6/if/bjhi+ksJ6ZIiVjy7vhBt5+NSN73bqE/NyJHj6ufui0oCzX3 HWEw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=J6BkJ/ul2iB4Gu+INQznfAH4DxMZZBy0vnrxTB+7XsI=; b=MXFoZI4S/Wp/jNjAAUGiWO7LUx9JXpTZGhnBisGP1U7nWK4cXuG9lSXPmwvk1L5VhY KpAcqMtZ7WXKTrhxuNtdQQVH3iN/c40B2MFlIDbTGJPdHBdYuJkcot1wP+roeg/RTfuu Vh2KdfeRh9FyeJzdvyLb+gj5pucS/SkVwQX60Q2PSKriv3t1zHME1yYxNKV47R9m3WkV wtk7mXqEoC2kYwe2nhp5q6ps3PtHEfuUH3LNMhNdFSUp5VDNhMJcmPs5K3ucnvqdfyiy rHEr3svlLjdv5H6dPeycYETpaa2kWQuHvX9kBUU3WiwM2qo5PeLOKGi7czWga3eXs4NN PIHQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=isn+vkb2; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id dc22si9308480edb.328.2020.12.21.20.21.59; Mon, 21 Dec 2020 20:22:22 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=isn+vkb2; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1725895AbgLVEV2 (ORCPT + 99 others); Mon, 21 Dec 2020 23:21:28 -0500 Received: from mail.kernel.org ([198.145.29.99]:57446 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1725820AbgLVEV2 (ORCPT ); Mon, 21 Dec 2020 23:21:28 -0500 Received: by mail.kernel.org (Postfix) with ESMTPSA id 5363722ADC; Tue, 22 Dec 2020 04:20:47 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1608610847; bh=6WW6bchu/hFRXl913wHiGV8Ial4KKNDAr7VAEGblU1k=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=isn+vkb2O2t1irCOzkUOYOdSMqy+yOtuxF87CSzxlAvcWAcwNi54j0JcfGXKRt6zc a4SaC9U98fX9aQ+wLOtvw9DS2wD3Qi37ZWqvjHkGc5/fJ+bPGor6kU4LdyBv03Bdap 9cn21kZyAEAXB/F+fOhH2UJrIjp0Vaxs674CeiZLXVGFElT8naGhToaaOs0NOZMcmb NdilQ8w/aQD0WtDzuPG9qwTG6TMbT0Qm9Pjm1bu7QBgV/2suoW557rM8bR1vFFhnn+ G0JXefyQdtqGzduDWA0dHjQKhrXI6ISrXpLxqiEjb6cO8p3LGl48jcZ1mL/BfjYDYC oOcpiTzOZn/Wg== Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 16D8C352275F; Mon, 21 Dec 2020 20:20:47 -0800 (PST) Date: Mon, 21 Dec 2020 20:20:47 -0800 From: "Paul E. McKenney" To: Frederic Weisbecker Cc: LKML , "Rafael J . Wysocki" , Peter Zijlstra , Ingo Molnar , Fabio Estevam , stable@vger.kernel.org, Thomas Gleixner , Len Brown , Pengutronix Kernel Team , NXP Linux Team , Daniel Lezcano , Shawn Guo , Sascha Hauer Subject: Re: [PATCH 1/4] sched/idle: Fix missing need_resched() check after rcu_idle_enter() Message-ID: <20201222042047.GZ2657@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20201222013712.15056-1-frederic@kernel.org> <20201222013712.15056-2-frederic@kernel.org> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20201222013712.15056-2-frederic@kernel.org> User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Tue, Dec 22, 2020 at 02:37:09AM +0100, Frederic Weisbecker wrote: > Entering RCU idle mode may cause a deferred wake up of an RCU NOCB_GP > kthread (rcuog) to be serviced. > > Usually a wake up happening while running the idle task is spotted in > one of the need_resched() checks carefully placed within the idle loop > that can break to the scheduler. > > Unfortunately in default_idle_call(), the call to rcu_idle_enter() is > already beyond the last need_resched() check and we may halt the CPU > with a resched request unhandled, leaving the task hanging. > > Fix this with performing a last minute need_resched() check after > calling rcu_idle_enter(). > > Reported-by: Paul E. McKenney > Fixes: 96d3fd0d315a (rcu: Break call_rcu() deadlock involving scheduler and perf) > Cc: stable@vger.kernel.org > Cc: Peter Zijlstra > Cc: Rafael J. Wysocki > Cc: Thomas Gleixner > Cc: Ingo Molnar > Signed-off-by: Frederic Weisbecker Tested-by: Paul E. McKenney > --- > kernel/sched/idle.c | 18 ++++++++++++------ > 1 file changed, 12 insertions(+), 6 deletions(-) > > diff --git a/kernel/sched/idle.c b/kernel/sched/idle.c > index 305727ea0677..1af60dc50beb 100644 > --- a/kernel/sched/idle.c > +++ b/kernel/sched/idle.c > @@ -109,15 +109,21 @@ void __cpuidle default_idle_call(void) > rcu_idle_enter(); > lockdep_hardirqs_on(_THIS_IP_); > > - arch_cpu_idle(); > + /* > + * Last need_resched() check must come after rcu_idle_enter() > + * which may wake up RCU internal tasks. > + */ > + if (!need_resched()) { > + arch_cpu_idle(); > + raw_local_irq_disable(); > + } > > /* > - * OK, so IRQs are enabled here, but RCU needs them disabled to > - * turn itself back on.. funny thing is that disabling IRQs > - * will cause tracing, which needs RCU. Jump through hoops to > - * make it 'work'. > + * OK, so IRQs are enabled after arch_cpu_idle(), but RCU needs > + * them disabled to turn itself back on.. funny thing is that > + * disabling IRQs will cause tracing, which needs RCU. Jump through > + * hoops to make it 'work'. > */ > - raw_local_irq_disable(); > lockdep_hardirqs_off(_THIS_IP_); > rcu_idle_exit(); > lockdep_hardirqs_on(_THIS_IP_); > -- > 2.25.1 >