Received: by 2002:a05:6a10:f347:0:0:0:0 with SMTP id d7csp1159047pxu; Fri, 27 Nov 2020 00:47:34 -0800 (PST) X-Google-Smtp-Source: ABdhPJyp06n/SmMb16yHicjlUnBFJVej63oIohT+EnBpCq16JqDBbKB/bHRjQJ3ZHtJzQcMYYA7O X-Received: by 2002:a17:906:2857:: with SMTP id s23mr6619957ejc.218.1606466853934; Fri, 27 Nov 2020 00:47:33 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1606466853; cv=none; d=google.com; s=arc-20160816; b=CtP1BOf4W4i1IGuwb+y9oHzHqTWeZ//In0kA3LsPT3yroXmJfTHWXcRftCvJRLN+UK RVRmp0f1R8wyJMLAJCsLjQJoJ/BgZigge6itzwp3SxpDM0D5zSd7vt+JQSzzgSeIsvbB LWTDM4hgGb8DSkOD9mxqlTf1NALBKrUNFa7RUJx+f3Ot8LdU9zlHRu/H1DZColZo4mBy jT+r5JIboheDfZQrPHRCunC2LydVCvgtTkY/kUiz/PsD8CuUXuYHb08Xz9OBWLaWc/lH +YMDwQdJPN4/rQ0+K0d4rnYtOQY/OzzOATX/1oCRZEyZTyTqXnnhI/pyEfvf9cXD2m+G zwuQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-disposition :mime-version:references:reply-to:message-id:subject:cc:to:from:date :dkim-signature; bh=dCGi71YXS4tVI9L9D5TR+Tys7hGWG3k3SUuGFanvp90=; b=s7PQ71qRWtillbQPRi4sJGpzR2U7HJu86m/YYz9JomCZCvhHV5kvG6wuZIh9q1D9bF Vsev2x4CJJdels92wX6fKEBkcJdo7R8jmVpbxC9vEAfYosPO9EPR3ykFozqiwVdsqCVo DOu5aKgB8vaYVtcM+i323iLJkEEhil5VuUL325bMccjMfrhw2lio4+lpp7rc6S7x8srn QEXckFAx1tq9SZ1gOam82whPULjrwk66zd6YvuOEPVuQUottIBpUgwIkIbXg+7UtSn5S 0I9yGZTE6+XgNJ56UcUTuiSY6rhdxnSwniJjF/1QrMFzyZS2jSRoh5fib6w7bkfQwC9D 2cNQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=BXuNNY3u; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id eb9si4211301edb.436.2020.11.27.00.47.11; Fri, 27 Nov 2020 00:47:33 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=default header.b=BXuNNY3u; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S2392041AbgKZXyl (ORCPT + 99 others); Thu, 26 Nov 2020 18:54:41 -0500 Received: from mail.kernel.org ([198.145.29.99]:41260 "EHLO mail.kernel.org" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S2388768AbgKZXyl (ORCPT ); Thu, 26 Nov 2020 18:54:41 -0500 Received: from paulmck-ThinkPad-P72.home (50-39-104-11.bvtn.or.frontiernet.net [50.39.104.11]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by mail.kernel.org (Postfix) with ESMTPSA id 871A52070E; Thu, 26 Nov 2020 23:54:40 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=default; t=1606434880; bh=W6yohlbRS+7Z1QywB+o8uI8Ck+aAOhVyrWRlMzT8nwU=; h=Date:From:To:Cc:Subject:Reply-To:References:In-Reply-To:From; b=BXuNNY3uGVwq+5eVEi0JMpfGbEY5EWhaJ1IdbmFp7srY5K9Ksd7VQHRyv69sskMLX xzBAy5YJtfKa0w0OKcwRTHI/EnxorNWZGf9xh88g/+CqF8vM1W3b9KQwUSiFcLGM/0 8GPI+R6ob4g9+1AEIT1usxVHg2xtUBgrbkrWeurI= Received: by paulmck-ThinkPad-P72.home (Postfix, from userid 1000) id 51C8D35230FE; Thu, 26 Nov 2020 15:54:40 -0800 (PST) Date: Thu, 26 Nov 2020 15:54:40 -0800 From: "Paul E. McKenney" To: Dexuan Cui Cc: "boqun.feng@gmail.com" , Ingo Molnar , "rcu@vger.kernel.org" , vkuznets , Michael Kelley , "linux-kernel@vger.kernel.org" Subject: Re: kdump always hangs in rcu_barrier() -> wait_for_completion() Message-ID: <20201126235440.GT1437@paulmck-ThinkPad-P72> Reply-To: paulmck@kernel.org References: <20201126154630.GR1437@paulmck-ThinkPad-P72> <20201126214226.GS1437@paulmck-ThinkPad-P72> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.9.4 (2018-02-28) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Thu, Nov 26, 2020 at 10:59:19PM +0000, Dexuan Cui wrote: > > From: Paul E. McKenney > > Sent: Thursday, November 26, 2020 1:42 PM > > > > > > Another possibility is that rcu_state.gp_kthread is non-NULL, but that > > > > something else is preventing RCU grace periods from completing, but in > > > > > > It looks like somehow the scheduling is not working here: in rcu_barrier() > > > , if I replace the wait_for_completion() with > > > wait_for_completion_timeout(&rcu_state.barrier_completion, 30*HZ), the > > > issue persists. > > > > Have you tried using sysreq-t to see what the various tasks are doing? > > Will try it. > > BTW, this is a "Generation 2" VM on Hyper-V, meaning sysrq only starts to > work after the Hyper-V para-virtualized keyboard driver loads... So, at this > early point, sysrq is not working. :-( I'll have to hack the code and use a > virtual NMI interrupt to force the sysrq handler to be called. Whatever works! > > Having interrupts disabled on all CPUs would have the effect of disabling > > the RCU CPU stall warnings. > > Thanx, Paul > > I'm sure the interrupts are not disabled. Here the VM only has 1 virtual CPU, > and when the hang issue happens the virtual serial console is still responding > when I press Enter (it prints a new line) or Ctrl+C (it prints ^C). > > Here the VM does not use the "legacy timers" (PIT, Local APIC timer, etc.) at all. > Instead, the VM uses the Hyper-V para-virtualized timers. It looks the Hyper-V > timer never fires in the kdump kernel when the hang issue happens. I'm > looking into this... I suspect this hang issue may only be specific to Hyper-V. Fair enough, given that timers not working can also suppress RCU CPU stall warnings. ;-) Thanx, Paul