Date: Tue, 8 Feb 2022 14:17:03 +0100
From: Frederic Weisbecker
To: Paul Menzel
Cc: Frederic Weisbecker, Thomas Gleixner, Ingo Molnar, LKML, linuxppc-dev@lists.ozlabs.org
Subject: Re: ppc64le: `NOHZ tick-stop error: Non-RCU local softirq work is pending, handler #20!!!` when turning off SMT
Message-ID: <20220208131703.GA538566@lothringen>
In-Reply-To: <0baca95b-771f-2217-1098-2d0eee568ea7@molgen.mpg.de>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, Feb 08, 2022 at 08:32:37AM +0100, Paul Menzel wrote:
> Dear Linux folks,
>
>
> On the POWER8 server IBM S822LC running Ubuntu 21.10, Linux 5.17-rc1+ built
> with
>
>     $ grep HZ /boot/config-5.17.0-rc1+
>     CONFIG_NO_HZ_COMMON=y
>     # CONFIG_HZ_PERIODIC is not set
>     CONFIG_NO_HZ_IDLE=y
>     # CONFIG_NO_HZ_FULL is not set
>     CONFIG_NO_HZ=y
>     # CONFIG_HZ_100 is not set
>     CONFIG_HZ_250=y
>     # CONFIG_HZ_300 is not set
>     # CONFIG_HZ_1000 is not set
>     CONFIG_HZ=250
>
> once warned about a NOHZ tick-stop error, when I executed `sudo
> /usr/sbin/ppc64_cpu --smt=off` (so that KVM would work).

I see, so I assume this sets some CPUs offline, right?
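
One rough way to check that from userspace: the kernel exposes the set of online CPUs as a list string in /sys/devices/system/cpu/online (something like "0-3,8"). The sketch below parses that format. The helper names parse_cpu_list and read_online_cpus are hypothetical, and the sample string is a made-up illustration, not output from the machine in this report.

```python
def parse_cpu_list(text):
    """Hypothetical helper: expand a kernel cpulist string such as '0-3,8'."""
    cpus = []
    for part in text.strip().split(","):
        if not part:
            continue  # an empty field means nothing to add
        if "-" in part:
            lo, hi = part.split("-")
            cpus.extend(range(int(lo), int(hi) + 1))  # ranges are inclusive
        else:
            cpus.append(int(part))
    return cpus


def read_online_cpus(path="/sys/devices/system/cpu/online"):
    """Read and parse the online-CPU list from sysfs (Linux only)."""
    with open(path) as f:
        return parse_cpu_list(f.read())


# Illustrative input only, not taken from the reported system.
print(parse_cpu_list("0-3,8"))
```

Comparing the result of read_online_cpus() before and after running `ppc64_cpu --smt=off` should show whether threads were actually taken offline.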
>
> ```
> $ dmesg
> [    0.000000] Linux version 5.17.0-rc1+
> (pmenzel@flughafenberlinbrandenburgwillybrandt.molgen.mpg.de) (Ubuntu clang
> version 13.0.0-2, LLD 13.0.0) #1 SMP Fri Jan 28 17:13:04 CET 2022
> […]
> [271272.030262] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271272.305726] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271272.549790] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271274.885167] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271275.113896] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271275.412902] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271275.625245] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271275.833107] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271276.041391] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> [271277.244880] NOHZ tick-stop error: Non-RCU local softirq work is pending,
> handler #20!!!
> ```

That's IRQ_POLL_SOFTIRQ. The problem here is probably that some of these
softirqs are pending even though ksoftirqd has been parked.

I see there is irq_poll_cpu_dead() that migrates the pending queue once
the CPU is finally dead, so this is well handled. I'm preparing a patch
to fix the warning.

Thanks.
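
For reference, here is how "handler #20" maps to IRQ_POLL_SOFTIRQ. As far as I can tell, the number in the warning is the pending softirq bitmask printed in hex, so #20 is 0x20, i.e. bit 5. A minimal Python sketch of that decoding follows. It assumes the softirq order from the enum in include/linux/interrupt.h, and the helper name decode_softirq_mask is made up for illustration.

```python
# Softirq names in bit order, mirroring the enum in include/linux/interrupt.h
# (assumed to match the 5.17 kernel discussed in this thread).
SOFTIRQ_NAMES = [
    "HI_SOFTIRQ",
    "TIMER_SOFTIRQ",
    "NET_TX_SOFTIRQ",
    "NET_RX_SOFTIRQ",
    "BLOCK_SOFTIRQ",
    "IRQ_POLL_SOFTIRQ",
    "TASKLET_SOFTIRQ",
    "SCHED_SOFTIRQ",
    "HRTIMER_SOFTIRQ",
    "RCU_SOFTIRQ",
]


def decode_softirq_mask(handler_field):
    """Hypothetical helper: map the hex value after 'handler #' to names."""
    mask = int(handler_field, 16)  # assumption: the field is hexadecimal
    return [name for bit, name in enumerate(SOFTIRQ_NAMES) if mask & (1 << bit)]


print(decode_softirq_mask("20"))  # prints ['IRQ_POLL_SOFTIRQ']
```

If several softirqs were pending at once, the same helper would list all of the set bits.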