Received: by 2002:a25:ad19:0:0:0:0:0 with SMTP id y25csp3219105ybi; Mon, 29 Jul 2019 03:00:05 -0700 (PDT) X-Google-Smtp-Source: APXvYqy60g9NuRab2mCwuMxTwQDLKE6828N52GA/URD2sNds4O1Ih5dAsQnwjKvCDCVZLYPyzDSq X-Received: by 2002:a17:902:a5c7:: with SMTP id t7mr110689190plq.288.1564394404926; Mon, 29 Jul 2019 03:00:04 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1564394404; cv=none; d=google.com; s=arc-20160816; b=AoOGIMB50A2V64sTqtxW+IptBITDYgsAbRYft07/PLAllQtANUO3LnUBO40oMEypGz 0iCO5c+LxErJFQ4yglGBAa2WIqbx8DlyIIQ8kf7ZarX4fTLwYQYudlj0ThRecnzMKf2N n1HIGCqAZflVfDuVCdNcy8nI4vAN97LbSci99IguS5r63cubpwN7SeHYlxooD0N9/LkG 4tW2w7J308dEFk5ERbUbPbirw3S+iPGGyePN1f+9/RmvYLO+xf/H9uwQxPDIgkDWc9Oz 6zAtmg93vmV3hbqe5xrvJHftfac7f8r1XwxEYsbpcWilMPUsafF3QkGQ5GjnpbcZbi9I FYpA== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:sender:mime-version:user-agent:references :message-id:in-reply-to:subject:cc:to:from:date; bh=25IITtiBOJJhJ3Zj9Y+tWxfhwAXcjfef5vVZdI2iOlM=; b=lhnOIyOH9tFJDauWAA9nUcxOVqR7Im9FHxZXwOv5ekZwTqaMDKTC/WnqCi53VRuD2T Jcd+MzepaB6iIBV7/pI0E867zoxNNubwH1YCdYfvOGNH9rot4JSiprLApHxX2SI8PYnR W/JfJL6GbV9i17n3sw3fjRn73ecWiyQinCIB2rg+UdwIiDCaDxA+ocrfe6EThiOWpp9c hPhl/zewzn9nW1uE1XzJmCFwNWZfYE5vU2Z2WQIh4eGoiXFxAmE0kHZx1fDtbV483m0k 8sKRz9f1DsRyHR3z6Z3tx1YYdP1/ygo0EdcrDnH+s98YfFlbub51X2ODSKV0ibavNRmc hwLA== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from vger.kernel.org (vger.kernel.org. [209.132.180.67]) by mx.google.com with ESMTP id w17si29027220pfj.69.2019.07.29.02.59.49; Mon, 29 Jul 2019 03:00:04 -0700 (PDT) Received-SPF: pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) client-ip=209.132.180.67; Authentication-Results: mx.google.com; spf=pass (google.com: best guess record for domain of linux-kernel-owner@vger.kernel.org designates 209.132.180.67 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1728151AbfG2J6h (ORCPT + 99 others); Mon, 29 Jul 2019 05:58:37 -0400 Received: from Galois.linutronix.de ([193.142.43.55]:53103 "EHLO Galois.linutronix.de" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1726470AbfG2J6h (ORCPT ); Mon, 29 Jul 2019 05:58:37 -0400 Received: from pd9ef1cb8.dip0.t-ipconnect.de ([217.239.28.184] helo=nanos) by Galois.linutronix.de with esmtpsa (TLS1.2:DHE_RSA_AES_256_CBC_SHA256:256) (Exim 4.80) (envelope-from ) id 1hs2QD-0006V0-KF; Mon, 29 Jul 2019 11:58:25 +0200 Date: Mon, 29 Jul 2019 11:58:24 +0200 (CEST) From: Thomas Gleixner To: Peter Zijlstra cc: Guenter Roeck , x86@kernel.org, Ingo Molnar , linux-kernel@vger.kernel.org, Borislav Petkov Subject: Re: sched: Unexpected reschedule of offline CPU#2! In-Reply-To: <20190729093545.GV31381@hirez.programming.kicks-ass.net> Message-ID: References: <20190727164450.GA11726@roeck-us.net> <20190729093545.GV31381@hirez.programming.kicks-ass.net> User-Agent: Alpine 2.21 (DEB 202 2017-01-01) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII X-Linutronix-Spam-Score: -1.0 X-Linutronix-Spam-Level: - X-Linutronix-Spam-Status: No , -1.0 points, 5.0 required, ALL_TRUSTED=-1,SHORTCIRCUIT=-0.0001 Sender: linux-kernel-owner@vger.kernel.org Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Mon, 29 Jul 2019, Peter Zijlstra wrote: > On Sat, Jul 27, 2019 at 09:44:50AM -0700, Guenter Roeck wrote: > > [ 61.348866] Call Trace: > > [ 61.349392] kick_ilb+0x90/0xa0 > > [ 61.349629] trigger_load_balance+0xf0/0x5c0 > > [ 61.349859] ? check_preempt_wakeup+0x1b0/0x1b0 > > [ 61.350057] scheduler_tick+0xa7/0xd0 > > kick_ilb() iterates nohz.idle_cpus_mask to find itself an idle_cpu(). > > idle_cpus_mask() is set from nohz_balance_enter_idle() and cleared from > nohz_balance_exit_idle(). nohz_balance_enter_idle() is called from > __tick_nohz_idle_stop_tick() when entering nohz idle, this includes the > cpu_is_offline() clause of the idle loop. > > However, when offline, cpu_active() should also be false, and this > function should no-op. Ha. That reboot mess is not clearing cpu active as it's not going through the regular cpu hotplug path. It's using reboot IPI which 'stops' the cpus dead in their tracks after clearing cpu online.... Thanks, tglx