Date: Sat, 7 Sep 2019 22:12:52 +0200 (CEST)
From: Thomas Gleixner
To: Chris Wilson
Cc: Linus Torvalds, Linux List Kernel Mailing, Bandan Das
Subject: Re: Linux 5.3-rc7

On Sat, 7 Sep 2019, Chris Wilson wrote:
> Quoting Thomas Gleixner (2019-09-07 16:00:17)
> > Does this only happen with that CPU0 hotplug stuff enabled or on CPUs other
> > than CPU0 as well? That hotplug CPU0 stuff is a bandaid so I wouldn't be
> > surprised if we broke that somehow.
>
> If I ignore cpu0 in that test and so use
>
> [ 133.847187] smpboot: CPU 1 is now offline
> [ 134.861861] x86: Booting SMP configuration:
> [ 134.861875] smpboot: Booting Node 0 Processor 1 APIC 0x2
> [ 134.880218] smpboot: CPU 2 is now offline
> [ 135.893806] smpboot: Booting Node 0 Processor 2 APIC 0x1
> [ 135.935115] smpboot: CPU 3 is now offline
> [ 136.949760] smpboot: Booting Node 0 Processor 3 APIC 0x3
>
> that has run for 10 minutes without failure, so it seems confined to
> cpu0 hotplugging. All we are doing in the test to generate the hotplugs
> is:

Right, but you also have that config bit enabled which allows CPU0 hotplug
which usually is off even in testing and that's why nobody noticed so far.

So I looked at that code and I know why it's broken. I guess we'll end up
reverting that commit for now as fixing it proper will be not just a one
liner.

Thanks for providing all the information!

	tglx