Received: by 2002:a89:48b:0:b0:1f5:f2ab:c469 with SMTP id a11csp1403387lqd; Thu, 25 Apr 2024 14:46:21 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCVEqquuuEyA5qK5hzAw8trKQtqBS9EHgCbAvIjMHCkoprvcHI4EIFe/BzhO2A4YB0kdF88phoAfF1sx0X09Yi5h/nxnXkt5lsa/gM+hLw== X-Google-Smtp-Source: AGHT+IFGvn62uhoTWU9TRPjAFHXVJw8xTzjTnTac00mEmuyLP0eaCtuA+oY1UyycZzt0OEedrtAE X-Received: by 2002:a05:6a20:f394:b0:1a3:55d2:1489 with SMTP id qr20-20020a056a20f39400b001a355d21489mr918133pzb.7.1714081581503; Thu, 25 Apr 2024 14:46:21 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1714081581; cv=pass; d=google.com; s=arc-20160816; b=lrYthtpqLx4qSFbjly6AVfOR5vLU97jM4DbwskvhBJIceORSe3I/oQ0BsmJuwPOdzm VA26EGg9mtZg958CY+e8KQ8AEcQQHoDh40B1/ITfAuVh3ymY1/Y82S0OIBVtLGimOfEd EGkDXzipingfYoMUY+oOZC+70AESh6ZlQgRLQjbh6BvIEl2oUO+DcHBDfV2d1aGTt+eX 5DVlU1ktAhU03ya9JSMi939eVWbjP1W/btNJ2sQZe5xidkCDTz5Fsd8QaxSbuWiYSQnU NKPcEM3ATu/5OnXjswevzOW40hRX6B6D08wCB2AuSqpW+r+zQNGA/MMLZUoZ+fgvGBEk J8zQ== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=mime-version:list-unsubscribe:list-subscribe:list-id:precedence :message-id:date:references:in-reply-to:subject:cc:to:dkim-signature :dkim-signature:from; bh=UaaNtzca06bXu38h5O1TFgrZCt/zSA1O6qAPUMjaY6M=; fh=Yq9XNnTVecDx9sqhC0GBlFd0uGzJi6MUaKyn94/j9a8=; b=wiKPEcWTwirYeGuGuBfNKFnEqdCNGN2cmGXUFebZ5maswgD/0l4frRvQlsigKeJ5wI JBMZX8GGOkJDt8qhsaj7RcSWHEYj7Zo3rIQy5KDxtJ0nBUfO1awmYHjWPmeg3brMSPn8 F+SRkP0pMJsm2QfboV7Otm+FntzdTzCkd4kCrCwAuJ2GXqrAc8ovYJpIhAqr8IKzcApo WpaJfNwYjFJMnksRlTOC4vg0Zp6gT/ZEMSq4OOFaBDQkQcOxiLphnlg0dSPxzA8S+HlI eYKwEw7gg7A4FpPbD7W80bPCz6x6fOwA8eZt3X1eU15Qv6ze+UnwiypvYFGpd9KKT6Ic xWMA==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=YzlkcV+6; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e; arc=pass (i=1 spf=pass spfdomain=linutronix.de dkim=pass dkdomain=linutronix.de dmarc=pass fromdomain=linutronix.de); spf=pass (google.com: domain of linux-kernel+bounces-159230-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-159230-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Return-Path: Received: from sy.mirrors.kernel.org (sy.mirrors.kernel.org. [2604:1380:40f1:3f00::1]) by mx.google.com with ESMTPS id jc12-20020a056a006c8c00b006eae83fa41asi13405749pfb.151.2024.04.25.14.46.20 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 25 Apr 2024 14:46:21 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-159230-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) client-ip=2604:1380:40f1:3f00::1; Authentication-Results: mx.google.com; dkim=pass header.i=@linutronix.de header.s=2020 header.b=YzlkcV+6; dkim=neutral (no key) header.i=@linutronix.de header.s=2020e; arc=pass (i=1 spf=pass spfdomain=linutronix.de dkim=pass dkdomain=linutronix.de dmarc=pass fromdomain=linutronix.de); spf=pass (google.com: domain of linux-kernel+bounces-159230-linux.lists.archive=gmail.com@vger.kernel.org designates 2604:1380:40f1:3f00::1 as permitted sender) smtp.mailfrom="linux-kernel+bounces-159230-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=QUARANTINE dis=NONE) header.from=linutronix.de Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by sy.mirrors.kernel.org (Postfix) with ESMTPS id 3B26DB239CB for ; Thu, 25 Apr 2024 21:42:54 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 0908D15573B; Thu, 25 Apr 2024 21:42:48 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=linutronix.de header.i=@linutronix.de header.b="YzlkcV+6"; dkim=permerror (0-bit key) header.d=linutronix.de header.i=@linutronix.de header.b="dkQ9IDDJ" Received: from galois.linutronix.de (Galois.linutronix.de [193.142.43.55]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 47B8A1FC4 for ; Thu, 25 Apr 2024 21:42:44 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=193.142.43.55 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1714081366; cv=none; b=HW5PWEMTHo1HokOxjgLT23NgOa52BCu+FXkrCzGylUegvviSysY88wxAysvEuTN1bGG7wOF2vWuyBtsa5GD9OTVaJw6Ki2n98dBMoiOl6LvY+ScXxZhmD4QftPLKdRzMYxvYVDMDbrEt6UYeSB5NeUV5QQPeFmkG3E/lYHHdp8Q= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1714081366; c=relaxed/simple; bh=FzKdYEKLWGjvznZ3ZLuDkvQAtXkLbCroU6soGsSlzR0=; h=From:To:Cc:Subject:In-Reply-To:References:Date:Message-ID: MIME-Version:Content-Type; b=e1oS01Lip6pWOTVRcbuGyBU3upIpQHlY++y0dV/2EdnlUe4tLvMCRRkeqrzZ2k1bqzG9zl7k4D2xidv4zMQ+zFOiil3oFpSuylwaJ+OVTVynieKUTkvrMEA+z92Yigf0QjUbJOzijpmQCkkp0A7qDwvkbvBipR8+LB72QdI4tjo= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linutronix.de; spf=pass smtp.mailfrom=linutronix.de; dkim=pass (2048-bit key) header.d=linutronix.de header.i=@linutronix.de header.b=YzlkcV+6; dkim=permerror (0-bit key) header.d=linutronix.de header.i=@linutronix.de header.b=dkQ9IDDJ; arc=none smtp.client-ip=193.142.43.55 Authentication-Results: smtp.subspace.kernel.org; dmarc=pass (p=none dis=none) header.from=linutronix.de Authentication-Results: smtp.subspace.kernel.org; spf=pass smtp.mailfrom=linutronix.de From: Thomas Gleixner DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020; t=1714081362; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=UaaNtzca06bXu38h5O1TFgrZCt/zSA1O6qAPUMjaY6M=; b=YzlkcV+6L6ZX239UTura25dxrYpZPQZycb5B17QuIDYa6kBKSVtZROxzyROma61VLWqNbB zmg7tCAa7XnxeVwqgh/8tLSf8RHtOq0Fd059HG5uBnajRHn+pBMetzo2w2ibmQuwYOsx+O K4lvCtcAB/d7vPCCKVfOJ+YEwPujmTacCFFcn+1OvGiSvSCR1GbHhas/O0mL+9yaGhWKf4 LQftUF6/TscPJJitEmpScAS+1oatppyF7Z3cbHMEccIFoGXL7PSDgqouu37cSwQyzDq5ys F0sW0EGhKAYHTIUx2m1FTigy8RSyuPtWrOKgk9zLKHd66DBv65e4QfCHVYRl7w== DKIM-Signature: v=1; a=ed25519-sha256; c=relaxed/relaxed; d=linutronix.de; s=2020e; t=1714081362; h=from:from:reply-to:subject:subject:date:date:message-id:message-id: to:to:cc:cc:mime-version:mime-version:content-type:content-type: in-reply-to:in-reply-to:references:references; bh=UaaNtzca06bXu38h5O1TFgrZCt/zSA1O6qAPUMjaY6M=; b=dkQ9IDDJsCfXJu08NkfefRN/tpZBaw571q1v2C9M7EKRXUeDuPgeM/QDqmbkbLMITu/4ZQ lhEWHog3Pz7AG0Cw== To: Lyude Paul , Borislav Petkov Cc: x86@kernel.org, linux-kernel@vger.kernel.org Subject: Re: Early boot regression from f0551af0213 ("x86/topology: Ignore non-present APIC IDs in a present package") In-Reply-To: <3b1d16e357c1f9badeef405366492f05af26c085.camel@redhat.com> References: <3d77cb89857ee43a9c31249f4eab7196013bc4b4.camel@redhat.com> <20240418082703.GCZiDZVyra7qOQbyqn@fat_crate.local> <87plumxz4x.ffs@tglx> <87le59vw1y.ffs@tglx> <3a0afe545747e5314a9cb6bbaa9ce90b259ddfac.camel@redhat.com> <87edautcmz.ffs@tglx> <3b1d16e357c1f9badeef405366492f05af26c085.camel@redhat.com> Date: Thu, 25 Apr 2024 23:42:13 +0200 Message-ID: <878r11t8zu.ffs@tglx> Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 Content-Type: text/plain Lyude! On Thu, Apr 25 2024 at 11:56, Lyude Paul wrote: > On Thu, 2024-04-25 at 04:11 +0200, Thomas Gleixner wrote: >> >> Can you please boot a kernel with the commit in question reverted and >> add 'possible_cpus=8' to the kernel command line? >> >> In theory this should fail too. > > Yep - tried booting a kernel with f0551af0213 reverted and > possible_cpus=8, it definitely looks like that crashes things as well > in the same way. Good. That means it's a problem which existed before but went unnoticed. > Also - it scrolled off the screen before I had a chance to write it > down, but I'm -fairly- sure I saw some sort of complaint about "16 [or > some double digit number] processors exceeds max number of 8". Which > is quite interesting, as this is definitely just a quad core ryzen > processor with hyperthreading - so there should only be 8 threads. Right, that's what we saw with the debug patch. The ACPI/MADT table is clearly bonkers. The effect of it is that it pretends that the system has 16 possible CPUs: [ 0.089381] CPU topo: Allowing 8 present CPUs plus 8 hotplug CPUs Which in turn changes the sizing of the per CPU data and affects some other details which depend on the number of possible CPUs. But that should not matter at all because the system scaling should be sufficient with 8 CPUs, but it does not for some completely non-obvious reasons. Can you please try to increase possible_cpus=N on the command line one by one and check when it actually starts to "work" again. One other thing to try is to boot with 'possible_cpus=8' and 'intremap=off' and see whether that makes a difference. I really have no idea where to look and not having the early boot messages in case of the fail is not helpful as I can't add meaningful debug to it. I just checked: the motherboard has a serial port, so it would be extremly helpful to hook up a serial cable to this thing and enable serial console on the kernel command line. That way we might eventually see information which is emitted before it fails to validate the timer interrupt. Thanks, tglx