Received: by 2002:ab2:6816:0:b0:1f9:5764:f03e with SMTP id t22csp1146757lqo; Fri, 17 May 2024 12:07:30 -0700 (PDT) X-Forwarded-Encrypted: i=3; AJvYcCV5cYZXBxaHHFnYSUsRYeV+K8j37KV6fB/E93MdWKpZ3r9Fv4PoZxgLuURXMSRdlkKCPaVcge5Mp82K26iAv5WTXUSpGQnAqZwQ7JvObw== X-Google-Smtp-Source: AGHT+IG7HviWgdb/gpI0nKjIC6Qwhqm9soUAAmxxT4RbPWWRWX52IQkOoQJBplNse/dQxbTKLe1F X-Received: by 2002:a50:8d4e:0:b0:571:bed1:3a27 with SMTP id 4fb4d7f45d1cf-5734d6f2d33mr16311284a12.38.1715972850217; Fri, 17 May 2024 12:07:30 -0700 (PDT) ARC-Seal: i=2; a=rsa-sha256; t=1715972850; cv=pass; d=google.com; s=arc-20160816; b=NIwAj6CDDO65WzkTzoWNWlswb+0b6WBva+fJTRefW5Recr5rSJY/JTt4azrrML5nMo zp8i5D7MHXmsPtpcXRHz6sJvx1gKs2fTt4mzoR/3qnltreemkiGEu1/KpjHfvB2fyev1 C2DVNzezBLvXLuS71LoIqT2alBg/uEffVzY413S8+SHPUI0jg5UpPBDpMLehBRgL/gts l6VYjeQ1EadtYZxrNNk9E44uf9pLgR/PHoE4YlChViiTNzjlQVYq9Ic1PV9wLApmKwrv Ycn6QaQLC/xEdwvOm7L1yfNR0CkRXAIkNtxZDPdM0gxMYta6VmwZL5DoWUChUsTfAmLf Xfzw== ARC-Message-Signature: i=2; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=content-transfer-encoding:cc:to:subject:message-id:date:from :in-reply-to:references:mime-version:list-unsubscribe:list-subscribe :list-id:precedence:dkim-signature; bh=ESul9fXdOZviKhsyrfFKBITgTaaOWzDkRdGDygsrF7U=; fh=ICFGtJ260xaNdExRtzrTdKAVb4rv0e71NxW+le7Z6as=; b=bA9P+lvXSu7BOx7GTMaoE88vHDBZ0IrYw6SZuRaDe1Vr24EMbPrFoVyiSgbrYg5VFW zJ9Noz9wTgsWSUamnMtfoerryn3NoakPu6Wti0g8i0Ld1MXyDRSYmi5vTsgAox332snz cB1hZVhnXQ1u/zFnycbtXhilmPq+SSCXpKKkJ1BNs9ZHSu92i5Ob7RuE2IgqngK8+m3p kr0reCzuGbozVovNHWVbNGZc2WJqWyTWPgPwwwa2lXvsk8ynMnxD+iFWQpFkCqzX9632 ofKZfgainS5DTrUj7aaZRKFhQGGaHSTiKOYmOu0pCEF77a6fV+q3KXCmbnZF1RB8JVgr D0dA==; dara=google.com ARC-Authentication-Results: i=2; mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=TD3YDl+t; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-182533-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-182533-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Return-Path: Received: from am.mirrors.kernel.org (am.mirrors.kernel.org. [147.75.80.249]) by mx.google.com with ESMTPS id 4fb4d7f45d1cf-5733bec09aasi10369730a12.155.2024.05.17.12.07.30 for (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Fri, 17 May 2024 12:07:30 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel+bounces-182533-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) client-ip=147.75.80.249; Authentication-Results: mx.google.com; dkim=pass header.i=@kernel.org header.s=k20201202 header.b=TD3YDl+t; arc=pass (i=1 dkim=pass dkdomain=kernel.org); spf=pass (google.com: domain of linux-kernel+bounces-182533-linux.lists.archive=gmail.com@vger.kernel.org designates 147.75.80.249 as permitted sender) smtp.mailfrom="linux-kernel+bounces-182533-linux.lists.archive=gmail.com@vger.kernel.org"; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=kernel.org Received: from smtp.subspace.kernel.org (wormhole.subspace.kernel.org [52.25.139.140]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by am.mirrors.kernel.org (Postfix) with ESMTPS id F0CD31F24573 for ; Fri, 17 May 2024 19:01:19 +0000 (UTC) Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by smtp.subspace.kernel.org (Postfix) with ESMTP id 8005413E41B; Fri, 17 May 2024 19:00:46 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b="TD3YDl+t" Received: from smtp.kernel.org (aws-us-west-2-korg-mail-1.web.codeaurora.org [10.30.226.201]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (No client certificate requested) by smtp.subspace.kernel.org (Postfix) with ESMTPS id 9591713E414; Fri, 17 May 2024 19:00:45 +0000 (UTC) Authentication-Results: smtp.subspace.kernel.org; arc=none smtp.client-ip=10.30.226.201 ARC-Seal:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1715972445; cv=none; b=ENpfjET6et0HN8yOfCdAMySWPYopu7+A87LQdoydZHSfxdTTYriYpjDv5C2cd7yJFw/z/6175depQ6x3th3WcrT4ew0ljrLGTz0yVRfV+Xfeav3xWfZx9fmol8U/x500+VFBzf0NR3JYijY3nK6y6cVCzh5dP9wE7urF5npZVT0= ARC-Message-Signature:i=1; a=rsa-sha256; d=subspace.kernel.org; s=arc-20240116; t=1715972445; c=relaxed/simple; bh=ADnkPt6I97GHoVi8MXMcPa3bF85S6IMqkxYA1vtcreo=; h=MIME-Version:References:In-Reply-To:From:Date:Message-ID:Subject: To:Cc:Content-Type; b=Dh7yDNJp7utO3etS/+UlC6tfvgFa1BJORkfzVhPPsgw8YjMti7LgjvXmE+o6lXmyj7ldWaWjvnrmDYMikz96gitTpo/THKa5YnZ1lrZNZi7YQJsaM+53bZNCAIe6DFP4lvHcr9vqpQHGuF+oXUC0Iv/0M8u3fZRhQVp8bQirKW0= ARC-Authentication-Results:i=1; smtp.subspace.kernel.org; dkim=pass (2048-bit key) header.d=kernel.org header.i=@kernel.org header.b=TD3YDl+t; arc=none smtp.client-ip=10.30.226.201 Received: by smtp.kernel.org (Postfix) with ESMTPSA id 1C91BC4AF0B; Fri, 17 May 2024 19:00:45 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=kernel.org; s=k20201202; t=1715972445; bh=ADnkPt6I97GHoVi8MXMcPa3bF85S6IMqkxYA1vtcreo=; h=References:In-Reply-To:From:Date:Subject:To:Cc:From; b=TD3YDl+tH9Qq1XpiA5U7xKZ86nvTAj0TK4kPw6UjQHrIix512K9cs2WG1BN+U6xcy 189qHR0xfnHAmg1cIqWn1OMR1733ImnBPScO2fLyivkAi+G64Xrjzwpy+O2U3/mNUp I6qrVo+5e99XGkcVanOkKP7HL7BvJ9FO8YJ8crIRxGNl3eDFMD1IEIKxjyEMrSnUF1 qkDfSWV0XveBhLWc6To03E6W0PpivejQtWqzpx29t88fb5IsBRJz+W6WFFm1fQm7yO eAcEwAKtGFW3zbFNsF3NyqpOo0PId7cimuv+S2JS42aTnO2flYKac1QNWy9yna+Zf8 65xAIziICuTzA== Received: by mail-oi1-f178.google.com with SMTP id 5614622812f47-3c992e5c821so299619b6e.2; Fri, 17 May 2024 12:00:45 -0700 (PDT) X-Forwarded-Encrypted: i=1; AJvYcCXODlLLWal98CsdD8gGya0EbkoVj3kH0KMHgAy59UP3dFsDJSj+Zjzvlt99ETWh1zUmCA0NMiV9PYg01VQ5qQ+IVJ0p1k63WJj8Oe0JKUqWdMdlhbrvfyPGMLf5Vhpg3zEK+fV1iNU= X-Gm-Message-State: AOJu0Yy4YTulQI0cQ5f2O7mdd8LbLDscQEQ1/LxVMY7gHdi5ulw+mgb3 LlJvD0OephGyZ1HnGVKrTp07f2Jm43wZv9F68FIkQ6zOXa8Qf/7xhCAedxEFOLP54pP4pSjImHG wZKjt3kaXrdpRnbooG8zI5WGmSPo= X-Received: by 2002:a05:6808:148f:b0:3c8:4227:4fb0 with SMTP id 5614622812f47-3c997056b60mr26057516b6e.2.1715972444363; Fri, 17 May 2024 12:00:44 -0700 (PDT) Precedence: bulk X-Mailing-List: linux-kernel@vger.kernel.org List-Id: List-Subscribe: List-Unsubscribe: MIME-Version: 1.0 References: <87o79cjjik.fsf@kernel.org> <20240511184847.GCZj-9j2sh1Akpt9iS@fat_crate.local> <20240511184945.GDZj-9yaOEWqf1ng8u@fat_crate.local> <87h6f4jdrq.fsf@kernel.org> <878r0djxgc.fsf@kernel.org> <874jb0jzx5.fsf@kernel.org> <20240514160555.GCZkOL41oB3hBt45eO@fat_crate.local> <87msoofjg1.fsf@kernel.org> <35086bb6-ee11-4ac6-b8ba-5fab20065b54@intel.com> <871q60ffnr.fsf@kernel.org> In-Reply-To: <871q60ffnr.fsf@kernel.org> From: "Rafael J. Wysocki" Date: Fri, 17 May 2024 21:00:33 +0200 X-Gmail-Original-Message-ID: Message-ID: Subject: Re: [regression] suspend stress test stalls within 30 minutes To: Kalle Valo Cc: Dave Hansen , Borislav Petkov , Pawan Gupta , Thomas Gleixner , Ingo Molnar , Dave Hansen , "Rafael J. Wysocki" , x86@kernel.org, linux-pm@vger.kernel.org, linux-kernel@vger.kernel.org, regressions@lists.linux.dev, Jeff Johnson Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable On Fri, May 17, 2024 at 8:37=E2=80=AFPM Kalle Valo wrote= : > > Dave Hansen writes: > > > On 5/17/24 10:15, Kalle Valo wrote: > >> Borislav Petkov writes: > >>> There might be some #GP or so in the logs in case we've managed to f*= ck > >>> up microcode application which emulates that IBRS MSR bit and the > >>> actual toggling or so when suspending... > >> So the weird part is that when the bug happens (ie. suspend stalls) I > >> can access the box normally using ssh and I don't see anything special > >> in dmesg. Below is a full copy of dmesg output after the suspend > >> stalled. Do note that I copied this dmesg before I updated microcode s= o > >> it will still show the old microcode version. > >> > >> Let me know if you need more info. > > > > Kalle, could you remind us what we're seeing here? Does this show 30 > > working rtcwake tests followed by a failure at "rtcwake test 31" where > > the system failed to suspend? > > Correct. So basically what I do is that I start the nuc box, ssh into it > and run: > > sudo su > for i in {1..400}; do echo "rtcwake test $i" > /dev/kmsg; rtcwake -m mem = -s 10; sleep 10; done > > Here's the start of first loop: > > [ 54.945105] rtcwake test 1 > [ 55.162603] PM: suspend entry (deep) > [ 55.168875] Filesystems sync: 0.006 seconds > [ 55.182427] Freezing user space processes > [ 55.191498] Freezing user space processes completed (elapsed 0.008 sec= onds) > [ 55.191711] OOM killer disabled. > [ 55.191805] Freezing remaining freezable tasks > [ 55.193507] Freezing remaining freezable tasks completed (elapsed 0.00= 1 seconds) > [ 55.194056] printk: Suspending console(s) (use no_console_suspend to d= ebug) > [ 55.244962] e1000e: EEE TX LPI TIMER: 00000011 > > Now I leave the box to run it's test. I come back later to see that the > for loop has stalled and the box is not going into suspend gain. I ssh > into the machine and see this in dmesg: > > [ 449.061525] rtcwake test 31 > [ 449.176854] PM: suspend entry (deep) > [ 449.179072] Filesystems sync: 0.002 seconds This means that ksys_sync_helper() has run, so it blocks somewhere in enter_state() around suspend_prepare(). Can please echo 1 (as root) to /sys/power/pm_debug_messages and retest? This should allow us to see more in the log.