Received: by 2002:a05:6358:3188:b0:123:57c1:9b43 with SMTP id q8csp2804305rwd; Wed, 14 Jun 2023 07:33:20 -0700 (PDT) X-Google-Smtp-Source: ACHHUZ7kaSLXJeAGWPg3JHmpSpX4rNpdfmLLzpqumIfKGK1Lq0s0baaLBm/BENUhH5VtRUoYblHl X-Received: by 2002:a17:907:3d93:b0:94f:2a13:4e01 with SMTP id he19-20020a1709073d9300b0094f2a134e01mr15934875ejc.74.1686753200478; Wed, 14 Jun 2023 07:33:20 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1686753200; cv=none; d=google.com; s=arc-20160816; b=iS6mE4ZUJ49VPG/cve4bZkPamAcsnl4+1Z8K9GnZ8rQ2cJmdeyfs2G0U2KHmmI5RFS dDETwRIKPHrok72r84RB2tl6bqpewkvmjrltz/VhddUgCdM7ddCU5szGMgrrtVQw+852 U+daCdIDVcPFRXQE3LkzI+y0IzxNAK7Lj2quNrOxkq4v5b3mShndUE2L0sbArVRj+j2f 7ackzZ3eS/WmEYHNv6yXjU/h+9ufos/iBKN7SRgQopCeQeuuY9kuZ+C/w1BFX4phC+Np xxk43YbjhItEl91s8D9TNTZmqLg2GJS0iYUoscuCA5AJIf22g5sHUdkhFSaQK/YI9E5w dNbg== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=AuOUmQUyB3LBRB4iHogY+py0I5sFH1hn9FMxVrOj9Fk=; b=aENafKnkxFHhBqSJtr6Fr4zWvIPPfYYZxD3kyVgGt4cH0fc7N2k0pqlCs1CY18MWsi 8O/gVkj8ye0nLvz4Sua2xfagltYr+eTIuqZO4nmjv7mdk0tPTxuLuRMbkRkkIDCwqplV RXgdYK2eVyvE+mO52+ixs7nbtM6To07x7VEyKLQlTKTOTWUw8xz/ooSlQhgR5G7GI08/ R4c5+KFnjYXYTQzuKe5Tva7n6mkZ0d9Gx03ubf3lzi5LgAV8WO9rCJ5aYpioHFvP2/x9 MJfkMIpCjZpiLqKfEoIbq4h71gJ0SW5GWDZN/gY1QMOpl1r+fNubBJvFvzkGiOR1w5My 7Vew== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=EkbPFfCB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id l16-20020a1709066b9000b00977cd76373bsi8811884ejr.138.2023.06.14.07.32.55; Wed, 14 Jun 2023 07:33:20 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@amazon.com header.s=amazon201209 header.b=EkbPFfCB; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=QUARANTINE sp=QUARANTINE dis=NONE) header.from=amazon.com Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S245195AbjFNN6O (ORCPT + 99 others); Wed, 14 Jun 2023 09:58:14 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:60246 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S236567AbjFNN6N (ORCPT ); Wed, 14 Jun 2023 09:58:13 -0400 Received: from smtp-fw-9106.amazon.com (smtp-fw-9106.amazon.com [207.171.188.206]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 78F0319A; Wed, 14 Jun 2023 06:58:12 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=amazon.com; i=@amazon.com; q=dns/txt; s=amazon201209; t=1686751093; x=1718287093; h=message-id:date:mime-version:subject:to:cc:references: from:in-reply-to:content-transfer-encoding; bh=AuOUmQUyB3LBRB4iHogY+py0I5sFH1hn9FMxVrOj9Fk=; b=EkbPFfCBwt0a/XAsNBobRlBGh+904mbaD64qjTJcUmjee0snayq9562P F/Y5lxD6ogeU0Q7bd/BpMHA/q8WWxsVtrHcXEWxsgCUt8DbBmlof7n9f2 Qm1rkXSGXA7FK5pIMR60LBrY9WRpduMr8tcPbG+COA/gw354hrnKXbCW+ I=; X-IronPort-AV: E=Sophos;i="6.00,242,1681171200"; d="scan'208";a="654146650" Received: from pdx4-co-svc-p1-lb2-vlan3.amazon.com (HELO email-inbound-relay-pdx-2c-m6i4x-e7094f15.us-west-2.amazon.com) ([10.25.36.214]) by smtp-border-fw-9106.sea19.amazon.com with ESMTP/TLS/ECDHE-RSA-AES256-GCM-SHA384; 14 Jun 2023 13:58:07 +0000 Received: from EX19MTAUEB001.ant.amazon.com (pdx1-ws-svc-p6-lb9-vlan2.pdx.amazon.com [10.236.137.194]) by email-inbound-relay-pdx-2c-m6i4x-e7094f15.us-west-2.amazon.com (Postfix) with ESMTPS id A0FEA415B3; Wed, 14 Jun 2023 13:58:06 +0000 (UTC) Received: from EX19D028UEC003.ant.amazon.com (10.252.137.159) by EX19MTAUEB001.ant.amazon.com (10.252.135.108) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.26; Wed, 14 Jun 2023 13:58:06 +0000 Received: from [10.95.176.26] (10.95.176.26) by EX19D028UEC003.ant.amazon.com (10.252.137.159) with Microsoft SMTP Server (version=TLS1_2, cipher=TLS_ECDHE_RSA_WITH_AES_256_GCM_SHA384) id 15.2.1118.26; Wed, 14 Jun 2023 13:58:04 +0000 Message-ID: Date: Wed, 14 Jun 2023 09:57:56 -0400 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.11.2 Subject: Re: Observing RCU stalls in kernel 5.4/5.10/5.15/6.1 stable trees Content-Language: en-US To: Sven-Haegar Koch CC: Sebastian Andrzej Siewior , "gregkh@linuxfoundation.org" , "Bhatnagar, Rishabh" , "linux-kernel@vger.kernel.org" , "tglx@linutronix.de" , "sashal@kernel.org" , , "stable@vger.kernel.org" , References: <12c6f9a3-d087-b824-0d05-0d18c9bc1bf3@amazon.com> <2023061428-compacter-economic-b648@gregkh> <20230614092045.tNY8USjq@linutronix.de> <4c4178a1-1050-ced4-e6fb-f95c3bdefc98@amazon.com> <2a3fa097-8ba0-5b0e-f506-779fee5b8fef@sdinet.de> From: Luiz Capitulino In-Reply-To: <2a3fa097-8ba0-5b0e-f506-779fee5b8fef@sdinet.de> Content-Type: text/plain; charset="UTF-8"; format=flowed Content-Transfer-Encoding: 7bit X-Originating-IP: [10.95.176.26] X-ClientProxiedBy: EX19D040UWA002.ant.amazon.com (10.13.139.113) To EX19D028UEC003.ant.amazon.com (10.252.137.159) X-Spam-Status: No, score=-2.2 required=5.0 tests=BAYES_00,DKIMWL_WL_HIGH, DKIM_SIGNED,DKIM_VALID,DKIM_VALID_AU,DKIM_VALID_EF,NICE_REPLY_A, RCVD_IN_DNSWL_NONE,SPF_HELO_NONE,SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 2023-06-14 09:45, Sven-Haegar Koch wrote: > > > > On Wed, 14 Jun 2023, Luiz Capitulino wrote: > >> On 2023-06-14 05:20, Sebastian Andrzej Siewior wrote: >> >>> On 2023-06-14 11:14:49 [+0200], gregkh@linuxfoundation.org wrote: >>>> Oops, missed this. >>>> >>>> Yes, there might be, can you do 'git bisect' and track down the patch >>>> that fixed this? >>> >>> There was a report of a lockup during boot in VMs yesterday. If I >>> remember correctly this still exists and might be related to this >>> report. I'm going to have a look. >> >> Thanks, Sebastian. Do you have a link for the discussion? > > May be this, talking about the same commit as cause as this thread: > > Subject: Re: [PATCH] timekeeping: Align tick_sched_timer() with the HZ > tick. -- regression report > https://lore.kernel.org/lkml/5a56290d-806e-b9a5-f37c-f21958b5a8c0@grsecurity.net/ Thank you, Sven. Sebastian, except for the detailed analysis which we haven't done yet, the issue described by Mathias matches 100% what we're observing. Also, we do observe this on bare-metal instances which could mean that the initial reports are against VMs because those are rebooted more often (our quick reproducer boots hundreds of instances in AWS and only 1 or 2 reproduces this). IMHO, I'd suggest we revert this for now from Linus tree and stable trees. We can help testing for the fix maybe for the next merge window. - Luiz > > May not have been the best idea to respond with such big analysis to a 3 > months old dead thread, gets lost extremely easy. > > c'ya > sven-haegar > > -- > Three may keep a secret, if two of them are dead. > - Ben F.