Received: by 2002:a05:6358:51dd:b0:131:369:b2a3 with SMTP id 29csp365963rwl; Wed, 9 Aug 2023 16:08:41 -0700 (PDT) X-Google-Smtp-Source: AGHT+IHHg4xuuQAsjoxkyBVAUAzi8snH1PBGIrOmR98t2XTn+9LfKMSl8oS4LeHTDmw5ecTprQhd X-Received: by 2002:a17:90b:e0b:b0:268:38f5:86ac with SMTP id ge11-20020a17090b0e0b00b0026838f586acmr548398pjb.24.1691622520931; Wed, 09 Aug 2023 16:08:40 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1691622520; cv=none; d=google.com; s=arc-20160816; b=x31qZZnXUt6a7/BZVsLarydV0OkQIq0TtAaIOS0DgJ1Liw9N/DbJYhBPcGv4QXMhGU GwP3qRg8S1iYRrggkMlK5EiqitSB79r1ntKDjr1nrGvUs5pfI4C2V2WBgRcXuU9DtzO/ czVADAoQrxOExhta5pzZUY1ko/GkQmKH4r6XTk9nJiE6mh0JW48axvZWVLcT1GEQAi3E oPEB5/T3LfOeqqNAGhiFrh5f2lG8uchfRtRTDS8mX2Vsv0jBpdcrzt6Fv0uTxStX+S4i 8S2Ba5zgauJEOV8q3boej4DQmbbMi3cSPy+7HNsfJEkcHdAHJuDYD8RyVVpwu+s+BfBm L5Vw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:sender:dkim-signature; bh=hfnCZjZmqBGVlOLZIrSAWs+Ngt6gotXhjV95+8R8uJU=; fh=X7dpwKSNP083sUxNdGMK7RB+4qzqCnCL253wxKQkKsU=; b=EBsARd9yT0eaylRwl7UUMT6aydc7r+0nAJ5vOW+FkQ3b+mXHN5rXKO4jUb89/m2eXQ gLdmgVGslDuZSSCCCokqFC3J2wQWmmBd0ZIu1+nZXNx1Hv02tg8WPFWb/NgQ534r2Nn/ 6688XQDx8ePQphLZY9/oPcdhBTTC0UP7oHAeQfiijays7/25Rv/drAlfRbzSnvOsUs2G DpiYyAMRWPUxtWMzF4VfRkaeOGPpXBDKd2DDw2DgkkJboi5WPSFRKtYMvKfGLNFT2IW3 jPAijf4hC2jXVtZdUAFI+g9zs0XzIf4JBJtB8w4QIvo4BgjBNFXnp4Q8uziWE9qamMfD R5JQ== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=Pu99zN9J; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id pj1-20020a17090b4f4100b0025c1d114af0si235230pjb.93.2023.08.09.16.08.28; Wed, 09 Aug 2023 16:08:40 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; dkim=pass header.i=@gmail.com header.s=20221208 header.b=Pu99zN9J; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S232713AbjHIVpw (ORCPT + 99 others); Wed, 9 Aug 2023 17:45:52 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:54598 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229641AbjHIVpt (ORCPT ); Wed, 9 Aug 2023 17:45:49 -0400 Received: from mail-il1-x134.google.com (mail-il1-x134.google.com [IPv6:2607:f8b0:4864:20::134]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 2C4611BFA; Wed, 9 Aug 2023 14:45:48 -0700 (PDT) Received: by mail-il1-x134.google.com with SMTP id e9e14a558f8ab-34983226e16so1015475ab.2; Wed, 09 Aug 2023 14:45:48 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20221208; t=1691617547; x=1692222347; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :sender:from:to:cc:subject:date:message-id:reply-to; bh=hfnCZjZmqBGVlOLZIrSAWs+Ngt6gotXhjV95+8R8uJU=; b=Pu99zN9JXgQjGw6QvoXR38L2RZRrdYFNgjW0Y2IQTk2t+htTFCaqhTRSXVV80s9VpT qh66vnPK7tPOvRdtcGrI8SW9vRfhIt4CuGEf7y26B/EGnOJTmAehBedgHjECOS/tcJXY yXOk3s+ltfhdO2O8JO4RibKvXZNM/YwxU8hrGr+mILYFFM2kvA6LdvDENtzGu7BuiEge LR8skysU1fg/rveJQB8EDXJH8TOJr+r3J7jontqOZ7YxlMKtWFgk6Zzj0Ra9DVIRwzq2 pkc6OPlXZWhpDEY1MG8enk7+zjLd8y9jd3ugEbaO3HBESFoKVXEi0WBQdLuEdMCDs26F LMTg== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20221208; t=1691617547; x=1692222347; h=content-transfer-encoding:in-reply-to:from:references:cc:to :content-language:subject:user-agent:mime-version:date:message-id :sender:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=hfnCZjZmqBGVlOLZIrSAWs+Ngt6gotXhjV95+8R8uJU=; b=a1sUlK//QHTmEKD0xICES4UUz51XHiG4+qZ08MHUvkp6430rQYdi3naSIE0q2l/Qmc xs/BC2p80F/dQwFQECW8DgMC4nNXtAU/wFae+VLMwHqf8xmDETsfIEhMVASYyU1TOyL+ ghaORAGWHTQWuuCpfzZbWrfa6BXkhCVzHU2jbqoXkGLpFGra/djJ7/65QKRFNgb37aKW GAFmZiCFn6UCg2H9SQlVZhZ+IphFmcfX9iBbV/XYsdnoww1hZY1AVjN1vLQN9waWTgsS 91Y0La/YfneESiGqghBfKJVup1AYf/XhjUYX4aE1hbvNWw8F2Jef2wWoeOgAnpvD4ltF pEUA== X-Gm-Message-State: AOJu0YzshVFa/IG9FY0FT56e7mb3mTEOsso0xj2Dc7BJhjnpqpDz4IZP /Kjc002k9eV7/0gs4cGEFpA= X-Received: by 2002:a05:6e02:b43:b0:345:d3bc:8882 with SMTP id f3-20020a056e020b4300b00345d3bc8882mr468043ilu.24.1691617547384; Wed, 09 Aug 2023 14:45:47 -0700 (PDT) Received: from ?IPV6:2600:1700:e321:62f0:329c:23ff:fee3:9d7c? ([2600:1700:e321:62f0:329c:23ff:fee3:9d7c]) by smtp.gmail.com with ESMTPSA id o17-20020a92dad1000000b00345840d442csm4391063ilq.66.2023.08.09.14.45.44 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Wed, 09 Aug 2023 14:45:46 -0700 (PDT) Sender: Guenter Roeck Message-ID: <4dbe72a3-50ea-051c-96ba-d709b33d3a98@roeck-us.net> Date: Wed, 9 Aug 2023 14:45:44 -0700 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.13.0 Subject: Re: [PATCH 5.15 00/92] 5.15.126-rc1 review Content-Language: en-US To: Joel Fernandes Cc: Greg Kroah-Hartman , stable@vger.kernel.org, patches@lists.linux.dev, linux-kernel@vger.kernel.org, torvalds@linux-foundation.org, akpm@linux-foundation.org, shuah@kernel.org, patches@kernelci.org, lkft-triage@lists.linaro.org, pavel@denx.de, jonathanh@nvidia.com, f.fainelli@gmail.com, sudipm.mukherjee@gmail.com, srw@sladewatkins.net, rwarsow@gmx.de, conor@kernel.org, paulmck@kernel.org References: <20230809103633.485906560@linuxfoundation.org> <20230809135326.GE3031656@google.com> <35e4b770-2ead-4a19-ad01-fa75996adef4@roeck-us.net> <20230809201413.GA3374446@google.com> <6b05a082-41a7-f0cf-c0a4-1cced8d5a230@roeck-us.net> From: Guenter Roeck In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.1 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_EF,FREEMAIL_ENVFROM_END_DIGIT, FREEMAIL_FORGED_FROMDOMAIN,FREEMAIL_FROM,HEADER_FROM_DIFFERENT_DOMAINS, NICE_REPLY_A,RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS, URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On 8/9/23 13:39, Joel Fernandes wrote: > On Wed, Aug 9, 2023 at 4:38 PM Guenter Roeck wrote: >> >> On 8/9/23 13:14, Joel Fernandes wrote: >>> On Wed, Aug 09, 2023 at 12:25:48PM -0700, Guenter Roeck wrote: >>>> On Wed, Aug 09, 2023 at 02:35:59PM -0400, Joel Fernandes wrote: >>>>> On Wed, Aug 9, 2023 at 12:18 PM Guenter Roeck wrote: >>>>>> >>>>>> On 8/9/23 06:53, Joel Fernandes wrote: >>>>>>> On Wed, Aug 09, 2023 at 12:40:36PM +0200, Greg Kroah-Hartman wrote: >>>>>>>> This is the start of the stable review cycle for the 5.15.126 release. >>>>>>>> There are 92 patches in this series, all will be posted as a response >>>>>>>> to this one. If anyone has any issues with these being applied, please >>>>>>>> let me know. >>>>>>>> >>>>>>>> Responses should be made by Fri, 11 Aug 2023 10:36:10 +0000. >>>>>>>> Anything received after that time might be too late. >>>>>>>> >>>>>>>> The whole patch series can be found in one patch at: >>>>>>>> https://www.kernel.org/pub/linux/kernel/v5.x/stable-review/patch-5.15.126-rc1.gz >>>>>>>> or in the git tree and branch at: >>>>>>>> git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-stable-rc.git linux-5.15.y >>>>>>>> and the diffstat can be found below. >>>>>>> >>>>>>> Not necesscarily new with 5.15 stable but 3 of the 19 rcutorture scenarios >>>>>>> hang with this -rc: TREE04, TREE07, TASKS03. >>>>>>> >>>>>>> 5.15 has a known stop machine issue where it hangs after 1.5 hours with cpu >>>>>>> hotplug rcutorture testing. Me and tglx are continuing to debug this. The >>>>>>> issue does not show up on anything but 5.15 stable kernels and neither on >>>>>>> mainline. >>>>>>> >>>>>> >>>>>> Do you by any have a crash pattern that we could possibly use to find the crash >>>>>> in ChromeOS crash logs ? No idea if that would help, but it could provide some >>>>>> additional data points. >>>>> >>>>> The pattern shows as a hard hang, the system is unresponsive and all CPUs >>>>> are stuck in stop_machine. Sometimes it recovers on its own from the >>>>> hang and then RCU immediately gives stall warnings. It takes 1.5 hour >>>>> to reproduce and sometimes never happens for several hours. >>>>> >>>>> It appears related to CPU hotplug since gdb showed me most of the CPUs >>>>> are spinning in multi_cpu_stop() / stop machine after the hang. >>>>> >>>> >>>> Hmm, we do see lots of soft lockups with multi_cpu_stop() in the backtrace, >>>> but not with v5.15.y but with v5.4.y. The actual hang is in stop_machine_yield(). >>> >>> Interesting. It looks similar as far as the stack dump in gdb goes, here are >>> the stacks I dumped with the hang I referred to: >>> https://paste.debian.net/1288308/ >>> >> >> That link gives me "Entry not found". > > Yeah that was weird. Here it is again: https://pastebin.com/raw/L3nv1kH2 I found a couple of crash reports from chromeos-5.10, one of them complaining about RCU issues. I sent you links via IM. Nothing from 5.15 or later, though. Guenter