Received: by 2002:a05:7412:37c9:b0:e2:908c:2ebd with SMTP id jz9csp2301481rdb; Thu, 21 Sep 2023 14:42:46 -0700 (PDT) X-Google-Smtp-Source: AGHT+IGdBnkyjwGYUseVzAldSbWH12iUzQ5AnmElWQHS9KB67896sMILdNNQhqBnum8b3tkQbZk9 X-Received: by 2002:aa7:9055:0:b0:68f:c309:9736 with SMTP id n21-20020aa79055000000b0068fc3099736mr6661479pfo.3.1695332566368; Thu, 21 Sep 2023 14:42:46 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1695332566; cv=none; d=google.com; s=arc-20160816; b=BERwoQ5Dcqv7SSB5gNQjmoprlPUm9922URwl/OHZ1J3ZGOoiE289+4MkZ7t8TWfDHj ztYCGVTbwy8rauX02iKOfcgkOWRtwlVqd7y3hiXxRkdv9vMPdMxvaws2JTmMTmP0oP4L 3cA/ppmkypD2IJHj+t7M1AyXwc5V27lJHIPo/qa15K39HV6WnkVEz4gnIEt7hPpVhI/N G/yhjkmzqFtdvuWmgA6x7LsLSK9+srI7DTsIXINjWKK8ZZkLpchofF0MtpQIdozpYZ9Q gYd+GgUXcOSepoaFtQxeCyf3wMAWbZ91OyUUs9cjCONy1kGkxJaregwKUjspuneb+PaS byFw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :content-language:references:cc:to:subject:user-agent:mime-version :date:message-id; bh=6uwFNrmwcg4PlviIO5C96tICUwcgSBRw90RtXrMkL6Y=; fh=qq7cjzEzLPi0i/Wte0mKfuL1CJRlo4lXveAR0CconmU=; b=FNU076LVwF6xa53WGwfQ6X3VjcBuvhWhPl/YYmefE3tLyLgE4g5HmdVdCtpWk/RsNP 0ZuCQj7fferhDh3aRKkVr3jhVYH2drij/PtXh2ktW9fYR0tJd+IfddCV+zTt6IPuXuBD C9N/JpyAikAeHH/MVK50Mv4+nh5uuveRBjJXDE2WKcl4StWfg3zWMssLPMrma/TD2t0/ jtRYWwUPs+uHznPnUk2Y38W3QZAN0CR7nYWf7ex9NrTOfpdkffbUN6yjsLkHzLobFt/5 l1B7F5VqCNYvVkuMpjTUzkffL3KvpyZfy+ASC4aUpfVZT8hKLHtCO5YGsQqKcL62tL22 g4uQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from howler.vger.email (howler.vger.email. [23.128.96.34]) by mx.google.com with ESMTPS id z5-20020aa78885000000b0068e45c6ca3fsi2509748pfe.93.2023.09.21.14.42.46 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 21 Sep 2023 14:42:46 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) client-ip=23.128.96.34; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.34 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: from out1.vger.email (depot.vger.email [IPv6:2620:137:e000::3:0]) by howler.vger.email (Postfix) with ESMTP id 777FF80A1670; Thu, 21 Sep 2023 12:41:31 -0700 (PDT) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.103.10 at howler.vger.email Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229785AbjIUTl0 (ORCPT + 99 others); Thu, 21 Sep 2023 15:41:26 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:36632 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229780AbjIUTlZ (ORCPT ); Thu, 21 Sep 2023 15:41:25 -0400 Received: from wp530.webpack.hosteurope.de (wp530.webpack.hosteurope.de [80.237.130.52]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 0E661E58 for ; Thu, 21 Sep 2023 12:41:19 -0700 (PDT) Received: from [2a02:8108:8980:2478:8cde:aa2c:f324:937e]; authenticated by wp530.webpack.hosteurope.de running ExIM with esmtpsa (TLS1.3:ECDHE_RSA_AES_128_GCM_SHA256:128) id 1qjEll-0000d8-Jk; Thu, 21 Sep 2023 10:10:41 +0200 Message-ID: <2a298145-b6ec-4b4e-bd45-0cddf62b6052@leemhuis.info> Date: Thu, 21 Sep 2023 10:10:39 +0200 MIME-Version: 1.0 User-Agent: Mozilla Thunderbird Subject: Re: what to do on no reproducer case? (was Re: Fwd: Uhhuh. NMI received for unknown reason 3d/2d/ on CPU xx) To: Bagas Sanjaya Cc: Linux Kernel Mailing List , Linux Regressions References: <0e6de51b-da1d-43f0-ffae-fb521febbe07@gmail.com> Content-Language: en-US, de-DE From: Thorsten Leemhuis In-Reply-To: Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 7bit X-bounce-key: webpack.hosteurope.de;regressions@leemhuis.info;1695325279;d42e2885; X-HE-SMSGID: 1qjEll-0000d8-Jk X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00, RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE,SPF_PASS,URIBL_BLOCKED autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org X-Greylist: Sender passed SPF test, not delayed by milter-greylist-4.6.4 (howler.vger.email [0.0.0.0]); Thu, 21 Sep 2023 12:41:31 -0700 (PDT) On 20.09.23 02:27, Bagas Sanjaya wrote: > [addressing to Thorsten] > > On Sat, Sep 02, 2023 at 07:20:55AM +0700, Bagas Sanjaya wrote: >> I notice a regression report on Bugzilla [1]. Quoting from it: >> >>> seems to be a regression since 6.5 release: >>> the infamous error message from the kernel on this 32c/64t threadripper: >>>> [ 2046.269103] perf: interrupt took too long (3141 > 3138), lowering >>>> kernel.perf_event_max_sample_rate to 63600 >>>> [ 2405.049567] Uhhuh. NMI received for unknown reason 2d on CPU 48. >>>> [ 2405.049571] Dazed and confused, but trying to continue >>>> [ 2406.902609] Uhhuh. NMI received for unknown reason 2d on CPU 33. >>>> [ 2406.902612] Dazed and confused, but trying to continue >>>> [ 2423.978918] Uhhuh. NMI received for unknown reason 2d on CPU 33. >>>> [ 2423.978921] Dazed and confused, but trying to continue > [...] >>> according to dmesg, this happens without any special reason (I didn't even notice) >>> some googling points at a ACPI C state problem on AMD CPUs a few years ago >>> in 5.14 kernels, I didn't see it. >> >> See Bugzilla for the full thread. >> >> Anyway, I'm adding this regression to be tracked by regzbot: >> >> #regzbot introduced: v6.4..v6.5 https://bugzilla.kernel.org/show_bug.cgi?id=217857 > > This regression looks stalled: on Bugzilla, the reporter keeps asking to me, > for which I'm not the expert of involved subsystem. And apparently, he still > had not any reproducer yet (is it triggered by random chance?). Should I > mark this as inconclusive? Yes, without a reliable bisection result there sometimes is not much we can do -- apart from prodding various developers directly and asking for help or an idea. But in this case that's not worth it afaics, as messages like https://lore.kernel.org/all/e08e33d5-4f6d-91aa-f335-9404d16a983c@amd.com/ indicate that it might be a hardware problem and not really a regression. Hence: #regzbot resolve: inconclusive: not bisected and might be a hardware problem after all Ciao, Thorsten (wearing his 'the Linux kernel's regression tracker' hat) -- Everything you wanna know about Linux kernel regression tracking: https://linux-regtracking.leemhuis.info/about/#tldr If I did something stupid, please tell me, as explained on that page.