Received: by 2002:a05:6358:7058:b0:131:369:b2a3 with SMTP id 24csp9014494rwp; Wed, 19 Jul 2023 20:40:31 -0700 (PDT) X-Google-Smtp-Source: APBJJlE5VAxNbLgMXqNxgrgvVWQ7Q7wq7tzjLXfIfv3yCz3j3AXqnfYoF6zq6tWfEDcW/X/tH2gL X-Received: by 2002:a05:622a:190c:b0:403:f31a:804 with SMTP id w12-20020a05622a190c00b00403f31a0804mr5548279qtc.14.1689824431565; Wed, 19 Jul 2023 20:40:31 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689824431; cv=none; d=google.com; s=arc-20160816; b=Z8SIDQgH/HsRQNh3cKjAzmBE+5DCz6AJT8UazDZXpQ3BUNbmWO1I0BdVn3ZA3+MY2u t/2qGDCx4CQHeNfJn+6PGlkqxpS4nqcHNEw4gZyDRh+6O7ndavSlu1J1c952IOIzMl50 ZTko3k10/Xw01UTyhprQEHQ2g7A9JPt/obVzQwBP5xhmKsQwGsVUizOiJY/UZ4I3jWWx BNUrjhQdjls8V8/4rBc7WEcwYuLb1eSCwaEEKgvwnS+ikKClCzdIfsZNVsLhsjGW1Bbb 2oXhIccjBAtmNeeggqlvI27e4vQ1EsMu91RkLAnC0Nw4mNYx8di9dKMmrklxjcwZepZJ a7vw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:user-agent:in-reply-to:content-transfer-encoding :content-disposition:mime-version:references:message-id:subject:cc :to:from:date; bh=D+H6dzi2BnulBFsv2dervcqUqPdOxzUBHcVXiBRj8mQ=; fh=vFtAxG/7TEOsmzzLZz+mF52M/XW+jtkPNkqXo0TUJpQ=; b=bWvt6xfbYDxMep1LGMCUPJYt+d3Y3qMEm8AOXJqyzo+zGGE0TBmGXMEMQSwnyxwKuo L+yfpD3+izJ/K9jTS7Y0N3EWW3EMsvdgu6Ak7YffhggrriDRtHoOCQoRgh5oyb5zozge nUom2xVCd7XxfKDXwbDp6rLbxEKHw7QAnFN5vpGxGlk+PyxwgDCCpHdjEpKYgnVPA2cr BKFiK1wh/p+ppc4RqxXrGq+ahBn/8ADfQxfmHmaxVZIcAsgS48/QEoiYZDRCTt5vkPaz V2ATVffAeKTm3h/0Nz3qISY2FTAoW3qOufVmfkzVIBxuLgMaXA2UiT5mgonFUZ0rSiIX hpZw== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id x24-20020a63f718000000b0055b57fa6d19si4774955pgh.208.2023.07.19.20.40.19; Wed, 19 Jul 2023 20:40:31 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229818AbjGTDJq (ORCPT + 99 others); Wed, 19 Jul 2023 23:09:46 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:38476 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229463AbjGTDJp (ORCPT ); Wed, 19 Jul 2023 23:09:45 -0400 X-Greylist: delayed 366 seconds by postgrey-1.37 at lindbergh.monkeyblade.net; Wed, 19 Jul 2023 20:09:42 PDT Received: from lobo.ruivo.org (lobo.ruivo.org [173.14.175.98]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id B1E811739 for ; Wed, 19 Jul 2023 20:09:42 -0700 (PDT) Received: by lobo.ruivo.org (Postfix, from userid 1011) id 66E9252C72; Wed, 19 Jul 2023 23:03:35 -0400 (EDT) X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net X-Spam-Level: X-Spam-Status: No, score=-1.9 required=5.0 tests=BAYES_00,SPF_HELO_NONE, SPF_NONE,T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 Received: from jake.ruivo.org (bob.qemu.ruivo [192.168.72.19]) by lobo.ruivo.org (Postfix) with ESMTPA id 87ADF529EE; Wed, 19 Jul 2023 23:03:17 -0400 (EDT) Received: by jake.ruivo.org (Postfix, from userid 1000) id 6356F220029; Wed, 19 Jul 2023 23:03:17 -0400 (EDT) Date: Wed, 19 Jul 2023 23:03:17 -0400 From: Aristeu Rozanski To: Tony Luck Cc: rostedt@goodmis.org, linux-kernel@vger.kernel.org Subject: Re: rasdaemon broke between v6.0 and v6.3? Message-ID: <20230720030317.GC94963@cathedrallabs.org> References: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/2.2.9 (2022-11-12) Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, Jul 19, 2023 at 04:35:54PM -0700, Tony Luck wrote: > [resend as plain text - sorry for the earlier HTML] > > An internal team is seeing tests that worked on v6.0 fail on v6.3. The problem is that > rasdaemon isn’t waking up to process the “mce_record” trace events. > > Manually checking for them works: > > root@R-4251:/sys/kernel/debug/tracing>systemctl stop rasdaemon > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing>echo 1 > events/mce/mce_record/enable > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing>cat trace_pipe > <...>-235 [000] ..... 596.892583: mce_record: CPU: 0, MCGc/s: f000c15/0, MC13: 8c00004200800090, IPID: 0000000000000000, ADDR/MISC/SYND: 0000000123450000/08000a80c2982086/0000000000000000, RIP: 00:<0000000000000000>, TSC: 14120b051a1, PROCESSOR: 0:c06f1, TIME: 1689802780, SOCKET: 0, APIC: 0 > kworker/0:2-235 [000] ..... 597.204343: mce_record: CPU: 0, MCGc/s: f000c15/0, MC255: 9c0000000000009f, IPID: 0000000000000000, ADDR/MISC/SYND: 0000000123450000/000000000000008c/0000000000000000, RIP: 00:<0000000000000000>, TSC: 0, PROCESSOR: 0:c06f1, TIME: 1689802781, SOCKET: 0, APIC: 0 > > So their tests are injecting errors, and the trace event is firing. > > Is there some updated version of rasdaemon needed? > > Some kernel CONFIG option problem? Looks like you're hitting the issue this commit is supposed to fix: http://git.infradead.org/users/mchehab/rasdaemon.git/commit/6986d818e6d2c846c001fc7211b5a4153e5ecd11 -- Aristeu