Received: by 2002:a05:6358:7058:b0:131:369:b2a3 with SMTP id 24csp9397844rwp; Thu, 20 Jul 2023 04:30:44 -0700 (PDT) X-Google-Smtp-Source: APBJJlGPfwoQLokBdZKU4BtsehgUb91g45wF7mfbq/6QjexEmMKlDOw6gWTunUgXI9MRdX4aTPDo X-Received: by 2002:a17:907:970e:b0:965:6075:d0e1 with SMTP id jg14-20020a170907970e00b009656075d0e1mr5953346ejc.72.1689852644106; Thu, 20 Jul 2023 04:30:44 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1689852644; cv=none; d=google.com; s=arc-20160816; b=A5dKkZTEMGrgYi6+KC/IFcL9Y68KN/cnuJay5g3WvMAbrzFEncyxeG8x7Ku2VJ6QKq ptFOlR/Zem8TCtuZTiRYOQwMi4IUCiViU6HUgcJpxXUPyxO6tZ1OxK3aayrueBLTyQGs bzZPijIQTSgFm89MmTsxPaNz5JTqgQRwSs0ieKUBPl5OmPjHfhsuj3kxtc+EKFpujfqp dyw8j7qh3L3TR84Dlp2hlLmLSBpQGa7SztCMMm+93TgL5/07ONzA6t4h59jR8Y6sp0Vv R08zpWH7LbYzGCzM5eWzJ1Pa/kbuH2v2X38/FGqgSYyHLkePT++AMhO+vm2UpEib0QZV 129Q== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:mime-version :references:in-reply-to:message-id:subject:cc:to:from:date; bh=45QyGBZyJg2cUGM4Rqd8FCMSZyx0JsPzGLM1pEUZy5o=; fh=124WbmmyqHRsRDiaAQN14wzlF/2J3O2udIfCTI4c5/w=; b=P0NyHk51taPAlP/GteHS7YKBQ7mYH++EzhpWns4aQWNvbWLWfGhiY0lP7CCVbUiIKH yQHx6+2nnOsna/PcNVNglExMsAy/2wEIE+lwFueX+Ho9LdHT0XfAg7veQlzWKfuO5AiW 32fQkv6NnnEjWsWphCtuFyHluE9IK+rHdarJMiSx4MQtSom6Kpdx8dj7D8K59hL6CpLX 1wD02ROCLs4xiwxAENoWyFtdJ3lZE5di1X36wTqa45BUYs32WfSu6X9qWPtPHyxVSLkH RZfwR7i9XPKYDADlVuyRPk6RFzIEuFwYNHO9jvIov3w3ni0cxSbHydaBb9bNnbn2UBX4 v3vg== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Return-Path: Received: from out1.vger.email (out1.vger.email. [2620:137:e000::1:20]) by mx.google.com with ESMTP id f27-20020a170906085b00b0099351214a94si532438ejd.648.2023.07.20.04.30.18; Thu, 20 Jul 2023 04:30:43 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) client-ip=2620:137:e000::1:20; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:20 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229704AbjGTKkf convert rfc822-to-8bit (ORCPT + 99 others); Thu, 20 Jul 2023 06:40:35 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:44092 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S229592AbjGTKkc (ORCPT ); Thu, 20 Jul 2023 06:40:32 -0400 Received: from dfw.source.kernel.org (dfw.source.kernel.org [IPv6:2604:1380:4641:c500::1]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id A775510F1 for ; Thu, 20 Jul 2023 03:40:31 -0700 (PDT) Received: from smtp.kernel.org (relay.kernel.org [52.25.139.140]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (2048 bits)) (No client certificate requested) by dfw.source.kernel.org (Postfix) with ESMTPS id 36D46619D6 for ; Thu, 20 Jul 2023 10:40:31 +0000 (UTC) Received: by smtp.kernel.org (Postfix) with ESMTPSA id 0A708C433C7; Thu, 20 Jul 2023 10:40:29 +0000 (UTC) Date: Thu, 20 Jul 2023 06:40:28 -0400 From: Steven Rostedt To: Tony Luck Cc: Aristeu Rozanski , linux-kernel@vger.kernel.org Subject: Re: rasdaemon broke between v6.0 and v6.3? Message-ID: <20230720064028.1aeb3c18@gandalf.local.home> In-Reply-To: References: X-Mailer: Claws Mail 3.19.1 (GTK+ 2.24.33; x86_64-pc-linux-gnu) MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8BIT X-Spam-Status: No, score=-1.7 required=5.0 tests=BAYES_00, HEADER_FROM_DIFFERENT_DOMAINS,RCVD_IN_DNSWL_BLOCKED,SPF_HELO_NONE, SPF_PASS,T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 19 Jul 2023 16:35:54 -0700 Tony Luck wrote: > [resend as plain text - sorry for the earlier HTML] > > An internal team is seeing tests that worked on v6.0 fail on v6.3. The problem is that > rasdaemon isn’t waking up to process the “mce_record” trace events. > > Manually checking for them works: > > root@R-4251:/sys/kernel/debug/tracing>systemctl stop rasdaemon > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing>echo 1 > events/mce/mce_record/enable > root@R-4251:/sys/kernel/debug/tracing> > root@R-4251:/sys/kernel/debug/tracing>cat trace_pipe > <...>-235 [000] ..... 596.892583: mce_record: CPU: 0, MCGc/s: f000c15/0, MC13: 8c00004200800090, IPID: 0000000000000000, ADDR/MISC/SYND: 0000000123450000/08000a80c2982086/0000000000000000, RIP: 00:<0000000000000000>, TSC: 14120b051a1, PROCESSOR: 0:c06f1, TIME: 1689802780, SOCKET: 0, APIC: 0 > kworker/0:2-235 [000] ..... 597.204343: mce_record: CPU: 0, MCGc/s: f000c15/0, MC255: 9c0000000000009f, IPID: 0000000000000000, ADDR/MISC/SYND: 0000000123450000/000000000000008c/0000000000000000, RIP: 00:<0000000000000000>, TSC: 0, PROCESSOR: 0:c06f1, TIME: 1689802781, SOCKET: 0, APIC: 0 > > So their tests are injecting errors, and the trace event is firing. > > Is there some updated version of rasdaemon needed? > > Some kernel CONFIG option problem? > A bug was fixed that I think affected rasdaemon. commit 3e46d910d8acf94e5360126593b68bf4fee4c4a1 Author: Shiju Jose Date: Thu Feb 2 18:23:09 2023 +0000 tracing: Fix poll() and select() do not work on per_cpu trace_pipe and trace_pipe_raw Make sure /sys/kernel/tracing/buffer_percent = 0 -- Steve