Received: by 2002:a05:6a10:9848:0:0:0:0 with SMTP id x8csp335920pxf; Wed, 17 Mar 2021 06:34:06 -0700 (PDT) X-Google-Smtp-Source: ABdhPJzYC7komE7MtEyzAgNAK3VvvpgokJ4yOxhNMy4l92KXMcZb6jR9k/l1fqhwfQcHkhRs/iaY X-Received: by 2002:aa7:c916:: with SMTP id b22mr42953761edt.299.1615988046136; Wed, 17 Mar 2021 06:34:06 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1615988046; cv=none; d=google.com; s=arc-20160816; b=h/4BWxC2fKXNjU/kuuyrwBHzBLruaYOXS56MNmWdycSRR7lMs6h4gFRTkJz/S9DaMh k7NYb/WgBj3n8pTsrL+jO8N9inGpagncGbAYKF5aSkwKSjBia+wSt/iba7DosTCVlkNa GU3gS12CoMWgVIDNlO2wp72mc89ltW2cjWF/FpT61A/jfhoauIq+VbeI26pvmPmw97tk vwrdofUKdsCQ2vso7P2l9p3Hhjp8D1UqhlZNTE2n1wDeGTIy8+fvnojfzNbtSDMX73Pi CSZJ3+MntJsYAYbDBVizmgoO101Cs4BmU5pdyhQCmwSs0Ua/Jd3XYPSSso0HQAYFzyhm kOuQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:mime-version:user-agent:references:message-id :in-reply-to:subject:cc:to:from:date; bh=QukweoOAnffK9/4kzvXr4BGYoRY9kehoncFPdMclOd0=; b=bPTxL8Jnpv+8Q8pNoSa8fVb6xSOSUbmJwqZeP8rL4jOW/P9es15ZCZoAYVgqY1Sdxr UBrNzed4o8oW6N2VvlgeQ+rJN7Taw85mNhHRwH1GvY1QkqEZghEjgsdHoFqmyscpjHFG kb+xnzQL15kp/nqpo+XYPv5HlcTd3jbb6/6aqcZJTgr2fj8SGtf74Y93R8NRBsd4ZdVL p3y+4hcE6IzsZYpWwk/tt9g77tQwFVAiuUvIpqPLtE2/82vgF5zO3qxjpbxPwqS2809/ 4oWd2w1a1XoUyTDysKxBwgDAPksQUyxMHZYDFRrHSDUuUusEvb+Bh796fvXMShEV7Jzm c7mQ== ARC-Authentication-Results: i=1; mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ispras.ru Return-Path: Received: from vger.kernel.org (vger.kernel.org. [23.128.96.18]) by mx.google.com with ESMTP id do13si16516711ejc.87.2021.03.17.06.33.43; Wed, 17 Mar 2021 06:34:06 -0700 (PDT) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) client-ip=23.128.96.18; Authentication-Results: mx.google.com; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 23.128.96.18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=fail (p=NONE sp=NONE dis=NONE) header.from=ispras.ru Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S230255AbhCQNcg (ORCPT + 99 others); Wed, 17 Mar 2021 09:32:36 -0400 Received: from mail.ispras.ru ([83.149.199.84]:50480 "EHLO mail.ispras.ru" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231186AbhCQNcW (ORCPT ); Wed, 17 Mar 2021 09:32:22 -0400 Received: from monopod.intra.ispras.ru (unknown [10.10.3.121]) by mail.ispras.ru (Postfix) with ESMTPS id C9695407627D; Wed, 17 Mar 2021 13:32:17 +0000 (UTC) Date: Wed, 17 Mar 2021 16:32:17 +0300 (MSK) From: Alexander Monakov To: Peter Zijlstra cc: Ingo Molnar , Kim Phillips , Jiri Olsa , Borislav Petkov , Tom Lendacky , x86@kernel.org, lkml , Alexander Shishkin , Arnaldo Carvalho de Melo , Stanislav Kozina , Michael Petlan , Pierre Amadio , onatalen@redhat.com, darcari@redhat.com, "Rafael J. Wysocki" Subject: Re: unknown NMI on AMD Rome In-Reply-To: Message-ID: References: <36397980-f897-147f-df55-f37805d869c9@amd.com> <20210317084829.GA474581@gmail.com> User-Agent: Alpine 2.20.13 (LNX 116 2015-12-14) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org On Wed, 17 Mar 2021, Peter Zijlstra wrote: > On Wed, Mar 17, 2021 at 09:48:29AM +0100, Ingo Molnar wrote: > > > https://developer.amd.com/wp-content/resources/56323-PUB_0.78.pdf > > > > So: > > > > > > 1215 IBS (Instruction Based Sampling) Counter Valid Value > > May be Incorrect After Exit From Core C6 (CC6) State > > > > Description > > > > If a core's IBS feature is enabled and configured to generate an interrupt, including NMI (Non-Maskable > > Interrupt), and the IBS counter overflows during the entry into the Core C6 (CC6) state, the interrupt may be > > issued, but an invalid value of the valid bit may be restored when the core exits CC6. > > Potential Effect on System > > > > The operating system may receive interrupts due to an IBS counter event, including NMI, and not observe an > > valid IBS register. Console messages indicating "NMI received for unknown reason" have been observed on > > Linux systems. > > > > Suggested Workaround: None > > Fix Planned: No fix planned > > Should be simple enough to disable CC6 while IBS is in use. Kim, can you > please make that happen? Wouldn't that "magically" significantly speed up workloads running under 'perf top', in case they don't saturate the CPUs? Scheduling gets much snappier if the target CPU doesn't need to wake up from deep sleep :) Alternatively, would you consider adding the errata reference to the printk message when IBS is in use, and rate-limit it so it doesn't flood dmesg? Then the user will know what's going on, and may choose to temporarily disable C-states using the 'cpupower' tool. Alexander