Date: Tue, 16 Mar 2021 19:48:19 +0300 (MSK)
From: Alexander Monakov
To: Adam Borowski
cc: Jiri Olsa, Borislav Petkov, Tom Lendacky, Peter Zijlstra, x86@kernel.org, lkml, Alexander Shishkin, Arnaldo Carvalho de Melo, Stanislav Kozina, Michael Petlan, Pierre Amadio, onatalen@redhat.com, darcari@redhat.com
Subject: Re: unknown NMI on AMD Rome
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 16 Mar 2021, Adam Borowski wrote:

> On Tue, Mar 16, 2021 at 04:45:02PM +0100, Jiri Olsa wrote:
> > hi,
> > when running 'perf top' on AMD Rome (/proc/cpuinfo below)
> > with fedora 33 kernel 5.10.22-200.fc33.x86_64
> >
> > we got unknown NMI messages:
> >
> > [ 226.700160] Uhhuh. NMI received for unknown reason 3d on CPU 90.
> > [ 226.700162] Do you have a strange power saving mode enabled?
> > [ 226.700163] Dazed and confused, but trying to continue
> >
> > also when discussing this with Borislav, he managed to reproduce easily
> > on his AMD Rome machine
>
> Likewise, 3c on Pinnacle Ridge.

I've also seen it on Renoir, and it appears to be related to a PMU interrupt
racing against C-state entry/exit. Disabling C2 and C3 via 'cpupower' is
enough to avoid those NMIs in my case.

IIRC there were a few patches related to this area from AMD in the past.

Alexander
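For anyone who wants to try the C-state workaround without 'cpupower', here is a rough sketch that does the same thing through the cpuidle sysfs files (cpuN/cpuidle/stateM/name and .../disable). The function name and the state names "C2" and "C3" are assumptions for illustration; the names a given machine exposes depend on the cpuidle driver, so check the name files first. Writing to the real sysfs tree needs root.

```python
from pathlib import Path

# Default location of the per-CPU cpuidle directories on Linux.
SYSFS_CPU = Path("/sys/devices/system/cpu")


def set_idle_states_disabled(names, disabled=True, root=SYSFS_CPU):
    """Set the `disable` flag of every cpuidle state whose name is in `names`.

    `names` is a set of state names as reported by the `name` files
    (assumed here to be e.g. {"C2", "C3"}; verify on the target machine).
    Returns the list of `disable` files that were written.
    """
    changed = []
    # "cpu[0-9]*" matches cpu0, cpu1, ... but not cpuidle/ or cpufreq/.
    for state_dir in sorted(root.glob("cpu[0-9]*/cpuidle/state[0-9]*")):
        name = (state_dir / "name").read_text().strip()
        if name in names:
            flag = state_dir / "disable"
            flag.write_text("1" if disabled else "0")
            changed.append(flag)
    return changed


# Example (as root): set_idle_states_disabled({"C2", "C3"})
# To undo:           set_idle_states_disabled({"C2", "C3"}, disabled=False)
```

With cpupower itself the equivalent would be 'cpupower idle-set -d <index>' once per state, where the index is whatever 'cpupower idle-info' lists for that state on the machine in question.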