Message-ID: <24c4ac84671b97b5092413689b4bf224b73bc51b.camel@linux.intel.com>
Subject: Re: [PATCH 3/3] x86/mce/therm_throt: allow disabling the thermal vector altogether
From: Srinivas Pandruvada
To: "Jason A. Donenfeld"
Cc: LKML, linux-edac@vger.kernel.org, X86 ML, Arnd Bergmann, bberg@redhat.com, bp@suse.de
Date: Tue, 14 Apr 2020 12:58:04 -0700
References: <20200407063345.4484-1-Jason@zx2c4.com> <20200407063345.4484-3-Jason@zx2c4.com> <0e189a4fe1e69b08afc859ce83623a0e5ea0c08b.camel@linux.intel.com> <4b75ec34ccff5abdc0b1c04a5ac39455ddd4f49b.camel@linux.intel.com>
X-Mailing-List: linux-kernel@vger.kernel.org

On Tue, 2020-04-14 at 13:41 -0600, Jason A. Donenfeld wrote:
> On Tue, Apr 14, 2020 at 8:45 AM Srinivas Pandruvada wrote:
> > On Mon, 2020-04-13 at 22:21 -0600, Jason A. Donenfeld wrote:
> > > On Mon, Apr 13, 2020 at 9:38 PM Srinivas Pandruvada wrote:
> > > > On Tue, 2020-04-07 at 00:33 -0600, Jason A. Donenfeld wrote:
> > > > > The thermal IRQ handler uses 1.21% CPU on my system when it's hot
> > > > > from compiling things. Indeed looking at /proc/interrupts reveals
> > > > > quite a lot
> > > > I am curious why you are hitting threshold frequently?
> > > > What is rdmsr 0x1a2
> > >
> > > 5640000
> > You are getting too many interrupts at 95C. You should look at your
> > cooling system.
> >
> > > > > of events coming in. Beyond logging them, the existing drivers on
> > > > > the system don't appear to do very much that I'm interested in.
> > > > > So, add a way to disable this entirely so that I can regain
> > > > > precious CPU cycles.
> > > > It is showing amount of time system is running in a constrained
> > > > environment. Lots of real time and HPC folks really care about
> > > > this.
> > >
> > > Which is why this patch adds an option, not a full removal or
> > > something. Real time and HPC people can keep their expensive
> > > interrupt.
> > > Other people with different varieties of system disable it.
> > Generally compile time flag is not desirable. If it is what required
> > then we should have boot time flag something in lines of existing
> > "int_pln_enable" option.
>
> Generally it is desirable, and extremely common too. This thermal code
> -- which mostly functions to print some messages into kmsg -- is very
> verbose. This is not something I want to compile into smaller systems.
> This is the reason why kconfig has options in the first place. I'm not
> sure yet-another boottime flag makes sense for this.

Can you send a log that still shows verbose prints with the latest
kernel?

I can see that interrupts will still fire. If they do, then the
temperature trend is still above 95C and the cooling system is not in
control.

In another window, print in a loop (with sleep 1)
/sys/class/thermal/thermal_zone*/temp
for the zone whose "type" is "x86_pkg_temp".
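The polling loop requested above could be sketched roughly as follows. This is only an illustration, assuming Python 3 and the usual sysfs thermal layout in which each thermal_zone* directory has a "type" file and a "temp" file reporting millidegrees Celsius. The function names (find_zones, read_temp_c, poll) and the "samples" and "base" parameters are made up for this sketch and are not part of any kernel tool.

```python
import glob
import os
import time


def find_zones(zone_type, base="/sys/class/thermal"):
    """Return thermal_zone* directories whose 'type' file equals zone_type."""
    matches = []
    for zone in sorted(glob.glob(os.path.join(base, "thermal_zone*"))):
        try:
            with open(os.path.join(zone, "type")) as f:
                if f.read().strip() == zone_type:
                    matches.append(zone)
        except OSError:
            continue  # zone disappeared or is unreadable; skip it
    return matches


def read_temp_c(zone):
    """Read the zone's 'temp' file (millidegrees C) and return degrees C."""
    with open(os.path.join(zone, "temp")) as f:
        return int(f.read().strip()) / 1000.0


def poll(zone_type="x86_pkg_temp", interval=1.0, samples=None,
         base="/sys/class/thermal"):
    """Print one reading per matching zone every `interval` seconds.

    samples=None loops until interrupted; an integer stops after that
    many rounds (hypothetical convenience for short captures).
    """
    zones = find_zones(zone_type, base)
    if not zones:
        raise SystemExit(f"no thermal zone of type {zone_type!r} under {base}")
    taken = 0
    while samples is None or taken < samples:
        stamp = time.strftime("%H:%M:%S")
        for zone in zones:
            print(f"{stamp} {os.path.basename(zone)} {read_temp_c(zone):.1f} C")
        taken += 1
        if samples is None or taken < samples:
            time.sleep(interval)
```

Calling poll() with no arguments prints the package temperature once per second until interrupted with Ctrl-C; poll(samples=60) would capture roughly a minute of readings to paste into a reply.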