Return-Path: Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S1753716Ab0H0ALN (ORCPT ); Thu, 26 Aug 2010 20:11:13 -0400 Received: from smtp-03.mandic.com.br ([200.225.81.143]:34079 "EHLO smtp-03.mandic.com.br" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1752540Ab0H0ALK (ORCPT ); Thu, 26 Aug 2010 20:11:10 -0400 Message-ID: <4C770299.6000708@cesarb.net> Date: Thu, 26 Aug 2010 21:11:05 -0300 From: Cesar Eduardo Barros User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.8) Gecko/20100806 Fedora/3.1.2-1.fc13 Thunderbird/3.1.2 MIME-Version: 1.0 To: Joe Perches CC: Jesse Barnes , Matthew Garrett , platform-driver-x86@vger.kernel.org, linux-kernel@vger.kernel.org Subject: Re: [PATCH] intel_ips: quieten "power or thermal limit exceeded" messages References: In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit Sender: linux-kernel-owner@vger.kernel.org List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Content-Length: 2832 Lines: 76 Em 26-08-2010 20:33, Joe Perches escreveu: > On Thu, 26 Aug 2010, Cesar Eduardo Barros wrote: > >> On my Dell Inspiron N4010, one of these messages is printed every five >> seconds. Change both to dev_dbg to quieten them even more. > > I think you should instead fix your hardware or maybe change > your thermal throttling settings. How can I know if this laptop is broken or not? According to Jesse's reply, the BIOS lowered the limit, which could explain why it hits the limit more often: intel ips 0000:00:1f.6: Warning: CPU TDP doesn't match expected value (found 25, expected 35) The thermal throttling seems to be also managed by the BIOS (and there are no settings about it on the BIOS setup that I remember): CPU: Physical Processor ID: 0 CPU: Processor Core ID: 0 mce: CPU supports 9 MCE banks CPU0: Thermal monitoring handled by SMI using mwait in idle threads. [...] CPU0: Intel(R) Core(TM) i3 CPU M 330 @ 2.13GHz stepping 02 Booting Node 0, Processors #1 CPU1: Thermal monitoring handled by SMI #2 CPU2: Thermal monitoring handled by SMI #3 Ok. CPU3: Thermal monitoring handled by SMI Brought up 4 CPUs And ACPI is of no help, it shows two thermal zones, one always showing 27 C, the other always showing 0 C (and even though that is below the reported thresholds of 55 C and 71 C for the fans, the fan still speeds up and down on its own, showing that it is just the reporting that is broken). The fan does slow down to almost nothing (or even off) when idle and spins up to a strong hot breeze when compiling the kernel (make -j8), so the thermal monitoring on the BIOS seems to be working fine. I would expect broken hardware to run the fan on high speed all the time (not cooling enough) or low speed all the time (thermal sensor broken). The coretemp module (which is not autoloaded) seems to be more helpful; it shows between 43000 and 45000 for temp1_input on both cores when idle, going up to 50000-56000 when lightly loaded (I have not looked at it yet while doing a heavy compile). Is there a way to know if all this is just an oddness of this model, or if there is something which is not working quite right? (All the output above is from 2.6.35.3; I am not running 2.3.36-rc2+ right now because it hangs on resume, and I have not yet had the time to look at it.) > >> @@ -607,7 +607,7 @@ static bool mcp_exceeded(struct ips_driver *ips) >> - dev_info(&ips->dev->dev, >> + dev_dbg(&ips->dev->dev, >> "MCP power or thermal limit exceeded\n"); > > -- Cesar Eduardo Barros cesarb@cesarb.net cesar.barros@gmail.com -- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.kernel.org More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/