Date: Thu, 8 Mar 2018 11:54:41 +0000
From: Daniel Thompson
To: Pavel Machek
Cc: Daniel Lezcano, edubezval@gmail.com, kevin.wangtao@linaro.org,
 leo.yan@linaro.org, vincent.guittot@linaro.org, amit.kachhap@gmail.com,
 linux-kernel@vger.kernel.org, javi.merino@kernel.org, rui.zhang@intel.com,
 linux-pm@vger.kernel.org, Jonathan Corbet, "open list:DOCUMENTATION"
Subject: Re: [PATCH V2 5/7] thermal/drivers/cpu_cooling: Add idle cooling device documentation
Message-ID: <20180308115441.edb4jtpodj2qut2n@oak.lan>
References: <1519226968-19821-1-git-send-email-daniel.lezcano@linaro.org>
 <1519226968-19821-6-git-send-email-daniel.lezcano@linaro.org>
 <20180306231906.GB28911@amd>
 <84fa8a3c-28bf-41ae-8ed7-9dd348b1cde9@linaro.org>
 <20180308085949.GB17761@amd>
In-Reply-To: <20180308085949.GB17761@amd>
X-Mailing-List: linux-kernel@vger.kernel.org

On Thu, Mar 08, 2018 at 09:59:49AM +0100, Pavel Machek wrote:
> Hi!
>
> > >> +Under certain circumstances, the SoC reaches a temperature exceeding
> > >> +the allocated power budget or the maximum temperature limit. The
> > >
> > > I don't understand. Power budget is in W, temperature is in
> > > kelvin. Temperature can't exceed power budget AFAICT.
> >
> > Yes, it is badly worded. Is the following better ?
> >
> > "
> > Under certain circumstances a SoC can reach the maximum temperature
> > limit or is unable to stabilize the temperature around a temperature
> > control.
> >
> > When the SoC has to stabilize the temperature, the kernel can act on a
> > cooling device to mitigate the dissipated power.
> >
> > When the maximum temperature is reached and to prevent a catastrophic
> > situation a radical decision must be taken to reduce the temperature
> > under the critical threshold, that impacts the performance.
> >
> > "
>
> Actually... if hardware is expected to protect itself, I'd tone it
> down. No need to be all catastrophic and critical... But yes, better.

Makes sense. For a thermally overcommitted but passively cooled device,
working close to its maximum operating temperature is not a critical
situation requiring a radical reaction; it is normal operation.

Put another way, it would be severely bogus to attach KERN_CRIT
messages to reaching the cooling threshold.


Daniel.


> > > Critical here, critical there. I have trouble following
> > > it. Theoretically hardware should protect itself, because you don't
> > > want kernel bug to damage your CPU?
> >
> > There are several levels of protection. The first level is mitigating
> > the temperature from the kernel, then in the temperature sensor a reset
> > line will trigger the reboot of the CPUs. Usually it is a register where
> > you write the maximum temperature, from the driver itself. I never tried
> > to write 1000°C in this register and see if I can burn the board.
> >
> > I know some boards have another level of thermal protection in the
> > hardware itself and some other don't.
> >
> > In any case, from a kernel point of view, it is a critical situation as
> > we are about to hard reboot the system and in this case it is preferable
> > to drop drastically the performance but give the opportunity to the
> > system to run in a degraded mode.
>
> Agreed you want to keep going. In ACPI world, we shutdown when
> critical trip point is reached, so this is somehow confusing.
>
> > >> +Solutions:
> > >> +----------
> > >> +
> > >> +If we can remove the static and the dynamic leakage for a specific
> > >> +duration in a controlled period, the SoC temperature will
> > >> +decrease. Acting at the idle state duration or the idle cycle
> > >
> > > "should" decrease? If you are in bad environment..
> >
> > No, it will decrease in any case because of the static leakage drop. The
> > bad environment will impact the speed of this decrease.
>
> I meant... if ambient temperature is 105C, there's not much you can do
> to cool system down :-).
>
> > >> +Idle Injection:
> > >> +---------------
> > >> +
> > >> +The base concept of the idle injection is to force the CPU to go to an
> > >> +idle state for a specified time each control cycle, it provides
> > >> +another way to control CPU power and heat in addition to
> > >> +cpufreq. Ideally, if all CPUs of a cluster inject idle synchronously,
> > >> +this cluster can get into the deepest idle state and achieve minimum
> > >> +power consumption, but that will also increase system response latency
> > >> +if we inject less than cpuidle latency.
> > >
> > > I don't understand last sentence.
> >
> > Is it better ?
> >
> > "Ideally, if all CPUs, belonging to the same cluster, inject their idle
> > cycle synchronously, the cluster can reach its power down state with a
> > minimum power consumption and static leakage drop. However, these idle
> > cycles injection will add extra latencies as the CPUs will have to
> > wakeup from a deep sleep state."
>
> Extra comma "CPUs , belonging". But yes, better.
>
> > Thanks!
>
> You are welcome. Best regards,
> Pavel
> --
> (english) http://www.livejournal.com/~pavelmachek
> (cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
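
For readers following the idle injection discussion quoted above, here is
a minimal numerical sketch of the idea: a CPU is forced idle for a fixed
share of each control cycle, which lowers the average power at the cost of
extra wake-up latency. This is not the cpu_cooling driver's code. The
function names, the linear power model, and every number below are
assumptions made purely for illustration.

```python
def idle_injection_plan(period_us, idle_ratio):
    """Split one control cycle into (run_us, idle_us).

    period_us  -- length of the control cycle in microseconds (hypothetical)
    idle_ratio -- share of the cycle spent in forced idle, 0 <= ratio < 1
    """
    if not 0.0 <= idle_ratio < 1.0:
        raise ValueError("idle_ratio must be in [0, 1)")
    idle_us = round(period_us * idle_ratio)
    return period_us - idle_us, idle_us


def average_power_mw(run_power_mw, idle_power_mw, idle_ratio):
    """Time-weighted average power over one cycle (simple linear model)."""
    return run_power_mw * (1.0 - idle_ratio) + idle_power_mw * idle_ratio


def worst_case_delay_us(idle_us, exit_latency_us):
    """Upper bound on the delay seen by work that arrives just as the
    injected idle period starts: the whole idle slot plus the time needed
    to leave the idle state (exit_latency_us is an assumed figure)."""
    return idle_us + exit_latency_us


if __name__ == "__main__":
    run_us, idle_us = idle_injection_plan(period_us=100_000, idle_ratio=0.25)
    print("run/idle per cycle (us):", run_us, idle_us)
    print("average power (mW):", average_power_mw(2000.0, 100.0, 0.25))
    print("worst-case delay (us):", worst_case_delay_us(idle_us, 1500))
```

The sketch only captures the trade-off being debated in the thread: a
larger idle share reduces average power, while a deeper idle state (lower
idle power, longer exit latency) increases the delay added to pending work.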