Date: Thu, 8 Mar 2018 09:59:49 +0100
From: Pavel Machek
To: Daniel Lezcano
Cc: edubezval@gmail.com, kevin.wangtao@linaro.org, leo.yan@linaro.org,
    vincent.guittot@linaro.org, amit.kachhap@gmail.com,
    linux-kernel@vger.kernel.org, javi.merino@kernel.org,
    rui.zhang@intel.com, daniel.thompson@linaro.org,
    linux-pm@vger.kernel.org, Jonathan Corbet, "open list:DOCUMENTATION"
Subject: Re: [PATCH V2 5/7] thermal/drivers/cpu_cooling: Add idle cooling device documentation
Message-ID: <20180308085949.GB17761@amd>
References: <1519226968-19821-1-git-send-email-daniel.lezcano@linaro.org>
    <1519226968-19821-6-git-send-email-daniel.lezcano@linaro.org>
    <20180306231906.GB28911@amd>
    <84fa8a3c-28bf-41ae-8ed7-9dd348b1cde9@linaro.org>
In-Reply-To: <84fa8a3c-28bf-41ae-8ed7-9dd348b1cde9@linaro.org>
User-Agent: Mutt/1.5.23 (2014-03-12)
X-Mailing-List: linux-kernel@vger.kernel.org

Hi!

> >> +Under certain circumstances, the SoC reaches a temperature exceeding
> >> +the allocated power budget or the maximum temperature limit. The
> >
> > I don't understand. Power budget is in W, temperature is in
> > kelvin. Temperature can't exceed power budget AFAICT.
>
> Yes, it is badly worded. Is the following better ?
>
> "
> Under certain circumstances a SoC can reach the maximum temperature
> limit or is unable to stabilize the temperature around a temperature
> control.
>
> When the SoC has to stabilize the temperature, the kernel can act on a
> cooling device to mitigate the dissipated power.
>
> When the maximum temperature is reached and to prevent a catastrophic
> situation a radical decision must be taken to reduce the temperature
> under the critical threshold, that impacts the performance.
>
> "

Actually... if hardware is expected to protect itself, I'd tone it
down. No need to be all catastrophic and critical... But yes, better.

> > Critical here, critical there. I have trouble following
> > it. Theoretically hardware should protect itself, because you don't
> > want kernel bug to damage your CPU?
>
> There are several levels of protection. The first level is mitigating
> the temperature from the kernel, then in the temperature sensor a reset
> line will trigger the reboot of the CPUs. Usually it is a register where
> you write the maximum temperature, from the driver itself. I never tried
> to write 1000°C in this register and see if I can burn the board.
>
> I know some boards have another level of thermal protection in the
> hardware itself and some other don't.
>
> In any case, from a kernel point of view, it is a critical situation as
> we are about to hard reboot the system and in this case it is preferable
> to drop drastically the performance but give the opportunity to the
> system to run in a degraded mode.

Agreed, you want to keep going. In the ACPI world, we shut down when the
critical trip point is reached, so this is somewhat confusing.

> >> +Solutions:
> >> +----------
> >> +
> >> +If we can remove the static and the dynamic leakage for a specific
> >> +duration in a controlled period, the SoC temperature will
> >> +decrease. Acting at the idle state duration or the idle cycle
> >
> > "should" decrease? If you are in bad environment..
>
> No, it will decrease in any case because of the static leakage drop. The
> bad environment will impact the speed of this decrease.

I meant... if the ambient temperature is 105°C, there's not much you can
do to cool the system down :-).

> >> +Idle Injection:
> >> +---------------
> >> +
> >> +The base concept of the idle injection is to force the CPU to go to an
> >> +idle state for a specified time each control cycle, it provides
> >> +another way to control CPU power and heat in addition to
> >> +cpufreq. Ideally, if all CPUs of a cluster inject idle synchronously,
> >> +this cluster can get into the deepest idle state and achieve minimum
> >> +power consumption, but that will also increase system response latency
> >> +if we inject less than cpuidle latency.
> >
> > I don't understand last sentence.
>
> Is it better ?
>
> "Ideally, if all CPUs, belonging to the same cluster, inject their idle
> cycle synchronously, the cluster can reach its power down state with a
> minimum power consumption and static leakage drop. However, these idle
> cycles injection will add extra latencies as the CPUs will have to
> wakeup from a deep sleep state."

Extra comma in "CPUs, belonging". But yes, better.

> Thanks!

You are welcome.
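
To make the ambient temperature point concrete, here is a rough sketch of the arithmetic, not anything taken from the patch. It assumes a single lumped thermal resistance (steady-state temperature = ambient + R_th * average power) and that idle injection simply mixes a running power and an idle power by the injected idle ratio. All function names and numbers below are hypothetical and only for illustration.

```python
def average_power(p_run_w, p_idle_w, idle_ratio):
    """Average power when the CPU is forced idle for idle_ratio of each cycle."""
    if not 0.0 <= idle_ratio <= 1.0:
        raise ValueError("idle_ratio must be within [0, 1]")
    return (1.0 - idle_ratio) * p_run_w + idle_ratio * p_idle_w


def steady_state_temp(t_ambient_c, r_th_c_per_w, power_w):
    """Steady-state temperature of the assumed lumped thermal model."""
    return t_ambient_c + r_th_c_per_w * power_w


def min_idle_ratio(t_target_c, t_ambient_c, r_th_c_per_w, p_run_w, p_idle_w):
    """Smallest idle ratio that holds the target temperature, or None.

    None means even a fully idle CPU stays above the target, which is
    the "bad environment" case: idle injection alone cannot help.
    """
    # Power the SoC may dissipate without exceeding the target.
    budget_w = (t_target_c - t_ambient_c) / r_th_c_per_w
    if budget_w >= p_run_w:
        return 0.0          # no idle injection needed
    if budget_w < p_idle_w:
        return None         # target unreachable in this environment
    return (p_run_w - budget_w) / (p_run_w - p_idle_w)


if __name__ == "__main__":
    # Hypothetical SoC: 4 W running, 0.5 W idle, 10 degC/W, 85 degC target.
    for ambient in (25.0, 55.0, 105.0):
        print(ambient, min_idle_ratio(85.0, ambient, 10.0, 4.0, 0.5))
```

Under these assumptions the required idle ratio grows with the ambient temperature, and once the ambient is above the target no idle ratio is enough, so "will decrease" only holds relative to what the temperature would have been without injection.
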

Best regards,
									Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html