Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.0 (2014-02-07) on aws-us-west-2-korg-lkml-1.web.codeaurora.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by smtp.lore.kernel.org (Postfix) with ESMTP id 5BA00C74A4B for ; Mon, 13 Mar 2023 09:38:05 +0000 (UTC) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S229956AbjCMJiE (ORCPT ); Mon, 13 Mar 2023 05:38:04 -0400 Received: from lindbergh.monkeyblade.net ([23.128.96.19]:50886 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S231179AbjCMJhs (ORCPT ); Mon, 13 Mar 2023 05:37:48 -0400 Received: from foss.arm.com (foss.arm.com [217.140.110.172]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 78F892B600; Mon, 13 Mar 2023 02:36:09 -0700 (PDT) Received: from usa-sjc-imap-foss1.foss.arm.com (unknown [10.121.207.14]) by usa-sjc-mx-foss1.foss.arm.com (Postfix) with ESMTP id 47A8A2F4; Mon, 13 Mar 2023 02:36:15 -0700 (PDT) Received: from [10.57.18.52] (unknown [10.57.18.52]) by usa-sjc-imap-foss1.foss.arm.com (Postfix) with ESMTPSA id E32313F71A; Mon, 13 Mar 2023 02:35:29 -0700 (PDT) Message-ID: <8727651b-88ec-efe7-eed2-1ff08faf22b8@arm.com> Date: Mon, 13 Mar 2023 09:35:27 +0000 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:102.0) Gecko/20100101 Thunderbird/102.7.1 Subject: Re: [PATCH] thermal/core/power_allocator: avoid cdev->state can not be reset Content-Language: en-US To: Xuewen Yan Cc: Di Shen , daniel.lezcano@linaro.org, rafael@kernel.org, amitk@kernel.org, rui.zhang@intel.com, linux-pm@vger.kernel.org, linux-kernel@vger.kernel.org, xuewen.yan@unisoc.com, Qais Yousef References: <20230309135515.1232-1-di.shen@unisoc.com> From: Lukasz Luba In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Hi Xuewen, On 3/13/23 01:40, Xuewen Yan wrote: > Hi Lukasz > > On Fri, Mar 10, 2023 at 11:56 PM Lukasz Luba wrote: >> >> Hi Di, >> >> On 3/9/23 13:55, Di Shen wrote: >>> Commit 0952177f2a1f (thermal/core/power_allocator: Update once cooling devices when temp is low) >>> add a update flag to update cooling device only once when temp is low. >>> But when the switch_on_temp is set to be a higher value, the cooling device state >>> may not be reset to max, because the last_temp is smaller than the switch_on_temp. >>> >>> For example: >>> First: >>> swicth_on_temp=70 control_temp=85; >>> >>> Then userspace change the trip_temp: >>> swicth_on_temp=45 control_temp=55 cur_temp=54 >>> >>> Then userspace reset the trip_temp: >>> swicth_on_temp=70 control_temp=85 cur_temp=57 last_temp=54 >>> >>> At this time, the cooling device state should be reset to be max. >>> However, because cur_temp(57) < switch_on_temp(70) >>> last_temp(54) < swicth_on_temp(70) --> update = false >>> When update is false, the cooling device state can not be reset. >> >> That's a tricky use case. How is that now possible, > > We use the trip_temp in the Android System. Often, we set different > control temperatures in different scenarios, > and when we change the switch_on_temp from small to bigger, we find > the power can not be reset to be max. I see, thanks for letting me know that this is Android. > > >>> >>> So delete the update condition, so that the cooling device state >>> could be reset. >> >> IMO this is not the desired solution. Daniel reported the issue that >> IPA triggers the event sent to user-space even when there is no need. >> That's the motivation for the 0952177f2a1f change. >> >> To address your scenario properly, we need an interface which allows >> to respond properly for such situation when someone from user-space >> writes a new value to something fundamental as trip point. >> >> You also have a kernel config enabled: >> CONFIG_THERMAL_WRITABLE_TRIPS >> which IMO is only for debug kernels for system integrator (according >> to the Kconfig description). > > Yes, we use it to meet the temperature control needs of different scenarios. > And now in android with google's GKI2.0, the config must be opened. OK > >> >> When you disable this config in your deploy/product kernel >> than this issue would disappear. >> >>> >>> Fixes: 0952177f2a1f (thermal/core/power_allocator: Update once cooling devices when temp is low) >>> Signed-off-by: Di Shen >>> --- >>> drivers/thermal/gov_power_allocator.c | 9 +++------ >>> 1 file changed, 3 insertions(+), 6 deletions(-) >>> >> >> That's why IMO this is not the solution. > > Yes, but I think we should fix the bug, although the > CONFIG_THERMAL_WRITABLE_TRIPS is just for debugging. > How about record the last_trip_temp, and when the last_temp > > last_trip_temp, set the update tobe true? Yes, if that config is used in Android then we must fix it. That last_trip_temp makes sense (but maybe name it last_switch_on_temp). Please put that new field into the IPA local struct power_allocator_params. We should store the trip temp value there every time power_allocator_throttle() is called. That can be called due to a write from user-space w/ a new trip point value, so should be OK. Regards, Lukasz