Received: by 2002:a05:6a10:7420:0:0:0:0 with SMTP id hk32csp838022pxb; Thu, 17 Feb 2022 16:13:31 -0800 (PST) X-Google-Smtp-Source: ABdhPJxmYKMTQGHAV0A6gCiAR9R+MW1Xij3Bp8S9yQXaNXOqtUOKF+RLFhGHqud1eo3iONVZ8uom X-Received: by 2002:aa7:90d5:0:b0:4e1:307c:d94a with SMTP id k21-20020aa790d5000000b004e1307cd94amr5466085pfk.38.1645143211360; Thu, 17 Feb 2022 16:13:31 -0800 (PST) ARC-Seal: i=1; a=rsa-sha256; t=1645143211; cv=none; d=google.com; s=arc-20160816; b=Nrj4So3vt77Ga8VeQzxWmCc2DtSemS76qxZGFr47l6phPTvS1bdTKhxtianL+YGvjr wVXkP6RmzDN9BsotUrpLDIZpDwWTP51ukzRrpk0Pfjej5o1T1jamQM3qGrImnN17ts9Z WLqPNnE9hzw14mB8bqwkiSEskiKmOe2KKGEb5d7oHaxa/Q41YPSrvtQDva6OdOQ6ZWja DHMuQlfj5cEYMu2R56/Ee2MpPWGWcVdki8/Jqtlg+5KsJxXL2yLVPJgOXaiHwlDQC5Wc 1H9Z4G1tAtaZ2DEdAdN8ePMwcuH7VD/kzvRCFGtfBR1yiMoznQ0aFKoEsdYmLfJC9pIR onZQ== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20160816; h=list-id:precedence:content-transfer-encoding:in-reply-to:from :references:cc:to:content-language:subject:user-agent:mime-version :date:message-id:dkim-signature; bh=m7i8XTKn573xDW3h00Hw3tHZvPjE6WSVbAod5YiJM20=; b=moQNBQA4D6Q2bdp+K4x0JgphO0f7C6hfW+jH/EynY5C/EoFvZzG2mTMw0JzjAdQ2l7 PWVKJWiPgizI9DVfE4/xnKJ7PkIUgWG+XChbBW4tOCncYnM+sw/LBkYvSEZIvaKIQim1 6cJlQxgIfKHATixGUk4bMGaZBPwqil+OUHAUNplOaXV0x+39frAN93a92x2ttZV5OQFP 7zFT5aCFtHwNyq0d5hZP6foFMfsOohj55JbRI39qmUFbW+NEczejb0JdnRmnpMMc8JR6 9OzjsdXvJL94hSRsMDcFN4wa1Y45pROOQrQcK1xQWGmfdA5JJLZC7fXu3oAw+te+NO3h HX+g== ARC-Authentication-Results: i=1; mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=tp3XN+Vk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Return-Path: Received: from lindbergh.monkeyblade.net (lindbergh.monkeyblade.net. [2620:137:e000::1:18]) by mx.google.com with ESMTPS id be3si10145328pgb.421.2022.02.17.16.13.31 (version=TLS1_3 cipher=TLS_AES_256_GCM_SHA384 bits=256/256); Thu, 17 Feb 2022 16:13:31 -0800 (PST) Received-SPF: pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) client-ip=2620:137:e000::1:18; Authentication-Results: mx.google.com; dkim=pass header.i=@linaro.org header.s=google header.b=tp3XN+Vk; spf=pass (google.com: domain of linux-kernel-owner@vger.kernel.org designates 2620:137:e000::1:18 as permitted sender) smtp.mailfrom=linux-kernel-owner@vger.kernel.org; dmarc=pass (p=NONE sp=NONE dis=NONE) header.from=linaro.org Received: from vger.kernel.org (vger.kernel.org [23.128.96.18]) by lindbergh.monkeyblade.net (Postfix) with ESMTP id 4F00334E496; Thu, 17 Feb 2022 15:38:34 -0800 (PST) Received: (majordomo@vger.kernel.org) by vger.kernel.org via listexpand id S241983AbiBQS3B (ORCPT + 99 others); Thu, 17 Feb 2022 13:29:01 -0500 Received: from mxb-00190b01.gslb.pphosted.com ([23.128.96.19]:43850 "EHLO lindbergh.monkeyblade.net" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S244307AbiBQS2v (ORCPT ); Thu, 17 Feb 2022 13:28:51 -0500 Received: from mail-wr1-x42d.google.com (mail-wr1-x42d.google.com [IPv6:2a00:1450:4864:20::42d]) by lindbergh.monkeyblade.net (Postfix) with ESMTPS id 64B6138BB for ; Thu, 17 Feb 2022 10:27:47 -0800 (PST) Received: by mail-wr1-x42d.google.com with SMTP id e3so10595084wra.0 for ; Thu, 17 Feb 2022 10:27:47 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=linaro.org; s=google; h=message-id:date:mime-version:user-agent:subject:content-language:to :cc:references:from:in-reply-to:content-transfer-encoding; bh=m7i8XTKn573xDW3h00Hw3tHZvPjE6WSVbAod5YiJM20=; b=tp3XN+Vk26cTI4ZXkHGYWn7fHIOWb5xAQkYnV4HJ9Tu7w0WtdV5xcv59x6DDKBSHpX y58Vxv2Jd2PBlnESSVYr6m5Xm4EBnD4psyOQr5vwN4RqpBWnSN+mTZYDJV79NrH/T2Yg oJnR1e64m3wW2UWeGPHirsOhwj87apeyOvRn4BNngxRzORwqTD/f3mo6XnN+DWtfr0KO 3kS5C/JKtoH5sLB/Uy9t6nFKA5XVwvR3CvWwgKQb7cmDO/HBzdE+fyPY3M/0HF0DpHrG zsN9jm1u8qiFNVbXY6/JHUEpjU3ll6JGZCV20+x0Qnu9sUx1A+tOZ1CcvZADrAHKHlcQ 5zfw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20210112; h=x-gm-message-state:message-id:date:mime-version:user-agent:subject :content-language:to:cc:references:from:in-reply-to :content-transfer-encoding; bh=m7i8XTKn573xDW3h00Hw3tHZvPjE6WSVbAod5YiJM20=; b=bZpt7be4HKYusC6WwzhbaMbKv9cDTLmSOLOAX9dNY3VN5q/yuWSokJv/BxExMWT8j7 nVqTZwJzUeFHy54RzzNSaRoqV8+CMljle2yATnBFYgQqJsHAVaxJPL4ih3fVKqGUVLuv a/osAiw9fTyN+Y4XjL3S9hjKTEi6SL09b7IPjBlyDqWnbX3G+hxjYR3YM9cwSjGvrHEW M0AIvTPF0ASTy75V3zGpV1IGwZc/JND3PcIAZS4dFf85TkUHJQm6YVXYzTk76L4GfZz8 br7yLL8gNDfmTULikTFNuNwjWojfln4DUL9OZjqjDGKsQHfyWQ2EtCB/rHOTX/uw8J2o sfJw== X-Gm-Message-State: AOAM530EzgB3m56aZAOZsbir8fTabkzF1iaDRIzgK4yj8IHJbC2xbHuw wyW8V37Tl9ketk4JXI9Ka+M6FQ== X-Received: by 2002:a5d:4907:0:b0:1e3:b4c:4a66 with SMTP id x7-20020a5d4907000000b001e30b4c4a66mr3272609wrq.273.1645122465788; Thu, 17 Feb 2022 10:27:45 -0800 (PST) Received: from ?IPV6:2a01:e34:ed2f:f020:5828:300b:5226:ef02? ([2a01:e34:ed2f:f020:5828:300b:5226:ef02]) by smtp.googlemail.com with ESMTPSA id c8sm2615013wmq.34.2022.02.17.10.27.43 (version=TLS1_3 cipher=TLS_AES_128_GCM_SHA256 bits=128/128); Thu, 17 Feb 2022 10:27:45 -0800 (PST) Message-ID: <3651bf48-eb0f-1835-3c5e-1dbdc2730fb4@linaro.org> Date: Thu, 17 Feb 2022 19:27:36 +0100 MIME-Version: 1.0 User-Agent: Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.5.0 Subject: Re: [PATCH 1/2] thermal: cooling: Check Energy Model type in cpufreq_cooling and devfreq_cooling Content-Language: en-US To: Doug Anderson , Lukasz Luba Cc: Matthias Kaehlcke , LKML , Linux PM , amit daniel kachhap , Viresh Kumar , "Rafael J. Wysocki" , Amit Kucheria , Zhang Rui , Dietmar Eggemann , Pierre.Gondois@arm.com, Stephen Boyd , Rajendra Nayak , Bjorn Andersson , jorcrous@amazon.com, Rob Clark , Manaf Meethalavalappu Pallikunhi References: <20220207073036.14901-1-lukasz.luba@arm.com> <20220207073036.14901-2-lukasz.luba@arm.com> <4a7d4e94-1461-5bac-5798-29998af9793a@arm.com> <7c059f4f-7439-0cad-c398-96dbde4e49c1@linaro.org> <5b8ca53e-3595-85fd-5ae9-a5e8285e8513@arm.com> From: Daniel Lezcano In-Reply-To: Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 8bit X-Spam-Status: No, score=-2.0 required=5.0 tests=BAYES_00,DKIM_SIGNED, DKIM_VALID,DKIM_VALID_AU,HEADER_FROM_DIFFERENT_DOMAINS, MAILING_LIST_MULTI,NICE_REPLY_A,RDNS_NONE,SPF_HELO_NONE, T_SCC_BODY_TEXT_LINE autolearn=no autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on lindbergh.monkeyblade.net Precedence: bulk List-ID: X-Mailing-List: linux-kernel@vger.kernel.org Adding Manaf (QCom) in the loop On 17/02/2022 17:37, Doug Anderson wrote: > Hi, > > On Thu, Feb 17, 2022 at 2:47 AM Lukasz Luba wrote: >> >> Hi Daniel, >> >> On 2/17/22 10:10 AM, Daniel Lezcano wrote: >>> On 16/02/2022 18:33, Doug Anderson wrote: >>>> Hi, >>>> >>>> On Wed, Feb 16, 2022 at 7:35 AM Lukasz Luba wrote: >>>>> >>>>> Hi Matthias, >>>>> >>>>> On 2/9/22 10:17 PM, Matthias Kaehlcke wrote: >>>>>> On Wed, Feb 09, 2022 at 11:16:36AM +0000, Lukasz Luba wrote: >>>>>>> >>>>>>> >>>>>>> On 2/8/22 5:25 PM, Matthias Kaehlcke wrote: >>>>>>>> On Tue, Feb 08, 2022 at 09:32:28AM +0000, Lukasz Luba wrote: >>>>>>>>> >>>>>>>>> >>>>> >>>>> [snip] >>>>> >>>>>>>>> Could you point me to those devices please? >>>>>>>> >>>>>>>> arch/arm64/boot/dts/qcom/sc7180-trogdor-* >>>>>>>> >>>>>>>> Though as per above they shouldn't be impacted by your change, >>>>>>>> since the >>>>>>>> CPUs always pretend to use milli-Watts. >>>>>>>> >>>>>>>> [skipped some questions/answers since sc7180 isn't actually >>>>>>>> impacted by >>>>>>>> the change] >>>>>>> >>>>>>> Thank you Matthias. I will investigate your setup to get better >>>>>>> understanding. >>>>>> >>>>>> Thanks! >>>>>> >>>>> >>>>> I've checked those DT files and related code. >>>>> As you already said, this patch is safe for them. >>>>> So we can apply it IMO. >>>>> >>>>> >>>>> -------------Off-topic------------------ >>>>> Not in $subject comments: >>>>> >>>>> AFAICS based on two files which define thermal zones: >>>>> sc7180-trogdor-homestar.dtsi >>>>> sc7180-trogdor-coachz.dtsi >>>>> >>>>> only the 'big' cores are used as cooling devices in the >>>>> 'skin_temp_thermal' - the CPU6 and CPU7. >>>>> >>>>> I assume you don't want to model at all the power usage >>>>> from the Little cluster (which is quite big: 6 CPUs), do you? >>>>> I can see that the Little CPUs have small dyn-power-coeff >>>>> ~30% of the big and lower max freq, but still might be worth >>>>> to add them to IPA. You might give them more 'weight', to >>>>> make sure they receive more power during power split. >>>>> >>>>> You also don't have GPU cooling device in that thermal zone. >>>>> Based on my experience if your GPU is a power hungry one, >>>>> e.g. 2-4Watts, you might get better results when you model >>>>> this 'hot' device (which impacts your temp sensor reported value). >>>> >>>> I think the two boards you point at (homestar and coachz) are just the >>>> two that override the default defined in the SoC dtsi file. If you >>>> look in sc7180.dtsi you'll see 'gpuss1-thermal' which has a cooling >>>> map. You can also see the cooling maps for the littles. >>>> >>>> I guess we don't have a `dynamic-power-coefficient` for the GPU, >>>> though? Seems like we should, but I haven't dug through all the code >>>> here... >>> >>> The dynamic-power-coefficient is available for OPPs which includes >>> CPUfreq and devfreq. As the GPU is managed by devfreq, setting the >>> dynamic-power-coefficient makes the energy model available for it. >>> >>> However, the OPPs must define the frequency and the voltage. That is the >>> case for most platforms except on QCom platform. >>> >>> That may not be specified as it uses a frequency index and the hardware >>> does the voltage change in our back. The QCom cpufreq backend get the >>> voltage table from a register (or whatever) and completes the voltage >>> values for the OPPs, thus adding the information which is missing in the >>> device tree. The energy model can then initializes itself and allows the >>> usage of the Energy Aware Scheduler. >>> >>> However this piece of code is missing for the GPU part. >>> >> >> Thank you for joining the discussion. I don't know about that Qcom >> GPU voltage information is missing. >> >> If the voltage is not available (only the frequencies), there is >> another way. There is an 'advanced' EM which uses registration function: >> em_dev_register_perf_domain(). It uses a local driver callback to get >> power for each found frequency. It has benefit because there is no >> restriction to 'fit' into the math formula, instead just avg power >> values can be feed into EM. It's called 'advanced' EM [1]. > > It seems like there _should_ be a way to get the voltage out for GPU > operating points, like is done with cpufreq in > qcom_cpufreq_hw_read_lut(), but it might need someone with Qualcomm > documentation to help with it. Maybe Rajendra would be able to help? > Adding Jordon and Rob to this conversation in case they're aware of > anything. > > As you said, we could just list a power for each frequency, though. > > I'm actually not sure which one would be more accurate across a range > of devices with different "corners": specifying a dynamic power > coefficient used for all "corners" and then using the actual voltage > and doing the math, or specifying a power number for each frequency > and ignoring the actual voltage used. In any case we're trying to get > ballpark numbers and not every device will be exactly the same, so > probably it doesn't matter that much. > > >> Now we hit (again) the DT & EM issue (it's an old one, IIRC Morten >> was proposing from ~2014 this upstream, but EAS wasn't merged back >> then): >> where to store these power-freq values, which are then used by the >> callback. We have the 'dynamic-power-coefficient' in DT, but >> it has limitations. It would be good to have this simple array >> attached to the GPU/CPU node. IMHO it meet the requirement of DT, >> it describes the HW (it would have HZ and Watts values). >> >> Doug, Matthias could you have a look at that function and its >> usage, please [1]? >> If you guys would support me in this, I would start, with an RFC >> proposal, a discussion on LKML. >> >> [1] >> https://elixir.bootlin.com/linux/v5.17-rc4/source/Documentation/power/energy-model.rst#L87 > > Matthias: I think you've spent more time on the thermal stuff than me > so I'll assume you'll follow-up here. If not then please yell! > > Ideally, though, someone from Qualcomm would jump in an own this. > Basically it allows more intelligently throttling the GPU and CPU > together in tandem instead of treating them separately IIUC, right? > > -Doug -- Linaro.org │ Open source software for ARM SoCs Follow Linaro: Facebook | Twitter | Blog