Date: Wed, 13 Mar 2024 22:47:36 +0100
From: Beata Michalska
To: Vanshidhar Konda
Cc: linux-kernel@vger.kernel.org, linux-arm-kernel@lists.infradead.org,
 ionela.voinescu@arm.com, sudeep.holla@arm.com, will@kernel.org,
 catalin.marinas@arm.com, vincent.guittot@linaro.org, sumitg@nvidia.com,
 yang@os.amperecomputing.com, lihuisong@huawei.com
Subject: Re: [PATCH v3 2/3] arm64: Provide an AMU-based version of arch_freq_get_on_cpu
References: <20240312083431.3239989-1-beata.michalska@arm.com> <20240312083431.3239989-3-beata.michalska@arm.com>

On Tue, Mar 12, 2024 at 07:12:37PM -0700, Vanshidhar Konda wrote:
> On Tue, Mar 12, 2024 at 08:34:30AM +0000, Beata Michalska wrote:
> > With the Frequency Invariance Engine (FIE) being already wired up with
> > sched tick and making use of relevant (core counter and constant
> > counter) AMU counters, getting the current frequency for a given CPU
> > on supported platforms can be achieved by utilizing the frequency scale
> > factor which reflects an average CPU frequency for the last tick period
> > length.
> >
> > The solution is partially based on APERF/MPERF implementation of
> > arch_freq_get_on_cpu.
> >
> > Suggested-by: Ionela Voinescu
> > Signed-off-by: Beata Michalska
> > ---
> >  arch/arm64/kernel/topology.c | 103 +++++++++++++++++++++++++++++++----
> >  1 file changed, 92 insertions(+), 11 deletions(-)
> >
> > diff --git a/arch/arm64/kernel/topology.c b/arch/arm64/kernel/topology.c
> > index 1a2c72f3e7f8..42cb19c31719 100644
> > --- a/arch/arm64/kernel/topology.c
> > +++ b/arch/arm64/kernel/topology.c
> > @@ -17,6 +17,8 @@
> >  #include
> >  #include
> >  #include
> > +#include
> > +#include
> >
> >  #include
> >  #include
> > @@ -88,18 +90,31 @@ int __init parse_acpi_topology(void)
> >   * initialized.
> >   */
> >  static DEFINE_PER_CPU_READ_MOSTLY(unsigned long, arch_max_freq_scale) = 1UL << (2 * SCHED_CAPACITY_SHIFT);
> > -static DEFINE_PER_CPU(u64, arch_const_cycles_prev);
> > -static DEFINE_PER_CPU(u64, arch_core_cycles_prev);
> >  static cpumask_var_t amu_fie_cpus;
> >
> > +struct amu_cntr_sample {
> > +	u64 arch_const_cycles_prev;
> > +	u64 arch_core_cycles_prev;
> > +	unsigned long last_update;
> > +	seqcount_t seq;
> > +};
> > +
> > +static DEFINE_PER_CPU_SHARED_ALIGNED(struct amu_cntr_sample, cpu_amu_samples) = {
> > +	.seq = SEQCNT_ZERO(cpu_amu_samples.seq)
> > +};
> > +
> >  void update_freq_counters_refs(void)
> >  {
> > -	this_cpu_write(arch_core_cycles_prev, read_corecnt());
> > -	this_cpu_write(arch_const_cycles_prev, read_constcnt());
> > +	struct amu_cntr_sample *amu_sample = this_cpu_ptr(&cpu_amu_samples);
> > +
> > +	amu_sample->arch_core_cycles_prev = read_corecnt();
> > +	amu_sample->arch_const_cycles_prev = read_constcnt();
> >  }
> >
> >  static inline bool freq_counters_valid(int cpu)
> >  {
> > +	struct amu_cntr_sample *amu_sample = per_cpu_ptr(&cpu_amu_samples, cpu);
> > +
> >  	if ((cpu >= nr_cpu_ids) || !cpumask_test_cpu(cpu, cpu_present_mask))
> >  		return false;
> >
> > @@ -108,8 +123,8 @@ static inline bool freq_counters_valid(int cpu)
> >  		return false;
> >  	}
> >
> > -	if (unlikely(!per_cpu(arch_const_cycles_prev, cpu) ||
> > -		     !per_cpu(arch_core_cycles_prev, cpu))) {
> > +	if (unlikely(!amu_sample->arch_const_cycles_prev ||
> > +		     !amu_sample->arch_core_cycles_prev)) {
> >  		pr_debug("CPU%d: cycle counters are not enabled.\n", cpu);
> >  		return false;
> >  	}
> > @@ -152,20 +167,27 @@ void freq_inv_set_max_ratio(int cpu, u64 max_rate)
> >
> >  static void amu_scale_freq_tick(void)
> >  {
> > +	struct amu_cntr_sample *amu_sample = this_cpu_ptr(&cpu_amu_samples);
> >  	u64 prev_core_cnt, prev_const_cnt;
> >  	u64 core_cnt, const_cnt, scale;
> >
> > -	prev_const_cnt = this_cpu_read(arch_const_cycles_prev);
> > -	prev_core_cnt = this_cpu_read(arch_core_cycles_prev);
> > +	prev_const_cnt = amu_sample->arch_const_cycles_prev;
> > +	prev_core_cnt = amu_sample->arch_core_cycles_prev;
> > +
> > +	write_seqcount_begin(&amu_sample->seq);
> >
> >  	update_freq_counters_refs();
> >
> > -	const_cnt = this_cpu_read(arch_const_cycles_prev);
> > -	core_cnt = this_cpu_read(arch_core_cycles_prev);
> > +	const_cnt = amu_sample->arch_const_cycles_prev;
> > +	core_cnt = amu_sample->arch_core_cycles_prev;
> >
> > +	/*
> > +	 * This should not happen unless the AMUs have been reset and the
> > +	 * counter values have not been resroted - unlikely
>
> /resroted/restored
>
> > +	 */
> >  	if (unlikely(core_cnt <= prev_core_cnt ||
> >  		     const_cnt <= prev_const_cnt))
> > -		return;
> > +		goto leave;
> >
> >  	/*
> >  	 *	    /\core    arch_max_freq_scale
> > @@ -182,6 +204,10 @@ static void amu_scale_freq_tick(void)
> >
> >  	scale = min_t(unsigned long, scale, SCHED_CAPACITY_SCALE);
> >  	this_cpu_write(arch_freq_scale, (unsigned long)scale);
> > +
> > +	amu_sample->last_update = jiffies;
> > +leave:
> > +	write_seqcount_end(&amu_sample->seq);
> >  }
> >
> >  static struct scale_freq_data amu_sfd = {
> > @@ -189,6 +215,61 @@ static struct scale_freq_data amu_sfd = {
> >  	.set_freq_scale = amu_scale_freq_tick,
> >  };
> >
> > +#define AMU_SAMPLE_EXP_MS	20
> > +
> > +unsigned int arch_freq_get_on_cpu(int cpu)
> > +{
> > +	struct amu_cntr_sample *amu_sample;
> > +	unsigned long last_update;
> > +	unsigned int seq;
> > +	unsigned int freq;
> > +	u64 scale;
> > +
> > +	if (!cpumask_test_cpu(cpu, amu_fie_cpus) || !arch_scale_freq_ref(cpu))
> > +		return 0;
> > +
> > +retry:
> > +	amu_sample = per_cpu_ptr(&cpu_amu_samples, cpu);
> > +
> > +	do {
> > +		seq = raw_read_seqcount_begin(&amu_sample->seq);
> > +		last_update = amu_sample->last_update;
> > +	} while (read_seqcount_retry(&amu_sample->seq, seq));
> > +
> > +	/*
> > +	 * For those CPUs that are in full dynticks mode,
> > +	 * and those that have not seen tick for a while
> > +	 * try an alternative source for the counters (and thus freq scale),
> > +	 * if available for given policy
> > +	 */
> > +	if (time_is_before_jiffies(last_update + msecs_to_jiffies(AMU_SAMPLE_EXP_MS))) {
> > +		struct cpufreq_policy *policy = cpufreq_cpu_get(cpu);
> > +		int ref_cpu = nr_cpu_ids;
> > +
> > +		if (cpumask_intersects(housekeeping_cpumask(HK_TYPE_TICK),
> > +				       policy->cpus))
> > +			ref_cpu = cpumask_nth_and(cpu, policy->cpus,
> > +						  housekeeping_cpumask(HK_TYPE_TICK));
> > +
>
> Could you help me understand why getting the frequency from another
> housekeeping cpu would be a better than returning 0? Wouldn't different
> CPUs in the HK_TYPE_TICK domain be running at independent frequencies?
> May be adding this explanation to the patch commit message would help
> people who look at this in the future?

If the last AMU sample is older than its assumed validity window, we try
another CPU within the same frequency domain, picking a housekeeping CPU
as one that might have seen the tick within the last (assumed) 20ms.
We stick to the CPUs within the same policy, and thus the same frequency
domain, which means those CPUs do operate at the same frequency.
In the case of per-core DVFS this will bail out with '0', as policy->cpus
will contain a single CPU.

Having said that, this code is bogus: it does not handle the wrapping needed
when the CPU currently being considered is the last one in the policy's cpus
mask. I will send an update soon. I will also try to make the comment above
more readable.
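
To illustrate the wrapping issue mentioned above, here is a minimal userspace
sketch of a wrap-around search for a reference CPU. It is not kernel code and
not necessarily what the updated patch will do: the CPU masks are modelled as
plain 64-bit words, and next_ref_cpu_wrap() and NR_CPUS_SKETCH are hypothetical
names introduced only for this example.

```c
#include <stdint.h>

#define NR_CPUS_SKETCH 64	/* hypothetical limit: one bit per CPU in a u64 */

/*
 * Return the next CPU after 'cpu' that is set in both 'policy_cpus' and
 * 'hk_cpus', continuing from bit 0 once the highest bit has been passed.
 * Returns NR_CPUS_SKETCH when no CPU other than 'cpu' qualifies.
 */
static int next_ref_cpu_wrap(int cpu, uint64_t policy_cpus, uint64_t hk_cpus)
{
	uint64_t candidates = policy_cpus & hk_cpus;
	int i;

	/* Visit every other CPU exactly once, starting right after 'cpu'. */
	for (i = 1; i < NR_CPUS_SKETCH; i++) {
		int next = (cpu + i) % NR_CPUS_SKETCH;

		if (candidates & (UINT64_C(1) << next))
			return next;
	}

	return NR_CPUS_SKETCH;
}
```

With a policy covering CPUs 4-7 and CPU 4 being the only housekeeping CPU in
it, a query for CPU 7 wraps around and yields CPU 4, while a single-CPU policy
(per-core DVFS) yields no alternative at all.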

---
BR
Beata
>
> Thanks,
> Vanshi
>
> > +		cpufreq_cpu_put(policy);
> > +		if (ref_cpu >= nr_cpu_ids || ref_cpu == cpu)
> > +			/* No alternative to pull info from */
> > +			return 0;
> > +		cpu = ref_cpu;
> > +		goto retry;
> > +	}
> > +	/*
> > +	 * Reversed computation to the one used to determine
> > +	 * the arch_freq_scale value
> > +	 * (see amu_scale_freq_tick for details)
> > +	 */
> > +	scale = arch_scale_freq_capacity(cpu);
> > +	freq = scale * arch_scale_freq_ref(cpu);
> > +	freq >>= SCHED_CAPACITY_SHIFT;
> > +
> > +	return freq;
> > +}
> > +
> >  static void amu_fie_setup(const struct cpumask *cpus)
> >  {
> >  	int cpu;
> > --
> > 2.25.1
> >
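
For reference, the reversed computation at the end of arch_freq_get_on_cpu()
can be sketched in userspace as below. This assumes SCHED_CAPACITY_SHIFT is 10
(so a scale of 1024 means "running at the reference frequency") and that the
reference frequency is given in kHz. freq_from_scale() is a hypothetical helper
name, and keeping the intermediate product in 64 bits is a choice made in this
sketch rather than something taken from the patch.

```c
#include <stdint.h>

#define SCHED_CAPACITY_SHIFT 10	/* assumed: scale of 1 << 10 == 100% */

/*
 * Turn a frequency scale factor back into a frequency:
 *
 *	freq = (scale * ref_freq) >> SCHED_CAPACITY_SHIFT
 *
 * 'scale' is the per-CPU frequency scale factor, 'ref_freq_khz' the
 * reference frequency the scale factor was computed against.
 */
static unsigned int freq_from_scale(uint64_t scale, unsigned int ref_freq_khz)
{
	/* Multiply in 64 bits first, then drop the fixed-point fraction. */
	return (unsigned int)((scale * ref_freq_khz) >> SCHED_CAPACITY_SHIFT);
}
```

For example, a scale factor of 768 against a 3000000 kHz reference gives
2250000 kHz, i.e. 75% of the reference frequency.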