Re: [PATCH v3 3/6] cpufreq: Add an interface to mark inefficient frequencies
From: "Rafael J. Wysocki" <rafael@kernel.org>
Date: 2021-07-06 12:35:01
On Tue, Jul 6, 2021 at 10:13 AM Vincent Donnefort [off-list ref] wrote:
[...]quoted
quoted
What about a cpufreq_policy option that if sets would make cpufreq_frequency_table_target() skip inefficient OPPs while staying within the limit of max policy?That would work too, ->quoted
Each governor could decide to set it or not, but it would hide the efficiency resolution to the governor and allow drivers that implements ->target() to also implements support for inefficient OPPs.-> but alternatively there could be an additional cpufreq driver flag to be set by the drivers implementing ->target() and wanting to deal with CPUFREQ_RELATION_EFFICIENT themselves (an opt-in of sorts). So the governors that want it may pass CPUFREQ_RELATION_EFFICIENT to __cpufreq_driver_target() and then it will be passed to ->target() depending on whether or not the new driver flag is set.Of course, I can implement this instead of a cpufreq_policy flag in v4. I suppose then right fallback for CPUFREQ_RELATION_EFFICIENT in case the driver doesn't opt-in is CPUFREQ_RELATION_L.quoted
quoted
That flag could be set according to a new cpufreq_governor flag CPUFREQ_GOV_SKIP_INEFFICIENCIES? That could though modify behaviors like powersave_bias from ondemand. But if a frequency is inefficient, there's probably no power saving anyway.AFAICS, the userspace governor aside, using inefficient frequencies only works with the powersave governor. In the other cases, RELATION_L (say) can be interpreted as "the closest efficient frequency equal to or above the target" with the max policy limit possibly causing inefficient frequencies to be used if they are closer to the limit than the next efficient one. As a rule, the governors don't assume that there are any inefficient frequencies in the table. In fact, they don't make any assumptions regarding the contents of the frequency table at all. They don't even assume that the driver uses a frequency table in the first place.So all the governors, beside powersave and userspace would replace their RELATION_L with RELATION_EFFICIENT. I'll add the changes in v4. So if I sum-up: new RELATION_EFFICIENT that resolves RELATION_L to an higher efficient frequency (if necessary) within the limits of policy->max.
Yes. It can be called RELATION_E for brevity.
CPUfreq drivers can opt-in by setting an appropriate flag. If they do not, RELATION_EFFICIENT will be rewritten in RELATION_L.
Yes, and cpufreq_frequency_table_target() will take RELATION_E into account if set.
All governors but userspace and powersave would use RELATION_EFFICIENT instead of RELATION_L.
Yes.
If that works for you, I'll implement this in a v4, as well as some improvements for the CPUfreq/EM registration following the discussion with Viresh.
Sounds good, thanks!