Re: sched-freq locking

From: Juri Lelli <hidden>
Date: 2016-01-20 12:17:37
Also in: lkml

Possibly related (same subject, not in this thread)

2016-01-20 · Re: sched-freq locking · Punit Agrawal <hidden>
2016-01-20 · Re: sched-freq locking · Punit Agrawal <hidden>

[+Punit, Javi]

Hi,

On 19/01/16 17:24, Steve Muckle wrote:

On 01/19/2016 03:40 PM, Michael Turquette wrote:

quoted

Right, this was _the_ original impetus behind the design decision to
muck around with struct cpufreq_policy in the hot path which goes al
the way back to v1.

An alternative thought is that we can make copies of the relevant bits
of struct cpufreq_policy that we do not expect too change often. These
will not require any locks as they are mostly read-only data on the
scheduler side of the interface. Or we could even go all in and just
make local copies of the struct directly, during the GOV_START
perhaps, with:

I believe this is a good first step as it avoids reworking a huge amount
of locking and can get us to something functionally correct. It is what
I had proposed earlier, copying the enabled CPUs and freq table in
during the governor start callback. Unless there are objections to it
I'll add it to the next schedfreq RFC.

I fear that caching could break thermal. If everybody was already using
sched-freq interface to control frequency this won't probably be a
problem, but we are not yet there :(. So, IIUC by caching policy->max,
for example, we might affect what thermal expects from cpufreq.

quoted

...

quoted

Well if we're going to try an optimize out every single false-positive
wakeup then I think that the cleanest long term solution would be
rework the per-policy locking around struct cpufreq_policy to use a
raw spinlock.

It would be nice if the policy lock was a spinlock but I don't know how
easy that is. From a quick look at cpufreq there's a blocking notifier
chain that's called with rwsem held, so it looks messy. Potentially long
term indeed.

Right. Blocking notifiers are one problem, as I was saying to Peter
yesterday.

quoted

Also it'd be good I think to avoid building in an assumption that we'll
never want to run solely in the fast (atomic) path. Perhaps ARM won't,
and x86 may never use this, but it's reasonable to think another
platform might come along which uses cpufreq and has the capability to
kick off cpufreq transitions swiftly and without sleeping. Maybe ARM
platforms will evolve to have that capability.

The current design of the cpufreq subsystem and its interfaces have
made this choice for us. sched-freq is just another consumer of
cpufreq, and until cpufreq's own locking scheme is improved then we
have no choice.

I did not word that very well - I should have said, we should avoid
building in an assumption that we never want to try and run in the fast
path.

AFAICS, once we've calculated that a frequency change is required we can
down_write_trylock(&policy->rwsem) in the fast path and go ahead with
the transition, if the trylock succeeds and the driver supports fast
path transitions. We can fall back to the slow path (waking up the
kthread) if that fails.

quoted

This discussion is pretty useful. Should we Cc lkml to this thread?

Done (added linux-pm, PeterZ and Rafael as well).

This discussion is pretty interesting, yes. I'm a bit afraid people
bumped into this might have troubles understanding context, though.
And I'm not sure how to give them that context; maybe start a new thread
summarizing what has been discussed so far?

Best,

- Juri

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help