PM regression with commit 5de85b9d57ab PM runtime re-init in v4.5-rc1
From: stern@rowland.harvard.edu (Alan Stern)
Date: 2016-02-02 16:23:48
Also in:
linux-omap, linux-pm
On Tue, 2 Feb 2016, Ulf Hansson wrote:
That's not what we want. So I moved the checks to the actual
->runtime_resume() callback in the PM domain, instead of in the
omap_device_enable() function. A version 2 is attached. Please give it
at try.
[...]
From: Ulf Hansson <redacted>
Date: Tue, 2 Feb 2016 10:05:39 +0100
Subject: [PATCH V2] ARM: OMAP2+: omap-device: Allow devices to be pre-enabled
Commit 5de85b9d57ab ("PM / runtime: Re-init runtime PM states at probe
error and driver unbind") may re-initialize the runtime PM status of the
device to RPM_SUSPENDED at driver probe failures.
For the omap_hsmmc and likely also other omap drivers, which needs more
than one attempt to ->probe() (returning -EPROBE_DEFER), this commit
causes a regression at the PM domain level (omap hwmod).
The reason is that the drivers don't put back the device into low power
state while bailing out in ->probe to return -EPROBE_DEFER. This leads to
that pm_runtime_reinit() in driver core, is re-initializing the runtime PM
status from RPM_ACTIVE to RPM_SUSPENDED.But isn't that okay? When the system starts up, the PM core doesn't know the actual state of any device. It just marks each device as RPM_SUSPENDED and disabled as it is registered. It's the job of subsystems and drivers to make sure the stored state agrees with the actual physical state. So why shouldn't pm_runtime_reinit() leave the structure the same as it would be if the device had just been registered?
The next ->probe() attempt then triggers the ->runtime_resume() callback to be invoked, which means this happens two times in a row. At the PM domain level (omap hwmod) this is being treated as an error and thus the runtime PM status of the device isn't correctly synchronized with the runtime PM core.
Why doesn't the same problem arise the first time the device is probed? Or to put it another, why should a re-probe be any different from the initial probe?
In the end, ->probe() anyway succeeds (as the driver don't checks the error code from the runtime PM APIs), but results in that the PM domain always stays powered on. This because of the runtime PM core believes the device is RPM_SUSPENDED. Fix this regression by allowing devices to be pre-enabled when the PM domain's (omap hwmod) ->runtime_resume() callback is requested to enable the device. In such cases, return zero to synchronize the runtime PM status with the runtime PM core.
Is this really the right way to fix the problem? Alan Stern