RE: [PATCH v4 1/6] Documentation: arm: define DT idle states bindings
From: Charles Garcia-Tobin <hidden>
Date: 2014-06-19 07:33:55
Also in:
linux-arm-kernel, linux-pm
-----Original Message----- From: Santosh Shilimkar [mailto:santosh.shilimkar@ti.com] Sent: 18 June 2014 20:27 To: Lorenzo Pieralisi; Nicolas Pitre Cc: linux-arm-kernel@lists.infradead.org; linux-pm@vger.kernel.org; devicetree@vger.kernel.org; Mark Rutland; Sudeep Holla; Catalin Marinas; Charles Garcia-Tobin; Rob Herring; grant.likely@linaro.org; Peter De Schrijver; Daniel Lezcano; Amit Kucheria; Vincent Guittot; Antti Miettinen; Stephen Boyd; Kevin Hilman; Sebastian Capella; Tomasz Figa; Mark Brown; Paul Walmsley; Chander Kashyap Subject: Re: [PATCH v4 1/6] Documentation: arm: define DT idle states bindings On Wednesday 18 June 2014 01:36 PM, Lorenzo Pieralisi wrote:quoted
On Fri, Jun 13, 2014 at 06:33:35PM +0100, Nicolas Pitre wrote:[..]quoted
Ok, a minor tweak to the diagram above, min-residency should include energy costs related to idle entry and exit, but not the exit-latency itself, as long as the energy costs implied by exiting the state are factored out in the min-residency-us property. Hence, to sum it up, I attached below the updated bindings patch: I think we are close to an agreement, if anyone disagrees pleaseshoutquoted
as soon as possible so that we can still integrate changes.[..]quoted
-- >8 -- Subject: [PATCH] Documentation: arm: define DT idle states bindings ARM based platforms implement a variety of power management schemesthatquoted
allow processors to enter idle states at run-time. The parameters defining these idle states vary on a per-platformbasis forcingquoted
the OS to hardcode the state parameters in platform specific statictablesquoted
whose size grows as the number of platforms supported in the kernelincreasesquoted
and hampers device drivers standardization. Therefore, this patch aims at standardizing idle state device treebindings forquoted
ARM platforms. Bindings define idle state parameters inclusive ofentry methodsquoted
and state latencies, to allow operating systems to retrieve theconfigurationquoted
entries from the device tree and initialize the related powermanagementquoted
drivers, paving the way for common code in the kernel to deal withidlequoted
states and removing the need for static data in current and previouskernelquoted
versions. Reviewed-by: Sebastian Capella <redacted> Signed-off-by: Lorenzo Pieralisi <redacted> ---Nice work Lorenzo !! I have few comments/questions.quoted
Documentation/devicetree/bindings/arm/cpus.txt | 8 + .../devicetree/bindings/arm/idle-states.txt | 561+++++++++++++++++++++quoted
2 files changed, 569 insertions(+) create mode 100644 Documentation/devicetree/bindings/arm/idle-states.txtquoted
diff --git a/Documentation/devicetree/bindings/arm/cpus.txtb/Documentation/devicetree/bindings/arm/cpus.txtquoted
index 1fe72a0..a44d4fd 100644--- a/Documentation/devicetree/bindings/arm/cpus.txt +++ b/Documentation/devicetree/bindings/arm/cpus.txt@@ -215,6 +215,12 @@ nodes to be present and contain the propertiesdescribed below.quoted
Value type: <phandle> Definition: Specifies the ACC[2] node associated with thisCPU.quoted
+ - cpu-idle-states + Usage: Optional + Value type: <prop-encoded-array> + Definition: + # List of phandles to idle state nodes supported + by this cpu [3]. Example 1 (dual-cluster big.LITTLE system 32-bit):@@ -411,3 +417,5 @@ cpus { -- [1] arm/msm/qcom,saw2.txt [2] arm/msm/qcom,kpss-acc.txt +[3] ARM Linux kernel documentation - idle states bindings + Documentation/devicetree/bindings/arm/idle-states.txtdiff --git a/Documentation/devicetree/bindings/arm/idle-states.txtb/Documentation/devicetree/bindings/arm/idle-states.txtquoted
new file mode 100644 index 0000000..c9e1ec6--- /dev/null +++ b/Documentation/devicetree/bindings/arm/idle-states.txt@@ -0,0 +1,561 @@ +========================================== +ARM idle states binding description +========================================== + +========================================== +1 - Introduction +========================================== + +ARM systems contain HW capable of managing power consumptiondynamically,quoted
+where cores can be put in different low-power states (ranging fromsimplequoted
+wfi to power gating) according to OSPM policies. The CPU statesrepresenting s/OSPM/OS PM ?quoted
+the range of dynamic idle states that a processor can enter at run-time, can bequoted
+specified through device tree bindings representing the parametersrequiredquoted
+to enter/exit specific idle states on a given processor. + +According to the Server Base System Architecture document (SBSA,[3]), thequoted
+power states an ARM CPU can be put into are identified by thefollowing list:quoted
+ +- Running +- Idle_standby +- Idle_retention +- Sleep +- Off + +The power states described in the SBSA document define the basic CPUstates onquoted
+top of which ARM platforms implement power management schemes thatallow an OSquoted
+PM implementation to put the processor in different idle states(which includequoted
+states listed above; "off" state is not an idle state since it doesnot havequoted
+wake-up capabilities, hence it is not considered in this document). + +Idle state parameters (eg entry latency) are platform specific andneed to bequoted
+characterized with bindings that provide the required information toOSPM Dittoquoted
+code so that it can build the required tables and use them atruntime.quoted
+ +The device tree binding definition for ARM idle states is thesubject of thisquoted
+document. + +=========================================== +2 - idle-states node +=========================================== + +ARM processor idle states are defined within the idle-states node,which isquoted
+a direct child of the cpus node [1] and provides a container wherethequoted
+processor idle states, defined as device tree nodes, are listed. + +- idle-states node + + Usage: Optional - On ARM systems, is a container of processor idles/is/it is ?quoted
+ states nodes. If the system does not provide CPU + power management capabilities or the processor
just
quoted
+ supports idle_standby an idle-states node is not + required. + + Description: idle-states node is a container node, where its + subnodes describe the CPU idle states. + + Node name must be "idle-states". + + The idle-states node's parent node must be the cpus node. + + The idle-states node's child nodes can be:s/idle-states/idle-statequoted
+ + - one or more state nodes + + Any other configuration is considered invalid. + + An idle-states node defines the following properties: + + - entry-method + Usage: Required + Value type: <stringlist> + Definition: Describes the method by which a CPU enters the + idle states. This property is required and must
be
quoted
+ one of: + + - "arm,psci" + ARM PSCI firmware interface [2]. + + - "[vendor],[method]" + An implementation dependent string with + format "vendor,method", where vendor is a
string
quoted
+ denoting the name of the manufacturer and + method is a string specifying the mechanism + used to enter the idle state. + +The nodes describing the idle states (state) can only be definedwithin thequoted
+idle-states node, any other configuration is considered invalid andthereforequoted
+must be ignored. + +=========================================== +3 - state node +=========================================== + +A state node represents an idle state description and must bedefined asquoted
+follows: + +- state node + + Description: must be child of the idle-states node + + The state node name shall follow standard device tree naming + rules ([5], 2.2.1 "Node names"), in particular state nodes which + are siblings within a single common parent must be given a uniquename.quoted
+ + The idle state entered by executing the wfi instruction(idle_standbyquoted
+ SBSA,[3][4]) is considered standard on all ARM platforms andthereforequoted
+ must not be listed. + + To correctly specify idle states timing and energy relatedproperties,quoted
+ the following definitions identify the different execution phases + a CPU goes through to enter and exit idle states and the implied + energy metrics: + +..__[EXEC]__|__[PREP]__|__[ENTRY]__|__[IDLE]__|__[EXIT]__|__[EXEC] __..quoted
+ | | | | | + + |<------ entry ------->| + | latency | + |<- exit ->| + | latency | + |<-------- min-residency -------->| + |<------- wakeup-latency ------->| +I don't know the wakeup latency makes much sense and also correct. Hardware wakeup latency is actually exit latency. Is it for failed or abort-able ilde case ? We are adding this as a new parameter at least from idle states perspective. I think we should just avoid it.
Hi Santosh, To me wake up latency makes up a lot of sense. It is not always the same as exit latency, it will depend on your system, and just how smart it is. In some cases the [ENTRY] period may not be negligible in which case exit latency will be less than the wake up latency. In addition, it will generally always be shorter than entry+exit which is the default value if omitted, this assumes the PREP time is not abortable, but this is the safer assumption to make. Wake up latency is really the number that folk have in their head for what you'd stick into the pm_qos to veto entry into states when you are latency constrained. The one thing that really is an optimisation here is having a separate exit latency, which is being proposed for use in core selection for the scheduler. So if anything was going to be made optional pending new scheduler patches should that not be entry/exit latency? Cheers Charles
quoted
+ EXEC: Normal CPU execution. + + PREP: Preparation phase before committing the hardware to idle
mode
quoted
+ like cache flushing. This is abortable on pending wake-up + event conditions. The abort latency is assumed to benegligiblequoted
+ (i.e. less than the ENTRY + EXIT duration). If aborted, CPU + goes back to EXEC. This phase is optional. If not abortable, + this should be included in the ENTRY phase instead. + + ENTRY: The hardware is committed to idle mode. This period mustrunquoted
+ to completion up to IDLE before anything else can happen. + + IDLE: This is the actual energy-saving idle period. This may last + between 0 and infinite time, until a wake-up event occurs. + + EXIT: Period during which the CPU is brought back to operational + mode (EXEC). + + With the definitions provided above, the following list represents + the valid properties for a state node: + + - compatible + Usage: Required + Value type: <stringlist> + Definition: Must be "arm,idle-state". + + - logic-state-retained + Usage: See definition + Value type: <none> + Definition: if present logic is retained on state entry, + otherwise it is lost. + + - cache-state-retained + Usage: See definition + Value type: <none> + Definition: if present cache memory is retained on stateentry,quoted
+ otherwise it is lost. + + - entry-method-param + Usage: See definition. + Value type: <u32> + Definition: Depends on the idle-states node entry-method + property value. Refer to the entry-method
bindings
quoted
+ for this property value definition. + + - entry-latency-us + Usage: Required + Value type: <prop-encoded-array> + Definition: u32 value representing worst case latency in + microseconds required to enter the idle state. + The exit-latency-us duration may be guaranteed + only after entry-latency-us has passed. + + - exit-latency-us + Usage: Required + Value type: <prop-encoded-array> + Definition: u32 value representing worst case latency + in microseconds required to exit the idle state. + + - min-residency-us + Usage: Required + Value type: <prop-encoded-array> + Definition: u32 value representing minimum residency
duration
quoted
+ in microseconds, inclusive of preparation and + entry, for this idle state to be considered + worthwhile energy wise. + The residency time must take into account the + energy consumed while entering and exiting the + idle state and is therefore expected to be + longer than entry-latency-us. + + - wakeup-latency-us: + Usage: Optional + Value type: <prop-encoded-array> + Definition: u32 value representing maximum delay between the + signaling of a wake-up event and the CPU being + able to execute normal code again. If omitted, + this is assumed to be equal to: + entry-latency-us + exit-latency-us +Rest of the patch looks fine by to me. regards, Santosh