Thread (8 messages) 8 messages, 4 authors, 2016-03-16

Re: [PATCHv2 3/3] thermal: improve hot trip handling

From: Srikar Srimath Tirumala <hidden>
Date: 2016-02-03 23:30:07

Hi Eduardo and Rui,

Geert Uytterhoeven <geert <at> linux-m68k.org> writes:
Hi Eduardo,

On Thu, Dec 17, 2015 at 8:13 PM, Eduardo Valentin <edubezval <at> 
gmail.com> wrote:
quoted
The idea is to add the choice to be notified only when temperature
crosses trip points. The trip points affected are the non-passive
trip points.
This is good, how about notifying for non critical trip points as well?  
This will allow userspace to do workload throttling based on passive 
thermal trips.
quoted
It will check last temperature and current temperature against
the trip point temperature and its hysteresis.
In case the check shows temperature has changed enought indicating
a trip point crossing, a uevent will be sent to userspace.

The uevent contains the thermal zone type, the current temperature,
the last temperature and the trip point in which the current 
temperature
quoted
now resides.
How about adding sysfs_notify on some of the existing attributes like 
trip_point_*_type instead? Since notifying passive trip points using 
uevents feels expensive we could maybe use sysfs_notify, which would 
satisfy the requirements for all type of trip points.
quoted
The behavior of ops->notify() callback remains the same.

Cc: Zhang Rui <rui.zhang <at> intel.com>
Cc: linux-pm <at> vger.kernel.org
Cc: linux-kernel <at> vger.kernel.org
Signed-off-by: Eduardo Valentin <edubezval <at> gmail.com>
---
V1->V2: none
---
 drivers/thermal/thermal_core.c | 52 
++++++++++++++++++++++++++++++++++++++++++
quoted
 1 file changed, 52 insertions(+)
diff --git a/drivers/thermal/thermal_core.c 
b/drivers/thermal/thermal_core.c
quoted
index a229c84..e0f1f4e 100644
--- a/drivers/thermal/thermal_core.c
+++ b/drivers/thermal/thermal_core.c
 <at>  <at>  -423,6 +423,56  <at>  <at>  static void 
handle_non_critical_trips(struct thermal_zone_device *tz,
quoted
                       def_governor->throttle(tz, trip);
 }

+static void thermal_tripped_notify(struct thermal_zone_device *tz,
+                                  int trip, enum thermal_trip_type 
trip_type,
quoted
+                                  int trip_temp)
+{
+       char tuv_name[THERMAL_NAME_LENGTH + 15], tuv_temp[25],
+               tuv_ltemp[25], tuv_trip[25], tuv_type[25];
+       char *msg[6] = { tuv_name, tuv_temp, tuv_ltemp, tuv_trip, 
tuv_type,
quoted
+                       NULL };
+       int upper_trip_hyst, upper_trip_temp, trip_hyst = 0;
+       int ret = 0;
+
+       snprintf(tuv_name, sizeof(tuv_name), "THERMAL_ZONE=%s", tz-
type);
quoted
+       snprintf(tuv_temp, sizeof(tuv_temp), "TEMP=%d", tz-
temperature);
quoted
+       snprintf(tuv_ltemp, sizeof(tuv_ltemp), "LAST_TEMP=%d",
+                tz->last_temperature);
+       snprintf(tuv_trip, sizeof(tuv_trip), "TRIP=%d", trip);
+       snprintf(tuv_type, sizeof(tuv_type), "TRIP_TYPE=%d", 
trip_type);
quoted
+
+       mutex_lock(&tz->lock);
+
+       /* crossing up */
+       if (tz->last_temperature < trip_temp && trip_temp < tz-
temperature)
quoted
+               kobject_uevent_env(&tz->device.kobj, KOBJ_CHANGE, msg);
+
+       if (tz->ops->get_trip_hyst)
+               tz->ops->get_trip_hyst(tz, trip, &trip_hyst);
+
+       /* crossing down, check for hyst */
+       trip_temp -= trip_hyst;
+       if (tz->last_temperature > trip_temp && trip_temp > tz-
temperature) {
quoted
+               snprintf(tuv_trip, sizeof(tuv_trip), "TRIP=%d", trip - 
1);
quoted
+               kobject_uevent_env(&tz->device.kobj, KOBJ_CHANGE, msg);
+       }
+
+       ret = tz->ops->get_trip_temp(tz, trip + 1, &upper_trip_temp);
"trip + 1" may be equal to thermal_zone_device.trips and thus out-of-
range,
in which case rcar_thermal_get_trip_temp() will print an error message:

    rcar_thermal e61f0000.thermal: rcar driver trip error

Is the "+ 1" (also below) intentional?
If yes, I think the related error messages in rcar_thermal.c should be 
reduced
to debug messages.
quoted
+       if (ret)
+               goto unlock;
+
+       if (tz->ops->get_trip_hyst)
+               tz->ops->get_trip_hyst(tz, trip + 1, &upper_trip_hyst);
Gr{oetje,eeting}s,

                        Geert

--
Geert Uytterhoeven -- There's lots of Linux beyond ia32 -- geert <at> 
linux-m68k.org
In personal conversations with technical people, I call myself a hacker. 
But
when I'm talking to journalists I just say "programmer" or something like 
that.
                                -- Linus Torvalds
--
To unsubscribe from this list: send the line "unsubscribe linux-pm" in
the body of a message to majordomo <at> vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Alternate Proposal: Initiate a sysfs_notify on cooling_dev*/cur_state from 
thermal_cdev_update and a sysfs_notify on thermal_zone*/temp from 
thermal_zone_device_update whenever there is new state or whenever 
temperature crosses a trip point. 

A userspace app implementing workload throttling would be waiting for these 
notifications using the poll() method. It could have chosen any combination 
thermal zone and cooling device cur state to wait on. Upon receiving a 
notification, it would initiate actions like lowering the frame rate of a 
camera.

Please let us know your thoughts on the subject.

Thanks,
Srikar

Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help