Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model

[PATCH v3 00/15] livepatch: hybrid consistency model · Josh Poimboeuf <hidden> · 2016-12-08
[PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-12-16
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Miroslav Benes <mbenes@suse.cz> · 2016-12-19
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-12-19
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Miroslav Benes <mbenes@suse.cz> · 2016-12-19
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Petr Mladek <pmladek@suse.com> · 2016-12-20
Re: [PATCH v3 01/15] stacktrace/x86: add function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-12-20
[PATCH v3 05/15] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 05/15] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 05/15] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Miroslav Benes <mbenes@suse.cz> · 2016-12-21
Re: [PATCH v3 05/15] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Kamalesh Babulal <hidden> · 2017-01-10
[PATCH v3 04/15] livepatch/x86: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 04/15] livepatch/x86: add TIF_PATCH_PENDING thread flag · Andy Lutomirski <luto@amacapital.net> · 2016-12-08
Re: [PATCH v3 04/15] livepatch/x86: add TIF_PATCH_PENDING thread flag · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 04/15] livepatch/x86: add TIF_PATCH_PENDING thread flag · Miroslav Benes <mbenes@suse.cz> · 2016-12-21
Re: [PATCH v3 04/15] livepatch/x86: add TIF_PATCH_PENDING thread flag · Kamalesh Babulal <hidden> · 2017-01-11
[PATCH v3 15/15] livepatch: allow removal of a disabled patch · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 15/15] livepatch: allow removal of a disabled patch · Petr Mladek <pmladek@suse.com> · 2016-12-21
Re: [PATCH v3 15/15] livepatch: allow removal of a disabled patch · Miroslav Benes <mbenes@suse.cz> · 2017-01-04
Re: [PATCH v3 15/15] livepatch: allow removal of a disabled patch · Josh Poimboeuf <hidden> · 2017-01-06
[PATCH v3 14/15] livepatch: add /proc/<pid>/patch_state · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 14/15] livepatch: add /proc/<pid>/patch_state · Petr Mladek <pmladek@suse.com> · 2016-12-21
Re: [PATCH v3 14/15] livepatch: add /proc/<pid>/patch_state · Miroslav Benes <mbenes@suse.cz> · 2017-01-04
[PATCH v3 10/15] livepatch: move patching functions into patch.c · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 10/15] livepatch: move patching functions into patch.c · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 10/15] livepatch: move patching functions into patch.c · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
Re: [PATCH v3 10/15] livepatch: move patching functions into patch.c · Kamalesh Babulal <hidden> · 2017-01-10
[PATCH v3 08/15] livepatch: separate enabled and patched states · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 08/15] livepatch: separate enabled and patched states · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 08/15] livepatch: separate enabled and patched states · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
Re: [PATCH v3 08/15] livepatch: separate enabled and patched states · Kamalesh Babulal <hidden> · 2017-01-10
[PATCH v3 12/15] livepatch: store function sizes · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 12/15] livepatch: store function sizes · Petr Mladek <pmladek@suse.com> · 2016-12-19
Re: [PATCH v3 12/15] livepatch: store function sizes · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
Re: [PATCH v3 12/15] livepatch: store function sizes · Kamalesh Babulal <hidden> · 2017-01-11
[PATCH v3 07/15] livepatch/s390: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-12-08
[PATCH v3 11/15] livepatch: use kstrtobool() in enabled_store() · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 11/15] livepatch: use kstrtobool() in enabled_store() · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 11/15] livepatch: use kstrtobool() in enabled_store() · Josh Poimboeuf <hidden> · 2016-12-16
Re: [PATCH v3 11/15] livepatch: use kstrtobool() in enabled_store() · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
[PATCH v3 09/15] livepatch: remove unnecessary object loaded check · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 09/15] livepatch: remove unnecessary object loaded check · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 09/15] livepatch: remove unnecessary object loaded check · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
Re: [PATCH v3 09/15] livepatch: remove unnecessary object loaded check · Kamalesh Babulal <hidden> · 2017-01-10
[PATCH v3 02/15] x86/entry: define _TIF_ALLWORK_MASK flags explicitly · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 02/15] x86/entry: define _TIF_ALLWORK_MASK flags explicitly · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 02/15] x86/entry: define _TIF_ALLWORK_MASK flags explicitly · Josh Poimboeuf <hidden> · 2016-12-16
Re: [PATCH v3 02/15] x86/entry: define _TIF_ALLWORK_MASK flags explicitly · Miroslav Benes <mbenes@suse.cz> · 2016-12-19
Re: [PATCH v3 02/15] x86/entry: define _TIF_ALLWORK_MASK flags explicitly · Kamalesh Babulal <hidden> · 2017-01-10
[PATCH v3 03/15] livepatch: temporary stubs for klp_patch_pending() and klp_update_patch_state() · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 03/15] livepatch: temporary stubs for klp_patch_pending() and klp_update_patch_state() · Petr Mladek <pmladek@suse.com> · 2016-12-16
Re: [PATCH v3 03/15] livepatch: temporary stubs for klp_patch_pending() and klp_update_patch_state() · Josh Poimboeuf <hidden> · 2016-12-16
[PATCH v3 06/15] livepatch/s390: reorganize TIF thread flag bits · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 06/15] livepatch/s390: reorganize TIF thread flag bits · Miroslav Benes <mbenes@suse.cz> · 2016-12-21
[PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-12-08
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-12-20
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-12-21
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-12-22
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-12-22
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2017-01-10
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2017-01-10
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2017-01-11
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2017-01-11
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-12-23
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-12-23
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2017-01-06
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2017-01-10
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2017-01-04
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2017-01-06
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2017-01-10
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2017-01-05
Re: [PATCH v3 13/15] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2017-01-06
Re: [PATCH v3 00/15] livepatch: hybrid consistency model · Balbir Singh <bsingharora@gmail.com> · 2016-12-10
Re: [PATCH v3 00/15] livepatch: hybrid consistency model · Josh Poimboeuf <hidden> · 2016-12-10
Re: [PATCH v3 00/15] livepatch: hybrid consistency model · Balbir Singh <bsingharora@gmail.com> · 2016-12-11
Re: [PATCH v3 00/15] livepatch: hybrid consistency model · Josh Poimboeuf <hidden> · 2016-12-12

From: Petr Mladek <pmladek@suse.com>
Date: 2017-01-11 15:18:35
Also in: linuxppc-dev, lkml

On Tue 2017-01-10 14:46:46, Josh Poimboeuf wrote:

On Tue, Jan 10, 2017 at 02:00:58PM +0100, Petr Mladek wrote:

quoted

On Thu 2016-12-22 12:31:37, Josh Poimboeuf wrote:

quoted

On Thu, Dec 22, 2016 at 03:34:52PM +0100, Petr Mladek wrote:

quoted

On Wed 2016-12-21 15:25:05, Josh Poimboeuf wrote:

quoted

On Tue, Dec 20, 2016 at 06:32:46PM +0100, Petr Mladek wrote:

quoted

On Thu 2016-12-08 12:08:38, Josh Poimboeuf wrote:

quoted

+	read_unlock(&tasklist_lock);
+
+	/*
+	 * Ditto for the idle "swapper" tasks, though they never cross the
+	 * syscall barrier.  Instead they switch over in cpu_idle_loop().
+	 */
+	get_online_cpus();
+	for_each_online_cpu(cpu)
+		set_tsk_thread_flag(idle_task(cpu), TIF_PATCH_PENDING);
+	put_online_cpus();

Also this stage need to be somehow handled by CPU coming/going
handlers.

Here I think we could automatically switch any offline CPUs' idle tasks.
And something similar in klp_try_complete_transition().

We still need to make sure to do not race with the cpu_up()/cpu_down()
calls.

Hm, maybe we'd need to call cpu_hotplug_disable() before switching the
offline idle tasks?

quoted

I would use here the trick with for_each_possible_cpu() and let
the migration for the stack check.

There are a few issues with that:

1) The idle task of a missing CPU doesn't *have* a stack, so it doesn't
   make much sense to try to check it.

2) We can't rely *only* on the stack check, because not all arches have
   it.  The other way to migrate idle tasks is from the idle loop switch
   point.  But if the task's CPU is down, its idle loop isn't running so
   it can't migrate.

   (Note this is currently a theoretical point: we currently don't allow
   such arches to use the consistency model anyway because there's no
   way for them to migrate kthreads.)

Good points. My only concern is that the transaction might take a long
or even forever. I am not sure if it is wise to disable cpu_hotplug
for the entire transaction.

A compromise might be to disable cpu hotplug only when the task
state is manipulated a more complex way. Hmm, cpu_hotplug_disable()
looks like a rather costly function. We should not call it in
klp_try_complete_transition(). But we could do:

  1. When the patch is being enabled, disable cpu hotplug,
     go through each_possible_cpu and setup the transaction
     only for CPUs that are online. Then we could enable
     the hotplug again.

  2. Check only each_online_cpu in klp_try_complete_transition().
     If all tasks are migrated, disable cpu hotplug and re-check
     idle tasks on online CPUs. If any is not migrated, enable
     hotplug and return failure. Othewise, continue with
     completion of the transaction. [*]

  3. In klp_complete_transition, update all tasks including
     the offline CPUs and enable cpu hotplug again.

If the re-check in the 2nd step looks ugly, we could add some hotlug
notifiers to make sure that enabled/disabled CPUs are in a reasonable
state. We still should disable the hotplug in the 1st and 3rd step.

BTW: There is a new API for the cpu hotplug callbacks. I was involved
in one conversion. You might take inspiration in
drivers/thermal/intel_powerclamp.c. See cpuhp_setup_state_nocalls()
there.

Backing up a bit, although I brought up cpu_hotplug_disable(), I think I
misunderstood the race you mentioned.  I actually don't think
cpu_hotplug_disable() is necessary.

Great backing! You made me to study the difference. If I get it
correctly:

  + cpu_hotplug_disable() works like a writer lock. It gets
    exclusive access via cpu_hotplug_begin(). A side effect
    is that do_cpu_up() and do_cpu_down() do not wait. They
    return -EBUSY if hotplug is disabled.

  + get_online_cpus() is kind of reader lock. It makes sure
    that all the hotplug operations are finished and "softly"
    blocks other further operation. By "softly" I mean that
    the operations wait for the exclusive (write) access
    in cpu_hotplug_begin().

IMHO, we really have to use get_online_cpus() and avoid the
the "hard" blocking.

What do you think about something like the following:

In klp_start_transition:

	get_online_cpus();
	for_each_possible_cpu(cpu)
		set_tsk_thread_flag(idle_task(cpu), TIF_PATCH_PENDING);
	put_online_cpus();

In klp_try_complete_transition:

	get_online_cpus();
	for_each_possible_cpu(cpu) {
		task = idle_task(cpu);
		if (cpu_online(cpu)) {
			if (!klp_try_switch_task(task))
				complete = false;
		} else if (task->patch_state != klp_target_state) {
			/* offline CPU idle tasks can be switched immediately */
			clear_tsk_thread_flag(task, TIF_PATCH_PENDING);
			task->patch_state = klp_target_state;
		}
	}
	put_online_cpus();

I like the idea. You are right that it is enough to always get/put
CPUs only when a state of the per-CPU idle tasks are manipulated.
In the meantime, we are safe because of the consistency model
(clever ftrace handler).

Note that we have to use for_each_possible_cpu() everywhere,
e.g. in klp_init_transition(), klp_complete_transition().
Otherwise, we might see an inconsistent state.

For example, klp_ftrace_handler() might see KLP_UNDEFINED state
if we do not set a valid one in klp_init_transition() and a CPU
gets online.

Best Regards,
Petr

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help