Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a... | linux-s390

Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model

From: Petr Mladek <pmladek@suse.com>
Date: 2016-05-18 13:12:25
Also in: linuxppc-dev, lkml

On Mon 2016-05-16 13:12:50, Josh Poimboeuf wrote:

On Mon, May 09, 2016 at 02:23:03PM +0200, Petr Mladek wrote:

quoted

On Fri 2016-05-06 07:38:55, Josh Poimboeuf wrote:

quoted

On Thu, May 05, 2016 at 01:57:01PM +0200, Petr Mladek wrote:

quoted

I have missed that the two commands are called with preemption
disabled. So, I had the following crazy scenario in mind:


CPU0				CPU1

klp_enable_patch()

  klp_target_state = KLP_PATCHED;

  for_each_task()
     set TIF_PATCH_PENDING

				# task 123

				if (klp_patch_pending(current))
				  klp_patch_task(current)

				    clear TIF_PATCH_PENDING

				    smp_rmb();

				    # switch to assembly of
				    # klp_patch_task()

				    mov klp_target_state, %r12

				    # interrupt and schedule
				    # another task


  klp_reverse_transition();

    klp_target_state = KLP_UNPATCHED;

    klp_try_complete_transition()

      task = 123;
      if (task->patch_state == klp_target_state)
         return 0;

    => task 123 is in target state and does
    not block conversion

  klp_complete_transition()


  # disable previous patch on the stack
  klp_disable_patch();

    klp_target_state = KLP_UNPATCHED;
  
  
				    # task 123 gets scheduled again
				    mov %r12, task->patch_state

				    => it happily stores an outdated
				    state
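
The interleaving above can be replayed deterministically on a single
thread. The following is only a user-space sketch, not kernel code: the
names race_task, race_target_state and race_replay() are hypothetical
stand-ins for task_struct, klp_target_state and the two CPUs, and a plain
int models the TIF_PATCH_PENDING flag.

```c
#include <assert.h>

/* Hypothetical stand-ins for the kernel's patch states. */
enum race_state { RACE_UNPATCHED = 0, RACE_PATCHED = 1 };

struct race_task {
	int patch_pending;		/* models TIF_PATCH_PENDING */
	enum race_state patch_state;	/* models task->patch_state */
};

static enum race_state race_target_state;	/* models klp_target_state */

/*
 * Replays the scenario step by step and returns the state that
 * task 123 ends up with.
 */
static enum race_state race_replay(void)
{
	struct race_task task = { 0, RACE_UNPATCHED };
	enum race_state reg;

	/* CPU0: klp_enable_patch() */
	race_target_state = RACE_PATCHED;
	task.patch_pending = 1;

	/* CPU1: klp_patch_task() clears the flag and loads the target ... */
	assert(task.patch_pending);
	task.patch_pending = 0;
	reg = race_target_state;	/* "mov klp_target_state, %r12" */

	/* ... and is interrupted before the store. */

	/* CPU0: klp_reverse_transition() */
	race_target_state = RACE_UNPATCHED;

	/* Task 123 already matches the target, so it does not block completion. */
	assert(task.patch_state == race_target_state);

	/* CPU1: task 123 resumes and stores the stale register value. */
	task.patch_state = reg;

	return task.patch_state;
}
```

In this model race_replay() returns RACE_PATCHED although the target
state has already gone back to RACE_UNPATCHED, which is the outdated
store described in the scenario.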

Thanks for the clear explanation, this helps a lot.

quoted

This is why the two functions should get called with preemption
disabled. We should document it at least. I imagine that we will
use them later also in another context and nobody will remember
this crazy scenario.

Well, even disabled preemption does not help. The process on
CPU1 might also be interrupted by an NMI and do some long
printk in it.

IMHO, the only safe approach is to call klp_patch_task()
only for "current" at a safe place. Then this race is harmless.
The switch happens at a safe place, so it does not matter
into which state the process is switched.

I'm not sure about this solution.  When klp_complete_transition() is
called, we need all tasks to be patched, for good.  We don't want any of
them to randomly switch to the wrong state at some later time in the
middle of a future patch operation.  How would changing klp_patch_task()
to only use "current" prevent that?

You are right that it is a pity, but it really should be safe because
it is not entirely random.

If the race happens and assigns an outdated value, there are two
situations:

1. It is assigned when there is no transition in progress.
   Then it is OK because it will be ignored by the ftrace handler.
   The right state will be set before the next transition starts.

2. It is assigned when some other transition is in progress.
   Then it is OK as long as the function is called from "current".
   The "wrong" state will be used consistently. It will switch
   to the right state at another safe place.

Maybe it would be safe, though I'm not entirely convinced.  Regardless I
think we should avoid these situations entirely because they create
windows for future bugs and races.

Yup, I would prefer a cleaner solution as well.

quoted

In other words, the task state might be updated only

   + by the task itself at a safe place
   + by another task when the updated one is sleeping at a safe place

This should be well documented and the API should help to avoid
a misuse.

I think we could fix it to be safe for future callers who might not have
preemption disabled with a couple of changes to klp_patch_task():
disabling preemption and testing/clearing the TIF_PATCH_PENDING flag
before changing the patch state:

  void klp_patch_task(struct task_struct *task)
  {
  	preempt_disable();
  
  	if (test_and_clear_tsk_thread_flag(task, TIF_PATCH_PENDING))
  		task->patch_state = READ_ONCE(klp_target_state);
  
  	preempt_enable();
  }
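
The test-and-clear part of this proposal can be tried out in user space.
What follows is a rough model using C11 atomics, not the kernel
implementation: model_task, model_target_state and MODEL_PATCH_PENDING
are hypothetical names, and the sketch covers neither preempt_disable()
nor synchronize_sched(). It only shows that the state is written at most
once per pending flag.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdbool.h>

#define MODEL_PATCH_PENDING 0x1u	/* models the TIF_PATCH_PENDING bit */

struct model_task {
	atomic_uint flags;		/* models the thread flags word */
	atomic_int patch_state;		/* models task->patch_state */
};

static atomic_int model_target_state;	/* models klp_target_state */

/* Atomically clears the pending bit and reports whether it was set. */
static bool model_test_and_clear_pending(struct model_task *t)
{
	unsigned int old = atomic_fetch_and(&t->flags, ~MODEL_PATCH_PENDING);

	return (old & MODEL_PATCH_PENDING) != 0;
}

/* Returns true if this call performed the state update. */
static bool model_patch_task(struct model_task *t)
{
	assert(t);

	if (!model_test_and_clear_pending(t))
		return false;

	atomic_store(&t->patch_state, atomic_load(&model_target_state));
	return true;
}
```

A second call without a newly set flag leaves patch_state untouched. The
load and the store are still two separate steps, so the window between
them remains in this model as well.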

It reduces the race window a bit but it is still there. For example,
an NMI still might add a huge delay between reading klp_target_state
and assigning task->patch_state.

Maybe you missed this paragraph from my last email:

| We would also need a synchronize_sched() after the patching is complete,
| either at the end of klp_try_complete_transition() or in
| klp_complete_transition().  That would make sure that all existing calls
| to klp_patch_task() are done.

So a huge NMI delay wouldn't be a problem here.  The call to
synchronize_sched() in klp_complete_transition() would sleep until the
NMI handler returns and the critical section of klp_patch_task()
finishes.  So once a patch is complete, we know that it's really
complete.

Yes, synchronize_sched() will help when preemption is disabled. I did
not shake my head enough last time.

quoted

What about the following?

/*
 * This function might assign an outdated value if the transition
 * is reverted and finalized in parallel. But it is safe. If the value
 * is assigned outside of a transition, it is ignored and the next
 * transition will set the right one. Or if it gets assigned
 * inside another transition, it will repeat the cycle and
 * set the right state.
 */
void klp_update_current_patch_state(void)
{
	while (test_and_clear_tsk_thread_flag(current, TIF_PATCH_PENDING))
		current->patch_state = READ_ONCE(klp_target_state);
}

I'm not sure how this would work.  How would the thread flag get set
again after it's been cleared?

See the race described in the previous mail. The problem is when
klp_target_state and the TIF flag get set after reading klp_target_state
into a register and before storing the value into current->patch_state.
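
To make the retry behaviour concrete, here is a single-threaded sketch of
the loop with a simulated interruption between the load and the store.
It is an illustration under assumptions, not the kernel code: loop_task,
loop_target_state and the hook parameter are hypothetical, and the hook
stands in for another CPU starting a new transition at exactly that
point.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

enum { MODEL_UNPATCHED = 0, MODEL_PATCHED = 1 };

struct loop_task {
	bool patch_pending;	/* models TIF_PATCH_PENDING */
	int patch_state;	/* models current->patch_state */
};

static int loop_target_state;	/* models klp_target_state */

/* Models test_and_clear_tsk_thread_flag() for a single thread. */
static bool loop_test_and_clear(struct loop_task *t)
{
	bool was_set = t->patch_pending;

	t->patch_pending = false;
	return was_set;
}

/* Simulated other CPU: a new transition starts and flags the task again. */
static void loop_reverse(struct loop_task *t)
{
	loop_target_state = MODEL_UNPATCHED;
	t->patch_pending = true;
}

/*
 * Models the proposed loop. The optional hook runs once, between the
 * load of the target state and the store, to replay the interruption.
 */
static void loop_update_state(struct loop_task *cur,
			      void (*hook)(struct loop_task *))
{
	assert(cur);

	while (loop_test_and_clear(cur)) {
		int reg = loop_target_state;	/* load into a "register" */

		if (hook) {
			hook(cur);
			hook = NULL;
		}
		cur->patch_state = reg;		/* possibly stale store */
	}
}
```

With loop_reverse() as the hook, the first pass stores the stale
MODEL_PATCHED value, the flag is found set again, and the second pass
stores MODEL_UNPATCHED. The repair only happens because the new
transition sets the flag again.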

We do not need this if we use synchronize_sched() and fix up
current->patch_state then.

Also I really don't like the idea of randomly updating a task's patch
state after the transition has been completed.

quoted

Note that the disabled preemption helped only partially,
so I think that it was not really needed.

Hmm, it means that task->patch_state might be either
KLP_PATCHED or KLP_UNPATCHED outside a transition. I wonder
if the tristate really brings any advantage.


Alternatively, we might synchronize the operation with klp_mutex.
The function is called in a slow path and in a safe context.
Well, it might cause contention on the lock when many CPUs are
trying to update their tasks.

I don't think a mutex would work because at least the ftrace handler
(and maybe more) can't sleep.  Maybe a spinlock could work but I think
that would be overkill.

Sure, I had a spinlock in mind.

Best Regards,
Petr
