Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking

[RFC PATCH v2 00/18] livepatch: hybrid consistency model · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 02/18] x86/asm/head: use a common function for starting CPUs · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Ingo Molnar <mingo@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-03
RE: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · David Laight <hidden> · 2016-05-04
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-19
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-19
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
[RFC PATCH v2 08/18] livepatch: temporary stubs for klp_patch_pending() and klp_patch_task() · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Petr Mladek <pmladek@suse.com> · 2016-05-03
Re: [RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Miroslav Benes <mbenes@suse.cz> · 2016-05-03
[RFC PATCH v2 12/18] livepatch/s390: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 14/18] livepatch: remove unnecessary object loaded check · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 18/18] livepatch: add /proc/<pid>/patch_state · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Petr Mladek <pmladek@suse.com> · 2016-05-03
Re: [RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Josh Poimboeuf <hidden> · 2016-05-03
[RFC PATCH v2 16/18] livepatch: store function sizes · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-05
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Peter Zijlstra <peterz@infradead.org> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-09
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-06
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-09
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-16
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-18
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-09
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-16
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-10
Re: livepatch: change to a per-task consistency model · Jessica Yu <hidden> · 2016-05-17
Re: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-18
Re: livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-18
Re: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-18
RE: livepatch: change to a per-task consistency model · David Laight <hidden> · 2016-05-23
RE: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-23
RE: livepatch: change to a per-task consistency model · David Laight <hidden> · 2016-05-24
RE: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-24
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-06-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-06-06
[RFC PATCH v2 15/18] livepatch: move patching functions into patch.c · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 15/18] livepatch: move patching functions into patch.c · Petr Mladek <pmladek@suse.com> · 2016-05-03
[RFC PATCH v2 11/18] livepatch/s390: reorganize TIF thread flag bits · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-29
[RFC PATCH v2 07/18] stacktrace/x86: function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 06/18] x86: dump_trace() error handling · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 06/18] x86: dump_trace() error handling · Minfei Huang <hidden> · 2016-04-29
Re: [RFC PATCH v2 06/18] x86: dump_trace() error handling · Josh Poimboeuf <hidden> · 2016-04-29
[RFC PATCH v2 04/18] x86: move _stext marker before head code · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Brian Gerst <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@kernel.org> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
[RFC PATCH v2 01/18] x86/asm/head: clean up initial stack variable · Josh Poimboeuf <hidden> · 2016-04-28

From: Andy Lutomirski <luto@amacapital.net>
Date: 2016-04-29 21:38:14
Also in: linux-s390, lkml

On Fri, Apr 29, 2016 at 2:25 PM, Josh Poimboeuf [off-list ref] wrote:

On Fri, Apr 29, 2016 at 01:32:53PM -0700, Andy Lutomirski wrote:

quoted

On Fri, Apr 29, 2016 at 1:27 PM, Josh Poimboeuf [off-list ref] wrote:

quoted

On Fri, Apr 29, 2016 at 01:19:23PM -0700, Andy Lutomirski wrote:

quoted

On Fri, Apr 29, 2016 at 1:11 PM, Josh Poimboeuf [off-list ref] wrote:

quoted

On Fri, Apr 29, 2016 at 11:06:53AM -0700, Andy Lutomirski wrote:

quoted

On Thu, Apr 28, 2016 at 1:44 PM, Josh Poimboeuf [off-list ref] wrote:

quoted

A preempted function might not have had a chance to save the frame
pointer to the stack yet, which can result in its caller getting skipped
on a stack trace.

Add a flag to indicate when the task has been preempted so that stack
dump code can determine whether the stack trace is reliable.

I think I like this, but how do you handle the rather similar case in
which a task goes to sleep because it's waiting on IO that happened in
response to get_user, put_user, copy_from_user, etc?

Hm, good question.  I was thinking that page faults had a dedicated
stack, but now looking at the entry and traps code, that doesn't seem to
be the case.

Anyway I think it shouldn't be a problem if we make sure that any kernel
function which might trigger a valid page fault (e.g.,
copy_user_generic_string) do the proper frame pointer setup first.  Then
the stack should still be reliable.

In fact I might be able to teach objtool to enforce that: any function
which uses an exception table should create a stack frame.

Or alternatively, maybe set some kind of flag for page faults, similar
to what I did with this patch.

How about doing it the other way around: teach the unwinder to detect
when it hits a non-outermost entry (i.e. it lands in idtentry, etc)
and use some reasonable heuristic as to whether it's okay to keep
unwinding.  You should be able to handle preemption like that, too --
the unwind process will end up in an IRQ frame.

How exactly would the unwinder detect if a text address is in an
idtentry?  Maybe put all the idt entries in a special ELF section?

Hmm.

What actually happens when you unwind all the way into the entry code?
 Don't you end up in something that isn't in an ELF function?  Can you
detect that?

For entry from user space (e.g., syscalls), it's easy to detect because
there's always a pt_regs at the bottom of the stack.  So if the unwinder
reaches the stack address at (thread.sp0 - sizeof(pt_regs)), it knows
it's done.

But for nested entry (e.g. in-kernel irqs/exceptions like preemption and
page faults which don't have dedicated stacks), where the pt_regs is
stored somewhere in the middle of the stack instead of the bottom,
there's no reliable way to detect that.

quoted

Ideally, the unwinder could actually detect that it's
hit a pt_regs struct and report that.  If used for stack dumps, it
could display some indication of this and then continue its unwinding
by decoding the pt_regs.  If used for patching, it could take some
other appropriate action.

I would have no objection to annotating all the pt_regs-style entry
code, whether by putting it in a separate section or by making a table
of addresses.

I think the easiest way to make it work would be to modify the idtentry
macro to put all the idt entries in a dedicated section.  Then the
unwinder could easily detect any calls from that code.

That would work.  Would it make sense to do the same for the irq entries?

I'd be glad to review a patch.  It should be straightforward.

quoted

There are a couple of nasty cases if NMI or MCE is involved but, as of
4.6, outside of NMI, MCE, and vmalloc faults (ugh!), there should
always be a complete pt_regs on the stack before interrupts get
enabled for each entry.  Of course, finding the thing may be
nontrivial in case other things were pushed.

NMI, MCE and interrupts aren't a problem because they have dedicated
stacks, which are easy to detect.  If the tasks' stack is on an
exception stack or an irq stack, we consider it unreliable.

Only on x86_64.

And also, they don't sleep.  The stack of any running task (other than
current) is automatically considered unreliable anyway, since they could
be modifying it while we're reading it.

True.

quoted

I suppose we could try to rejigger the code so that rbp points to
pt_regs or similar.

I think we should avoid doing something like that because it would break
gdb and all the other unwinders who don't know about it.

How so?

Currently, rbp in the entry code is meaningless.  I'm suggesting that,
when we do, for example, 'call \do_sym' in idtentry, we point rbp to
the pt_regs.  Currently it points to something stale (which the
dump_stack code might be relying on.  Hmm.)  But it's probably also
safe to assume that if you unwind to the 'call \do_sym', then pt_regs
is the next thing on the stack, so just doing the section thing would
work.

We should really re-add DWARF some day.

--Andy

-- 
Andy Lutomirski
AMA Capital Management, LLC

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help