Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking

[RFC PATCH v2 00/18] livepatch: hybrid consistency model · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 02/18] x86/asm/head: use a common function for starting CPUs · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Ingo Molnar <mingo@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-02
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-03
RE: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · David Laight <hidden> · 2016-05-04
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-19
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-19
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-20
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Jiri Kosina <jikos@kernel.org> · 2016-05-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-05-24
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-22
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Andy Lutomirski <luto@amacapital.net> · 2016-06-23
Re: [RFC PATCH v2 05/18] sched: add task flag for preempt IRQ tracking · Josh Poimboeuf <hidden> · 2016-06-23
[RFC PATCH v2 08/18] livepatch: temporary stubs for klp_patch_pending() and klp_patch_task() · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Petr Mladek <pmladek@suse.com> · 2016-05-03
Re: [RFC PATCH v2 10/18] livepatch/powerpc: add TIF_PATCH_PENDING thread flag · Miroslav Benes <mbenes@suse.cz> · 2016-05-03
[RFC PATCH v2 12/18] livepatch/s390: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 14/18] livepatch: remove unnecessary object loaded check · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 18/18] livepatch: add /proc/<pid>/patch_state · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Petr Mladek <pmladek@suse.com> · 2016-05-03
Re: [RFC PATCH v2 13/18] livepatch: separate enabled and patched states · Josh Poimboeuf <hidden> · 2016-05-03
[RFC PATCH v2 16/18] livepatch: store function sizes · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-05
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Peter Zijlstra <peterz@infradead.org> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-09
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: barriers: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-04
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-05
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-06
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-09
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-16
Re: klp_task_patch: was: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-18
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-05-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-09
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-16
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Miroslav Benes <mbenes@suse.cz> · 2016-05-10
Re: livepatch: change to a per-task consistency model · Jessica Yu <hidden> · 2016-05-17
Re: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-18
Re: livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-05-18
Re: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-18
RE: livepatch: change to a per-task consistency model · David Laight <hidden> · 2016-05-23
RE: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-23
RE: livepatch: change to a per-task consistency model · David Laight <hidden> · 2016-05-24
RE: livepatch: change to a per-task consistency model · Jiri Kosina <jikos@kernel.org> · 2016-05-24
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Petr Mladek <pmladek@suse.com> · 2016-06-06
Re: [RFC PATCH v2 17/18] livepatch: change to a per-task consistency model · Josh Poimboeuf <hidden> · 2016-06-06
[RFC PATCH v2 15/18] livepatch: move patching functions into patch.c · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 15/18] livepatch: move patching functions into patch.c · Petr Mladek <pmladek@suse.com> · 2016-05-03
[RFC PATCH v2 11/18] livepatch/s390: reorganize TIF thread flag bits · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 09/18] livepatch/x86: add TIF_PATCH_PENDING thread flag · Josh Poimboeuf <hidden> · 2016-04-29
[RFC PATCH v2 07/18] stacktrace/x86: function for detecting reliable stack traces · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 06/18] x86: dump_trace() error handling · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 06/18] x86: dump_trace() error handling · Minfei Huang <hidden> · 2016-04-29
Re: [RFC PATCH v2 06/18] x86: dump_trace() error handling · Josh Poimboeuf <hidden> · 2016-04-29
[RFC PATCH v2 04/18] x86: move _stext marker before head code · Josh Poimboeuf <hidden> · 2016-04-28
[RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-28
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Brian Gerst <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@kernel.org> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@amacapital.net> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Josh Poimboeuf <hidden> · 2016-04-29
Re: [RFC PATCH v2 03/18] x86/asm/head: standardize the bottom of the stack for idle tasks · Andy Lutomirski <luto@amacapital.net> · 2016-04-30
[RFC PATCH v2 01/18] x86/asm/head: clean up initial stack variable · Josh Poimboeuf <hidden> · 2016-04-28

From: Josh Poimboeuf <hidden>
Date: 2016-05-02 19:44:56
Also in: linux-s390, lkml

On Mon, May 02, 2016 at 11:12:39AM -0700, Andy Lutomirski wrote:

On Mon, May 2, 2016 at 10:31 AM, Josh Poimboeuf [off-list ref] wrote:

quoted

On Mon, May 02, 2016 at 08:52:41AM -0700, Andy Lutomirski wrote:

quoted

On Mon, May 2, 2016 at 6:52 AM, Josh Poimboeuf [off-list ref] wrote:

quoted

On Fri, Apr 29, 2016 at 05:08:50PM -0700, Andy Lutomirski wrote:

quoted

On Apr 29, 2016 3:41 PM, "Josh Poimboeuf" [off-list ref] wrote:

quoted

On Fri, Apr 29, 2016 at 02:37:41PM -0700, Andy Lutomirski wrote:

quoted

On Fri, Apr 29, 2016 at 2:25 PM, Josh Poimboeuf [off-list ref] wrote:

quoted

I suppose we could try to rejigger the code so that rbp points to
pt_regs or similar.

I think we should avoid doing something like that because it would break
gdb and all the other unwinders who don't know about it.

How so?

Currently, rbp in the entry code is meaningless.  I'm suggesting that,
when we do, for example, 'call \do_sym' in idtentry, we point rbp to
the pt_regs.  Currently it points to something stale (which the
dump_stack code might be relying on.  Hmm.)  But it's probably also
safe to assume that if you unwind to the 'call \do_sym', then pt_regs
is the next thing on the stack, so just doing the section thing would
work.

Yes, rbp is meaningless on the entry from user space.  But if an
in-kernel interrupt occurs (e.g. page fault, preemption) and you have
nested entry, rbp keeps its old value, right?  So the unwinder can walk
past the nested entry frame and keep going until it gets to the original
entry.

Yes.

It would be nice if we could do better, though, and actually notice
the pt_regs and identify the entry.  For example, I'd love to see
"page fault, RIP=xyz" printed in the middle of a stack dump on a
crash.

Also, I think that just following rbp links will lose the
actual function that took the page fault (or whatever function
pt_regs->ip actually points to).

Hm.  I think we could fix all that in a more standard way.  Whenever a
new pt_regs frame gets saved on entry, we could also create a new stack
frame which points to a fake kernel_entry() function.  That would tell
the unwinder there's a pt_regs frame without otherwise breaking frame
pointers across the frame.

Then I guess we wouldn't need my other solution of putting the idt
entries in a special section.

How does that sound?

Let me try to understand.

The normal call sequence is call; push %rbp; mov %rsp, %rbp.  So rbp
points to (prev rbp, prev rip) on the stack, and you can follow the
chain back.  Right now, on a user access page fault or similar, we
have rbp (probably) pointing to the interrupted frame, and the
interrupted rip isn't saved anywhere that a naive unwinder can find
it.  (It's in pt_regs, but the rbp chain skips right over that.)

We could change the entry code so that an interrupt / idtentry does:

push pt_regs
push kernel_entry
push %rbp
mov %rsp, %rbp
call handler
pop %rbp
addq $8, %rsp

or similar.  That would make it appear that the actual C handler was
caused by a dummy function "kernel_entry".  Now the unwinder would get
to kernel_entry, but it *still* wouldn't find its way to the calling
frame, which only solves part of the problem.  We could at least teach
the unwinder how kernel_entry works and let it decode pt_regs to
continue unwinding.  This would be nice, and I think it could work.

Yeah, that's about what I had in mind.

FWIW, I just tried this:

static bool is_entry_text(unsigned long addr)
{
    return addr >= (unsigned long)__entry_text_start &&
        addr < (unsigned long)__entry_text_end;
}

it works.  So the entry code is already annotated reasonably well :)

I just hacked it up here:

https://git.kernel.org/cgit/linux/kernel/git/luto/linux.git/commit/?h=stack&id=085eacfe0edfc18768e48340084415dba9a6bd21

and it seems to work, at least for page faults.  A better
implementation would print out the entire contents of pt_regs so that
people reading the stack trace will know the registers at the time of
the exception, which might be helpful.

I still think we would need more specific annotations to do that
reliably: a call from entry code doesn't necessarily correlate with a
pt_regs frame.

quoted

I think I like this, except that, if it used a separate section, it
could potentially be faster, as, for each actual entry type, the
offset from the C handler frame to pt_regs is a foregone conclusion.

Hm, this I don't really follow.  It's true that the unwinder can easily
find RIP from pt_regs, which will always be a known offset from the
kernel_entry pointer on the stack.  But why would having the entry code
in a separate section make that faster?

It doesn't make the unwinder faster -- it makes the entry code faster.

Oh, right.  But I don't think a few extra frame pointer instructions are
much of an issue if you already have CONFIG_FRAME_POINTER enabled.

Anyway I'm not sure which way is better.  I'll think about it.

I hope your plans include rewriting the current stack unwinder
completely.  The thing in print_context_stack is (a)
hard-to-understand and hard-to-modify crap and (b) is called in a loop
from another file using totally ridiculous conventions.

I agree, that code is quite confusing.  I haven't really thought about
how specifically it could be improved or replaced though.

Along those lines, I think it would be awesome if we could have an
arch-independent DWARF unwinder so that most of the stack dumping code
could be shared amongst all the arches.

-- 
Josh

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help