RE: How to debug a hung multi-core system....
From: Morrison, Tom <hidden>
Date: 2009-05-21 15:22:17
Just had a little conference with several co-workers...to go over
results
We think that LT0 (the one that maps the kernel) has been corrupted:
Entry EPN RPN TID TMASK WIMGE TSIZ U0:3 X0:1
---------------------------------------------------------------
LT0 C0000000 00000000 00 0FF 04 9 0 0
PID TS PROT SHEN UR UW UX SR SW SX TIDZ VAL
---------------------------------------------------------------
0 0 P P E E D E E D D V
Is absolutely wrong - this is TLB for the kernel - and as you can see=20
...it does NOT have execution privileges (and in fact the user space=20
HAS executive privileges for this area (complete opposite of what it=20
should be)...
This is why it is stuck AT that instruction (can't even single step
from that location)..
(one of) The first problem(s) is how can/when did this TLB get
corrupted!
Tom