Thread (14 messages) 14 messages, 4 authors, 2013-07-31

Re: visible memory seems wrong in kexec crash dump kernel

From: Michael Ellerman <hidden>
Date: 2013-07-14 04:36:00
Also in: kexec

On Sat, Jul 13, 2013 at 12:30:50AM -0600, Chris Friesen wrote:
On 07/12/2013 04:59 PM, Chris Friesen wrote:
quoted
On 07/12/2013 03:08 PM, Chris Friesen wrote:
quoted
I turned on the instrumentation in early_init_dt_scan_memory() and got
the following when jumping to the capture kernel:

memory scan node memory, reg size 16, data: 0 0 2 0,
- 0 , 200000000

That 0x200000000 matches the fact that I'm seeing 8GB of memory
available in the recovery kernel.

If I boot the original kernel with "crashkernel=224M@32M", should I
expect that only 224MB is marked as "linux,usable-memory" in the
recovery kernel?
I started looking at the kexec side of things, and I noticed something a
bit odd. In most places dealing with the device tree in kexec it accepts
either "memory" or "memory@" for the memory node name. In
add_usable_mem_property() in arch/ppc64/fs2dt.c it seems to only accept
"memory@".

Is this expected behaviour? It seems to be the same in current git
versions of kexec-tools.

On my system I see "/proc/device-tree/memory".

If I modify add_usable_mem_property() to also accept "/memory" then my
recovery kernel boots up with

physicalMemorySize = 0x10000000

which is 256MB (which is still a bit odd since I specified 224MB for the
crashkernel).

However, it then hits the BUG() call at the end of mark_bootmem() in
mm/bootmem.c.
One final thing and I'll stop replying to myself. :)

It looks like the problem is that some board-specific freescale code
was calling lmb_reserve() with a base address in the 4GB range.  It
seems odd that lmb_reserve() didn't throw some kind of error when
the recovery kernel was supposed to be limited to 224MB.

Rather than try and fix the bug, I turned off the (unneeded) config
options related to the above lmb_reserve() calls and was able to
successfully access the information I needed via /dev/oldmem.

The upshot is that there seems to be a number of things that could
be improved:

1) kexec should accept "/memory" and not just "/memory@"
Yeah probably, I think folks tend to use "/memory@0" even if they only
have a single memory node. But that does sound like a bug in kexec.
2) lmb_reserve() should really respect the crashkernel memory limit
It's been replaced in mainline, so you'd have to check what the current
code does in that situation.
3) the freescale stuff really shouldn't assume it can map things
wherever it feels like
Agreed :)

cheers
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help