Thread (20 messages) 20 messages, 6 authors, 2009-11-24

Re: 2.6.31+2.6.31.4: XFS - All I/O locks up to D-state after 24-48 hours (sysrq-t+w available)

From: Dave Chinner <david@fromorbit.com>
Date: 2009-10-22 23:00:20
Also in: linux-xfs, lkml

On Thu, Oct 22, 2009 at 06:49:46PM -0400, Justin Piszcz wrote:
On Wed, 21 Oct 2009, Justin Piszcz wrote:
quoted
On Tue, 20 Oct 2009, Justin Piszcz wrote:
quoted
quoted
It appears that both the xfslogd and the xfsdatad on CPU 0 are in
the running state but don't appear to be consuming any significant
CPU time. If they remain like this then I think that means they are
stuck waiting on the run queue.  Do these XFS threads always appear
like this when the hang occurs? If so, is there something else that
is hogging CPU 0 preventing these threads from getting the CPU?
Yes, the XFS threads show up like this on each time the kernel 
crashed.  So far
with 2.6.30.9 after ~48hrs+ it has not crashed.  So it appears to be 
some issue
between 2.6.30.9 and 2.6.31.x when this began happening.  Any  
recommendations
on how to catch this bug w/certain options enabled/etc?
Uptime with 2.6.30.9:

06:18:41 up 2 days, 14:10, 14 users,  load average: 0.41, 0.21, 0.07

No issues yet, so it first started happening in 2.6.(31).(x).
Ok.
quoted
Any further recommendations on how to debug this issue?  BTW: Do
you view this as an XFS bug or MD/VFS layer issue based on the
logs/output thus far?
Could be either. Nothing so far points at a cause.
Any other ideas?
If it is relatively quick to reproduce, you could run a git bisect
to try to find the offending commit. Or when it has locked up, run
oprofile with callgraph sampling and so we can get an idea of what
is actually running when XFS appears to hang.
Currently stuck on 2.6.30.9.. (no issues, no lockups)-- Box normally has  
no load at all either.. Has anyone else reported similar problems?
Not that I know of.

Cheers,

Dave.
-- 
Dave Chinner
david@fromorbit.com
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help