Thread (10 messages) 10 messages, 3 authors, 2012-08-30

Re: BUG: soft lockup - CPU#6 stuck for 22s! [httpd2-event:15597]

From: Neal Cardwell <ncardwell@google.com>
Date: 2012-08-30 01:37:46

On Mon, Aug 27, 2012 at 12:06 AM, Eric Dumazet [off-list ref] wrote:
Indeed, so the first patch was better...

Not sure I can investigate this problem this week, as I attend LKS/LPC
in San Diego.

Could be that snd_cwnd is zero as well so we have this infinite loop...

        while (tp->snd_cwnd_cnt >= tp->snd_cwnd) {
                tp->snd_cwnd_cnt -= tp->snd_cwnd;
                delta++;
        }
Yes, it seems that one way or another cwnd is getting to zero and
turning this into an infinite loop.

Two questions for you, Cristian, if you have a second:

1) Are you still seeing this problem in your workload? If so, would
you have time to try another small patch to add instrumentation to
track down the cause?

2) Do you happen to run with the tcp_mtu_probing sysctl enabled? I
have been looking for code paths that could cause the cwnd to go to
zero, and that's the first possibility that I've been able to see (in
tcp_mtup_probe_success() it seems like there are corner cases where
snd_cwnd could become zero).

neal
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help