Re: ppoll() stuck on POLLIN while TCP peer is sending
From: Eric Dumazet <hidden>
Date: 2013-01-09 02:32:33
Also in:
linux-mm, lkml
On Tue, 2013-01-08 at 18:14 -0800, Eric Dumazet wrote:
On Tue, 2013-01-08 at 23:23 +0000, Eric Wong wrote:quoted
Mel Gorman [off-list ref] wrote:quoted
Please try the following patch. However, even if it works the benefit of capture may be so marginal that partially reverting it and simplifying compaction.c is the better decision.I already got my VM stuck on this one. I had two twosleepy instances, 2774 was the one that got stuck (also confirmed by watching top). Btw, have you been able to reproduce this on your end? I think the easiest reproduction on my 2-core VM is by running 2 twosleepy processes and doing the following to dirty a lot of pages:Given the persistent sk_stream_wait_memory() traces I suspect a plain TCP bug, triggered by some extra wait somewhere. Please mm guys don't spend too much time right now, I'll try to reproduce the problem. Don't be confused by sk_stream_wait_memory() name. A thread is stuck here because TCP stack is failing to wake it.
Hmm, it seems sk_filter() can return -ENOMEM because skb has the
pfmemalloc() set.
It seems nobody really tested this stuff under memory stress.
Mel, it looks like you are the guy who could fix this, after all ;)
One TCP socket keeps retransmitting an SKB via loopback, and TCP stack
drops the packet again and again.
commit c93bdd0e03e848555d144eb44a1f275b871a8dd5
Author: Mel Gorman [off-list ref]
Date: Tue Jul 31 16:44:19 2012 -0700
netvm: allow skb allocation to use PFMEMALLOC reserves
Change the skb allocation API to indicate RX usage and use this to fall
back to the PFMEMALLOC reserve when needed. SKBs allocated from the
reserve are tagged in skb->pfmemalloc. If an SKB is allocated from the
reserve and the socket is later found to be unrelated to page reclaim, the
packet is dropped so that the memory remains available for page reclaim.
Network protocols are expected to recover from this packet loss.
[a.p.zijlstra@chello.nl: Ideas taken from various patches]
[davem@davemloft.net: Use static branches, coding style corrections]
[sebastian@breakpoint.cc: Avoid unnecessary cast, fix !CONFIG_NET build]
Signed-off-by: Mel Gorman [off-list ref]
Acked-by: David S. Miller [off-list ref]
Cc: Neil Brown [off-list ref]
Cc: Peter Zijlstra [off-list ref]
Cc: Mike Christie [off-list ref]
Cc: Eric B Munson [off-list ref]
Cc: Eric Dumazet [off-list ref]
Cc: Sebastian Andrzej Siewior [off-list ref]
Cc: Mel Gorman [off-list ref]
Cc: Christoph Lameter [off-list ref]
Signed-off-by: Andrew Morton [off-list ref]
Signed-off-by: Linus Torvalds [off-list ref]
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>