Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues

[net-next PATCH V2 0/9] net: fragmentation performance scalability on NUMA/SMP systems · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Stephen Hemminger <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-01
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Stephen Hemminger <hidden> · 2012-12-01
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
[net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-04
RE: [net-next PATCH V3-evictor] net: frag evictor,avoid killing warm frag queues · David Laight <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-05
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Florian Westphal <fw@strlen.de> · 2012-12-06
RE: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · David Laight <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · David Miller <davem@davemloft.net> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
[net-next PATCH V2 2/9] net: frag cache line adjust inet_frag_queue.net · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 4/9] net: frag helper functions for mem limit tracking · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Jesper Dangaard Brouer <hidden> · 2012-12-03
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · David Miller <davem@davemloft.net> · 2012-12-03
[net-next PATCH V2 7/9] net: frag, move nqueues counter under LRU lock protection · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 6/9] net: frag, implement dynamic percpu alloc of frag_cpu_limit · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Jesper Dangaard Brouer <hidden> · 2012-11-30
[net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Jesper Dangaard Brouer <hidden> · 2012-11-29
RE: [net-next PATCH V2 9/9] net: increase frag queue hash size andcache-line · David Laight <hidden> · 2012-11-29
Re: [net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Jesper Dangaard Brouer <hidden> · 2012-11-29

From: Jesper Dangaard Brouer <hidden>
Date: 2012-12-06 12:26:46

On Wed, 2012-12-05 at 10:24 +0100, Jesper Dangaard Brouer wrote:

The previous evictor patch of letting new fragments enter, worked
amazingly well.  But I suspect, this might also be related to a
bug/problem in the evictor loop (which were being hidden by that
patch).

The evictor loop does not contain a bug, just a SMP scalability issue
(which is fixed by later patches).  The first evictor patch, which
does not let new fragments enter, only worked amazingly well because
its hiding this (and other) scalability issues, and implicit allowing
frags already "in" to exceed the mem usage for 1 jiffie.  Thus,
invalidating the patch, as the improvement were only a side effect.

My new *theory* is that the evictor loop, will be looping too much, if
it finds a fragment which is INET_FRAG_COMPLETE ... in that case, we
don't advance the LRU list, and thus will pickup the exact same
inet_frag_queue again in the loop... to get out of the loop we need
another CPU or packet to change the LRU list for us... I'll test that
theory... (its could also be CPUs fighting over the same LRU head
element that cause this) ... more to come...

The above theory does happen, but does not cause excessive looping.
The CPUs are just fighting about who gets to free the inet_frag_queue
and who gets to unlink it from its data structures (I guess, resulting
cache bouncing between CPUs).

CPUs are fighting for the same LRU head (inet_frag_queue) element,
which is bad for scalability.  We could fix this by unlinking the
element once a CPU graps it, but it would require us to change a
read_lock to a write_lock, thus we might not gain much performance.

I already (implicit) fix this is a later patch, where I'm moving the
LRU lists to be per CPU.  So, I don't know if it's worth fixing.


(And yes, I'm using thresh 4Mb/3Mb as my default setting now, but I'm
also experimenting with other thresh sizes)

p.s. Thank you Eric for being so persistent, so I realized this patch
were not good.  We can hopefully now, move on to the other patches,
which fixes the real scalability issues.

--Jesper

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help