Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU... | netdev


Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting

From: David Miller <davem@davemloft.net>
Date: 2012-12-03 17:25:07

From: Jesper Dangaard Brouer <redacted>
Date: Mon, 03 Dec 2012 15:02:41 +0100

On Thu, 2012-11-29 at 09:06 -0800, Eric Dumazet wrote:

quoted

On Thu, 2012-11-29 at 17:13 +0100, Jesper Dangaard Brouer wrote:

quoted

The major performance bottleneck on NUMA systems is the mem limit
counter, which is based on an atomic counter.  This patch removes the
cache-bouncing of the atomic counter by moving this accounting to be
bound to each CPU.  The LRU list also needs to be maintained per CPU,
in order to keep the accounting straight.

If fragments belonging together are "sprayed" across CPUs, performance
will still suffer, but due to NIC rxhashing this is not very common.
Correct accounting in this situation is maintained by recording and
"assigning" a CPU to a frag queue when it is allocated (triggered by the
first packet associated with the queue).

[...]

quoted

+/* Need to maintain these resource limits per CPU, else we will kill
+ * performance due to cache-line bouncing
+ */
+struct frag_cpu_limit {
+	atomic_t                mem;
+	struct list_head        lru_list;
+	spinlock_t              lru_lock;
+} ____cacheline_aligned_in_smp;
+
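The accounting rule described in the commit message (charge every fragment to the CPU that allocated the queue, regardless of where later fragments arrive) can be sketched as a small single-threaded userspace model. This is only an illustration, not the patch's code: the `*_model` types, the `cpu_owner` field, `NR_CPUS`, and the `fq_*` helpers are hypothetical names, and the plain `long` counter stands in for the `atomic_t` and spinlock that the real struct needs because other CPUs may touch it.

```c
#include <assert.h>

#define NR_CPUS 4	/* hypothetical CPU count for this model */

/* Simplified stand-in for the per-CPU limit struct in the patch. */
struct frag_cpu_limit_model {
	long		mem;		/* bytes charged to this CPU */
	unsigned int	nqueues;	/* stands in for the LRU list length */
};

/* Simplified frag queue that remembers which CPU it is accounted on. */
struct frag_queue_model {
	int	cpu_owner;	/* hypothetical field: CPU assigned at allocation */
	long	truesize;	/* bytes currently held by this queue */
};

static struct frag_cpu_limit_model limits[NR_CPUS];

/* The first fragment allocates the queue and binds it to the current CPU. */
static void fq_alloc(struct frag_queue_model *q, int this_cpu)
{
	assert(this_cpu >= 0 && this_cpu < NR_CPUS);
	q->cpu_owner = this_cpu;
	q->truesize = 0;
	limits[this_cpu].nqueues++;
}

/* Later fragments are charged to the owner, not to the CPU they arrive on. */
static void fq_add_frag(struct frag_queue_model *q, long truesize)
{
	q->truesize += truesize;
	limits[q->cpu_owner].mem += truesize;
}

/* Freeing the queue uncharges the same CPU that was charged. */
static void fq_free(struct frag_queue_model *q)
{
	limits[q->cpu_owner].mem -= q->truesize;
	limits[q->cpu_owner].nqueues--;
	q->truesize = 0;
}
```

In this model a queue allocated on CPU 1 keeps all of its bytes in `limits[1]`, even if a later fragment is processed on CPU 3. That keeps the per-CPU totals consistent, at the cost of a remote cache-line access in the "sprayed" case.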

This looks like a big patch introducing a specific infrastructure, while
we already have lib/percpu_counter.c

For the record, I cannot use the lib/percpu_counter, because this
accounting is not kept strictly per CPU, if the fragments are "sprayed"
across CPUs (as described in the commit message above).

The percpu infrastructure allows precise counts and comparisons even
in that case.  It uses the cheap test when possible, and defers to a
more expensive test when necessary.
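
The "cheap test, then expensive test" idea can be illustrated with a single-threaded userspace model, loosely patterned on how lib/percpu_counter.c compares a counter against a limit. It is a sketch under stated assumptions, not the kernel implementation: the `model_*` names, `NR_CPUS`, and `BATCH` are hypothetical, there is no locking, and the `used_slow` out-parameter exists only to show which path was taken.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

#define NR_CPUS	4	/* hypothetical CPU count for this model */
#define BATCH	32	/* hypothetical per-CPU batch size */

struct pcpu_counter_model {
	int64_t	count;			/* shared, approximate total */
	int32_t	delta[NR_CPUS];		/* per-CPU amounts not yet folded in */
};

/* Add on one CPU; touch the shared count only when the local delta hits BATCH. */
static void model_add(struct pcpu_counter_model *c, int cpu, int32_t amount)
{
	int32_t d;

	assert(cpu >= 0 && cpu < NR_CPUS);
	d = c->delta[cpu] + amount;
	if (d >= BATCH || d <= -BATCH) {
		c->count += d;
		c->delta[cpu] = 0;
	} else {
		c->delta[cpu] = d;
	}
}

/* Expensive path: fold in every per-CPU delta to get the exact value. */
static int64_t model_sum(const struct pcpu_counter_model *c)
{
	int64_t sum = c->count;

	for (int cpu = 0; cpu < NR_CPUS; cpu++)
		sum += c->delta[cpu];
	return sum;
}

/* Returns -1, 0 or 1, like a three-way compare of the counter against rhs. */
static int model_compare(const struct pcpu_counter_model *c, int64_t rhs,
			 int *used_slow)
{
	int64_t approx = c->count;
	int64_t exact;

	/* Cheap test: the total error is below BATCH * NR_CPUS, so a larger
	 * gap means the approximate count already decides the answer. */
	*used_slow = 0;
	if (llabs((long long)(approx - rhs)) > (long long)BATCH * NR_CPUS)
		return approx > rhs ? 1 : -1;

	/* Too close to call: fall back to the precise sum. */
	*used_slow = 1;
	exact = model_sum(c);
	if (exact > rhs)
		return 1;
	if (exact < rhs)
		return -1;
	return 0;
}
```

With this shape, a limit check far from the threshold reads only the shared count, and the per-CPU deltas are summed only when the approximate value falls within `BATCH * NR_CPUS` of the limit.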
