Re: [net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line

[net-next PATCH V2 0/9] net: fragmentation performance scalability on NUMA/SMP systems · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Stephen Hemminger <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-01
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Stephen Hemminger <hidden> · 2012-12-01
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
[net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-04
RE: [net-next PATCH V3-evictor] net: frag evictor,avoid killing warm frag queues · David Laight <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-04
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-05
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Florian Westphal <fw@strlen.de> · 2012-12-06
RE: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · David Laight <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · David Miller <davem@davemloft.net> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-12-06
Re: [net-next PATCH V3-evictor] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-12-06
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Jesper Dangaard Brouer <hidden> · 2012-11-30
Re: [net-next PATCH V2 1/9] net: frag evictor, avoid killing warm frag queues · Eric Dumazet <hidden> · 2012-11-30
[net-next PATCH V2 2/9] net: frag cache line adjust inet_frag_queue.net · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 3/9] net: frag, move LRU list maintenance outside of rwlock · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 4/9] net: frag helper functions for mem limit tracking · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · David Miller <davem@davemloft.net> · 2012-11-29
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · Jesper Dangaard Brouer <hidden> · 2012-12-03
Re: [net-next PATCH V2 5/9] net: frag, per CPU resource, mem limit and LRU list accounting · David Miller <davem@davemloft.net> · 2012-12-03
[net-next PATCH V2 7/9] net: frag, move nqueues counter under LRU lock protection · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 6/9] net: frag, implement dynamic percpu alloc of frag_cpu_limit · Jesper Dangaard Brouer <hidden> · 2012-11-29
[net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Jesper Dangaard Brouer <hidden> · 2012-11-29
Re: [net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 8/9] net: frag queue locking per hash bucket · Jesper Dangaard Brouer <hidden> · 2012-11-30
[net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Jesper Dangaard Brouer <hidden> · 2012-11-29
RE: [net-next PATCH V2 9/9] net: increase frag queue hash size andcache-line · David Laight <hidden> · 2012-11-29
Re: [net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Eric Dumazet <hidden> · 2012-11-29
Re: [net-next PATCH V2 9/9] net: increase frag queue hash size and cache-line · Jesper Dangaard Brouer <hidden> · 2012-11-29

From: Jesper Dangaard Brouer <hidden>
Date: 2012-11-29 20:55:30

On Thu, 2012-11-29 at 08:55 -0800, Eric Dumazet wrote:

On Thu, 2012-11-29 at 17:16 +0100, Jesper Dangaard Brouer wrote:

quoted

Increase frag queue hash size and assure cache-line alignment to
avoid false sharing.  Hash size is set to 256, because I have
observed 206 frag queues in use at 4x10G with packet size 4416 bytes
(three fragments).

[...]

quoted

 struct inet_frag_bucket {
 	struct hlist_head	chain;
 	spinlock_t		chain_lock;
-};
+} ____cacheline_aligned_in_smp;

This is a waste of memory.

Do keep in mind this is only 16 Kbytes (256 * 64 bytes = 16384 bytes).

Most linux powered devices dont care at all about fragments.

Just increase hashsz if you really want, and rely on hash dispersion
to avoid false sharing.

I must agree, that it is perhaps better usage of the memory to just
increase the hashsz (and drop ____cacheline_aligned_in_smp), especially
with the measured performance gain.

You gave no performance results for this patch anyway.

Yes, I did! -- See cover-mail patch 08 vs 09.
But the gain is really too small, to argue for this cache alignment.


Patch-08:
  2x10G size(4416)  result:(5024+4925)= 9949 Mbit/s
                 V2 result:(5140+5206)=10346 Mbit/s

  4x10G size(4416)  result:(4156+4714+4300+3985)=17155 Mbit/s
                 V2 result:(4341+4607+3963+4450)=17361 Mbit/s
                       (gen:6614+5330+7745+5366 =25055 Mbit/s)

Patch-09:
  2x10G size(4416)  result:(5421+5268)=10689 Mbit/s
                 V2 result:(5377+5336)=10713 Mbit/s

  4x10G size(4416) result:(4890+4364+4139+4530)=17923 Mbit/s
                V2 result:(3860+4533+4936+4519)=17848 Mbit/s
                      (gen:5170+6873+5215+7632 =24890 Mbit/s)
  
Improvements Patch 08 -> 09:

 2x10G size(4416):
   RunV1 (10689-9949) =740 Mbit/s
   RunV2 (10713-10346)=367 Mbit/s

 4x10G size(4416):
   RunV1 (17923-17155)=768 Mbit/s
   RunV2 (17848-17361)=487 Mbit/s

Its consistently better performance, but given magnitude the other
improvements, I don't want to argue over "wasting" 16Kbytes kernel
memory.

I have some debug patches for dumping the content of the hash, which
shows that at 4x10G size(4416) three frags, 206 frag queues, cross CPU
collisions occur anyhow.

Lets focus on the other patches instead.

--Jesper

`h`	back out one level
`j`	next message in thread
`k`	previous message in thread
`l`	drill in
`Esc`	close help / fold thread tree
`?`	toggle this help