Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction

(off-list ancestor, not in this archive)
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · "David S. Miller" <davem@davemloft.net> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · Christoph Lameter <hidden> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · "David S. Miller" <davem@davemloft.net> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · Christoph Lameter <hidden> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · "David S. Miller" <davem@davemloft.net> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · Christoph Lameter <hidden> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · "David S. Miller" <davem@davemloft.net> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · Christoph Lameter <hidden> · 2005-06-24
Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction · "David S. Miller" <davem@davemloft.net> · 2005-06-24
[PATCH] bugfix and scalability changes in net/ipv4/route.c · Eric Dumazet <hidden> · 2005-06-24
Re: [PATCH] bugfix and scalability changes in net/ipv4/route.c · "David S. Miller" <davem@davemloft.net> · 2005-06-28

From: "David S. Miller" <davem@davemloft.net>
Date: 2005-06-24 06:17:45
Also in: lkml

From: Christoph Lameter <redacted>
Date: Thu, 23 Jun 2005 23:03:45 -0700 (PDT)

Ok. Then we are done. With 58 Itanium processors and 200G RAM I get
more than 10% improvement ;-). With 500 tasks we have 453 vs. 499 j/m/t.
That is 9.21%. For 300 tasks we have 9.4% etc. I am sure that I can push
this some more with bigger counts of processors and also some other NUMA
related performance issues.
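
The two percentages quoted above ("more than 10%" and "9.21%") can both be derived from the same pair of figures, depending on which one is used as the denominator. The sketch below only redoes that arithmetic. It assumes that 453 is the unpatched result and 499 the patched one, which the message does not state explicitly; the function names are illustrative and not part of the patch.

```python
def gain_over_baseline(baseline: float, patched: float) -> float:
    """Improvement as a percentage of the (assumed) unpatched figure."""
    return (patched - baseline) / baseline * 100.0


def gain_over_patched(baseline: float, patched: float) -> float:
    """Same difference, expressed as a percentage of the patched figure."""
    return (patched - baseline) / patched * 100.0


# Figures quoted for the 500-task run (j/m/t).
# Assumption: 453 is without the patch, 499 is with it.
BASELINE_JMT = 453
PATCHED_JMT = 499

vs_baseline = gain_over_baseline(BASELINE_JMT, PATCHED_JMT)
vs_patched = gain_over_patched(BASELINE_JMT, PATCHED_JMT)

print(f"relative to 453: {vs_baseline:.2f}%")
print(f"relative to 499: {vs_patched:.2f}%")
```

Dividing the difference of 46 by 453 gives a little over 10%, while dividing by 499 gives roughly 9.2%, so the two quoted numbers are consistent with each other under this reading.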

So it took 7 times more processors to increase the performance gain by
just over 3 on a microscopic synthetic benchmark.  That's not
impressive at all.

And you still haven't shown what happens for the workloads I
suggested.  A web benchmark, with say a thousand unique clients, would
be sufficient for one of those btw.  That case has very low dst
locality, yet dsts are useful because you'll have about 2 or 3
concurrent connections per dst.
