Re: [PATCH] dst_entry structure use,lastuse and refcnt abstraction
From: "David S. Miller" <davem@davemloft.net>
Date: 2005-06-24 06:17:45
Also in: lkml
From: Christoph Lameter <redacted>
Date: Thu, 23 Jun 2005 23:03:45 -0700 (PDT)

> Ok. Then we are done. With 58 Itanium processors and 200G Ram I get more than 10% improvement ;-). With 500 tasks we have 453 vs. 499 j/m/t. That is 9.21%. For 300 tasks we have 9.4% etc. I am sure that I can push this some more with bigger counts of processors and also some other NUMA related performance issues.
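The 500-task figures quoted above can be turned into a percentage in two ways, depending on which number is used as the denominator. The sketch below is only an illustration: it assumes 453 j/m/t is the unpatched result and 499 j/m/t the patched one, which the message implies but does not state, and the function names are hypothetical.

```c
#include <assert.h>

/*
 * Percentage gain of `patched` over `baseline`, measured relative
 * to the baseline. Assumption: baseline is the unpatched kernel.
 */
double gain_vs_baseline(double baseline, double patched)
{
	assert(baseline > 0.0);
	return (patched - baseline) / baseline * 100.0;
}

/*
 * The same absolute difference, measured relative to the patched
 * result instead. This always yields the smaller of the two figures
 * when patched > baseline.
 */
double gain_vs_patched(double baseline, double patched)
{
	assert(patched > 0.0);
	return (patched - baseline) / patched * 100.0;
}
```

With the quoted numbers, `gain_vs_baseline(453.0, 499.0)` comes to roughly 10.15 and `gain_vs_patched(453.0, 499.0)` to roughly 9.22. The second is close to the 9.21% given in the message, and the first is in line with the "more than 10%" remark, though the message does not say which denominator was actually used.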
So it took 7 times more processors to increase the performance gain by just over 3 on a microscopic synthetic benchmark. That's not impressive at all.

And you still haven't shown what happens for the workloads I suggested. A web benchmark, with say a thousand unique clients, would be sufficient for one of those btw. That case has very low dst locality, yet dsts are useful because you'll have about 2 or 3 concurrent connections per dst.