Thread (47 messages) 47 messages, 9 authors, 2013-11-04

Re: [PATCH] x86: Run checksumming in parallel accross multiple alu's

From: Neil Horman <nhorman@tuxdriver.com>
Date: 2013-11-01 19:59:08
Also in: lkml

On Fri, Nov 01, 2013 at 12:45:29PM -0700, Joe Perches wrote:
On Fri, 2013-11-01 at 13:37 -0400, Neil Horman wrote:
quoted
I think it would be better if we just did the prefetch here
and re-addressed this area when AVX (or addcx/addox) instructions were available
for testing on hardware.
Could there be a difference if only a single software
prefetch was done at the beginning of transfer before
the while loop and hardware prefetches did the rest?
I wouldn't think so.  If hardware was going to do any prefetching based on
memory access patterns it will do so regardless of the leading prefetch, and
that first prefetch isn't helpful because we still wind up stalling on the adds
while its completing
Neil

--
To unsubscribe from this list: send the line "unsubscribe netdev" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help