Thread (13 messages) 13 messages, 3 authors, 2008-12-05

Re: raid6's using not the best bandwidth method && raid6 algo is significantly slower in x86_64.

From: "H. Peter Anvin" <hpa@zytor.com>
Date: 2008-11-21 20:15:10

Igor Podlesny wrote:
2008/11/22 H. Peter Anvin [off-list ref]:
[...]
quoted
It sort of makes sense that -Os would break this stuff.  For newer gccs,
it would be better to use SSE2 intrinsics rather than inline assembly.
The problem is that it breaks older gcc.
	Well, #ifdef could be helpful then, couldn't it? :-)
Yes, and that's probably the way to go.

I just tested a version using gcc intrinsics with gcc 4.3, and it is
almost 20% faster than the inline assembly version.  That, plus the fact
that the code is actually readable, makes me really want to figure out
how best to deploy this.

	-hpa
Keyboard shortcuts
hback out one level
jnext message in thread
kprevious message in thread
ldrill in
Escclose help / fold thread tree
?toggle this help