[PATCH 0/4] memcpy optimized with strd/ldrd

From: Boojin Kim <hidden>
Date: 2012-04-26 07:35:40

Nicolas Pitre wrote:
Sent: Tuesday, April 03, 2012 11:49 PM
To: Boojin Kim
Cc: linux-arm-kernel at lists.infradead.org
Subject: RE: [PATCH 0/4] memcpy optimized with strd/ldrd

On Tue, 3 Apr 2012, Boojin Kim wrote:
Nicolas Pitre wrote:
Here's my version.  Lightly tested.
I have no A15 hardware to run any performance comparison though.
I'm reviewing and testing your patch, but other work is delaying the review.
I will give you feedback within this week.
Please wait a little longer.
And thanks for your patches. :)
FYI, it occurred to me that some corner cases might not be quite right
with regards to alignment for the STRD instruction.  It seems that the
hardware on which I tested it (Marvell Dove CPU) apparently copes with
misaligned STRDs when they're still 32-bit aligned.  So I need to run
this code through a real validation harness on different hardware.
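
As an illustration of what such a validation harness could look like, here
is a minimal user-space sketch in C. It is not part of the patches under
discussion: the names check_copy_alignments, copy_fn, MAX_OFFSET, MAX_LEN
and GUARD are hypothetical, and it assumes the routine under test can be
called through a memcpy-compatible function pointer. It tries every source
and destination offset within a 64-bit doubleword against a range of short
lengths, and checks both the copied bytes and the guard bytes around the
destination.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

#define MAX_OFFSET 8   /* every byte position within a 64-bit doubleword */
#define MAX_LEN    64  /* short copies, where head/tail handling dominates */
#define GUARD      16  /* canary bytes on each side of the destination */
#define CANARY     0xA5

/* Hypothetical type: any routine with memcpy's calling convention. */
typedef void *(*copy_fn)(void *dst, const void *src, size_t n);

/* Returns the number of (src offset, dst offset, length) cases that failed. */
static unsigned check_copy_alignments(copy_fn copy)
{
    static unsigned char src[MAX_OFFSET + MAX_LEN];
    static unsigned char dst[GUARD + MAX_OFFSET + MAX_LEN + GUARD];
    unsigned failures = 0;

    assert(copy != NULL);

    /* Source pattern stays below 0x80, so it never equals CANARY. */
    for (size_t i = 0; i < sizeof(src); i++)
        src[i] = (unsigned char)(i & 0x7f);

    for (size_t s_off = 0; s_off < MAX_OFFSET; s_off++) {
        for (size_t d_off = 0; d_off < MAX_OFFSET; d_off++) {
            for (size_t len = 0; len <= MAX_LEN; len++) {
                size_t start = GUARD + d_off;
                int bad = 0;

                memset(dst, CANARY, sizeof(dst));
                copy(dst + start, src + s_off, len);

                /* Every copied byte must match the source. */
                for (size_t i = 0; i < len; i++)
                    if (dst[start + i] != src[s_off + i])
                        bad = 1;

                /* Every byte outside the copied range must be untouched. */
                for (size_t i = 0; i < sizeof(dst); i++) {
                    if (i >= start && i < start + len)
                        continue;
                    if (dst[i] != CANARY)
                        bad = 1;
                }
                failures += bad;
            }
        }
    }
    return failures;
}
```

Calling check_copy_alignments(memcpy) should return 0 for a correct
implementation. A pass on one CPU says nothing about a core that faults on
accesses another one tolerates, so the same run would be needed on each
target.
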
It's sad, but the performance result wasn't better after applying your patch.
I think something in patches 1~3 causes the performance degradation.
If you could identify which patch is responsible that would be helpful.
Sorry for the late response. I've been very busy these days. Y_Y
I checked your patches, and the 1st patch causes the performance drop.
The transmit time for a 4KB memcpy is 489ns. After applying the 1st patch, the transmit time is 578ns.
Performance also drops by about 10% for memcpy of other small sizes.
I hope this is helpful for you.
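
For reference, the 4KB figures above work out to (578 - 489) / 489, or
roughly an 18% increase. A minimal sketch of how such a per-copy time could
be measured and compared follows. It is a user-space approximation under
stated assumptions, not the method used for the numbers in this mail:
time_memcpy_ns and slowdown_percent are hypothetical helpers, the code times
the C library memcpy rather than the kernel routine, and it relies on POSIX
clock_gettime and a GCC/Clang-style asm barrier.

```c
#define _POSIX_C_SOURCE 199309L
#include <assert.h>
#include <stddef.h>
#include <string.h>
#include <time.h>

/* Relative slowdown in percent when a copy goes from before_ns to after_ns. */
static double slowdown_percent(double before_ns, double after_ns)
{
    assert(before_ns > 0.0);
    return (after_ns - before_ns) / before_ns * 100.0;
}

/* Average wall-clock time in ns for one memcpy of `size` bytes. */
static double time_memcpy_ns(void *dst, const void *src, size_t size,
                             unsigned iters)
{
    struct timespec t0, t1;
    double elapsed_ns;

    assert(iters > 0);

    clock_gettime(CLOCK_MONOTONIC, &t0);
    for (unsigned i = 0; i < iters; i++) {
        memcpy(dst, src, size);
        /* Compiler barrier (GCC/Clang extension, an assumption here) so
         * the repeated copies are not optimized away. */
        __asm__ volatile("" : : "r"(dst) : "memory");
    }
    clock_gettime(CLOCK_MONOTONIC, &t1);

    elapsed_ns = (double)(t1.tv_sec - t0.tv_sec) * 1e9
               + (double)(t1.tv_nsec - t0.tv_nsec);
    return elapsed_ns / iters;
}
```

Running time_memcpy_ns on two 4096-byte buffers before and after a change,
then passing both averages to slowdown_percent, gives a figure comparable in
form to the one quoted above. Absolute values will differ with cache state,
CPU frequency and the memcpy implementation being measured.
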
Thanks,
Thanks.


Nicolas
