Hi nidud,
here's the output of auto.zip:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (AVX)
----------------------------------------------
-- aligned strings --
 491469 cycles - 10 (  0) 0: crt_memcpy
 889651 cycles - 10 ( 63) 1: movsd - mov eax,ecx
 887273 cycles - 10 ( 63) 2: movsd - push ecx
 355080 cycles - 10 ( 51) 3: movsb
 487046 cycles - 10 (182) 4: SSE
 355990 cycles - 10 (  0) 5: auto
-- unaligned strings --
 490269 cycles - 10 (  0) 0: crt_memcpy
 886259 cycles - 10 ( 63) 1: movsd - mov eax,ecx
 886778 cycles - 10 ( 63) 2: movsd - push ecx
 372520 cycles - 10 ( 51) 3: movsb
 491780 cycles - 10 (182) 4: SSE
 378881 cycles - 10 (  0) 5: auto
-- short strings 15 --
 174897 cycles - 8000 (  0) 0: crt_memcpy
 349626 cycles - 8000 ( 63) 1: movsd - mov eax,ecx
 343812 cycles - 8000 ( 63) 2: movsd - push ecx
 307384 cycles - 8000 ( 51) 3: movsb
  98073 cycles - 8000 (182) 4: SSE
 293479 cycles - 8000 (  0) 5: auto
-- short strings 271 --
 832627 cycles - 8000 (  0) 0: crt_memcpy
 773797 cycles - 8000 ( 63) 1: movsd - mov eax,ecx
 764418 cycles - 8000 ( 63) 2: movsd - push ecx
 586580 cycles - 8000 ( 51) 3: movsb
 279676 cycles - 8000 (182) 4: SSE
 557134 cycles - 8000 (  0) 5: auto
-- short strings 2014 --
 998188 cycles - 4000 (  0) 0: crt_memcpy
2198740 cycles - 4000 ( 63) 1: movsd - mov eax,ecx
2195833 cycles - 4000 ( 63) 2: movsd - push ecx
 935710 cycles - 4000 ( 51) 3: movsb
 961563 cycles - 4000 (182) 4: SSE
 906474 cycles - 4000 (  0) 5: auto
--- ok ---
Gunther