Jochen
Intel recommend add and sub, instead of inc and dec
Got SSE ideas now
Mov ecx,-count
Lea ebx, adress+count
Movaps xmm0, values
L1:
Movaps [ebx+ecx], xmm0
Add ecx, 16
Jne L1
-------------------or
Mov ecx, count
Lea ebx, adress
Movaps xmm0, values
L2: movaps [ebx+ecx], xmm0
Sub ecx, 16
Jne L2
Instead of the usual add ecx, 1 and sub ecx, 1