Spin-off from MemStrategy (http://masm32.com/board/index.php?topic=2515.msg26340#msg26340):
AMD Athlon(tm) Dual Core Processor 4450B (SSE3)
loop overhead is approx. 268/100 cycles
3778 kCycles for 100 * rep stosd
4905 kCycles for 100 * push 0
4890 kCycles for 100 * push edx
3343 kCycles for 100 * movups xmm0
3319 kCycles for 100 * movaps xmm0
3785 kCycles for 100 * rep stosd
4891 kCycles for 100 * push 0
4894 kCycles for 100 * push edx
3457 kCycles for 100 * movups xmm0
3319 kCycles for 100 * movaps xmm0
3785 kCycles for 100 * rep stosd
4891 kCycles for 100 * push 0
4896 kCycles for 100 * push edx
3342 kCycles for 100 * movups xmm0
3320 kCycles for 100 * movaps xmm0
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
22 bytes for movups xmm0
25 bytes for movaps xmm0
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 579/100 cycles
2700 kCycles for 100 * rep stosd
5467 kCycles for 100 * push 0
4888 kCycles for 100 * push edx
4266 kCycles for 100 * movups xmm0
1411 kCycles for 100 * movaps xmm0
2752 kCycles for 100 * rep stosd
4887 kCycles for 100 * push 0
5651 kCycles for 100 * push edx
4262 kCycles for 100 * movups xmm0
1030 kCycles for 100 * movaps xmm0
2699 kCycles for 100 * rep stosd
4892 kCycles for 100 * push 0
4888 kCycles for 100 * push edx
4263 kCycles for 100 * movups xmm0
1744 kCycles for 100 * movaps xmm0
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
22 bytes for movups xmm0
25 bytes for movaps xmm0
Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (SSE4)
loop overhead is approx. 310/100 cycles
2385 kCycles for 100 * rep stosd
4530 kCycles for 100 * push 0
4508 kCycles for 100 * push edx
3932 kCycles for 100 * movups xmm0
871 kCycles for 100 * movaps xmm0
AMD Athlon(tm) II X2 220 Processor (SSE3) 2.80 GHz
loop overhead is approx. 239/100 cycles
2621 kCycles for 100 * rep stosd
4891 kCycles for 100 * push 0
4895 kCycles for 100 * push edx
1666 kCycles for 100 * movups xmm0
1605 kCycles for 100 * movaps xmm0
prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 248/100 cycles
4986 kCycles for 100 * rep stosd
4827 kCycles for 100 * push 0
4991 kCycles for 100 * push edx
6187 kCycles for 100 * movups xmm0
2767 kCycles for 100 * movaps xmm0
5023 kCycles for 100 * rep stosd
4857 kCycles for 100 * push 0
4935 kCycles for 100 * push edx
6207 kCycles for 100 * movups xmm0
2766 kCycles for 100 * movaps xmm0
5023 kCycles for 100 * rep stosd
4855 kCycles for 100 * push 0
4990 kCycles for 100 * push edx
6225 kCycles for 100 * movups xmm0
2765 kCycles for 100 * movaps xmm0
prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 245/100 cycles
5107 kCycles for 100 * rep stosd
4844 kCycles for 100 * push 0
4902 kCycles for 100 * push edx
6153 kCycles for 100 * movups xmm0
2827 kCycles for 100 * movaps xmm0
2815 kCycles for 100 * rep stosd
5111 kCycles for 100 * rep stosd
4873 kCycles for 100 * push 0
4887 kCycles for 100 * push edx
6150 kCycles for 100 * movups xmm0
2795 kCycles for 100 * movaps xmm0
2782 kCycles for 100 * rep stosd
5053 kCycles for 100 * rep stosd
4892 kCycles for 100 * push 0
4850 kCycles for 100 * push edx
6179 kCycles for 100 * movups xmm0
2767 kCycles for 100 * movaps xmm0
2827 kCycles for 100 * rep stosd
Jochen,
here are the results from an old computer (located in a university laboratory). The other tests from my machine at home will come this evening.
AMD Athlon(tm) Dual Core Processor 5000B (SSE3)
loop overhead is approx. 239/100 cycles
3779 kCycles for 100 * rep stosd
4897 kCycles for 100 * push 0
4901 kCycles for 100 * push edx
3344 kCycles for 100 * movups xmm0
3347 kCycles for 100 * movaps xmm0
3774 kCycles for 100 * rep stosd
4897 kCycles for 100 * push 0
4899 kCycles for 100 * push edx
3343 kCycles for 100 * movups xmm0
3341 kCycles for 100 * movaps xmm0
3778 kCycles for 100 * rep stosd
4897 kCycles for 100 * push 0
4901 kCycles for 100 * push edx
3344 kCycles for 100 * movups xmm0
3331 kCycles for 100 * movaps xmm0
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
22 bytes for movups xmm0
25 bytes for movaps xmm0
--- ok ---
Gunther
Mobile Intel(R) Celeron(R) processor 600MHz (SSE2)
loop overhead is approx. 211/100 cycles
7356 kCycles for 100 * rep stosd
4902 kCycles for 100 * push 0
4902 kCycles for 100 * push edx
3059 kCycles for 100 * movups xmm0
2312 kCycles for 100 * movaps xmm0
2207 kCycles for 100 * rep stosd
7358 kCycles for 100 * rep stosd
4905 kCycles for 100 * push 0
4897 kCycles for 100 * push edx
3064 kCycles for 100 * movups xmm0
2303 kCycles for 100 * movaps xmm0
2212 kCycles for 100 * rep stosd
7372 kCycles for 100 * rep stosd
4913 kCycles for 100 * push 0
4901 kCycles for 100 * push edx
3063 kCycles for 100 * movups xmm0
2303 kCycles for 100 * movaps xmm0
2214 kCycles for 100 * rep stosd
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
22 bytes for movups xmm0
25 bytes for movaps xmm0
17 bytes for rep stosd
--- ok ---
Thanks to everybody :icon14:
Quote from: nidud on October 25, 2013, 10:18:42 PM
I cleaned up the rep stosd function a bit
I appreciate your good intentions, Nidud. Put it under TestA, just for fun ;)
(hint: look at this thread's title)
Northwood w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE2)
++18 of 20 tests valid, loop overhead is approx. 309/100 cycles
4910 kCycles for 100 * rep stosd
4902 kCycles for 100 * push 0
4904 kCycles for 100 * push edx
4904 kCycles for 100 * movups xmm0
2144 kCycles for 100 * movaps xmm0
4910 kCycles for 100 * rep stosd
5130 kCycles for 100 * push 0
4901 kCycles for 100 * push edx
4893 kCycles for 100 * movups xmm0
2140 kCycles for 100 * movaps xmm0
4911 kCycles for 100 * rep stosd
4909 kCycles for 100 * push 0
4903 kCycles for 100 * push edx
4895 kCycles for 100 * movups xmm0
2150 kCycles for 100 * movaps xmm0
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
22 bytes for movups xmm0
25 bytes for movaps xmm0
i try to avoid STD's :lol:
in fact, i have gotten to where i don't use them at all
if i have to move things in that direction, i write a discrete loop
in this case, you could probe, then clear, one page at a time
something like this...
ASSUME FS:Nothing
mov edx,esp
mov fs:[700h],edi
xor eax,eax
sub edx,<NumberOfBytesRequiredPlus3Mod4>
.repeat
push eax
mov ecx,esp
mov esp,fs:[8]
sub ecx,esp
shr ecx,2
.if !ZERO
mov edi,esp
rep stosd
.endif
.until edx>=esp
mov edi,fs:[700h]
mov esp,edx
ASSUME FS:ERROR
this is a simpler version...
ASSUME FS:Nothing
mov edx,esp
mov ecx,esp
sub edx,<NumberOfBytesRequiredPlus3Mod4>
.repeat
push eax
mov esp,fs:[8]
.until edx>=esp
sub ecx,edx
xchg edx,edi
shr ecx,2
xor eax,eax
mov esp,edi
rep stosd
mov edi,edx
ASSUME FS:ERROR
Quote from: nidud on October 26, 2013, 01:47:07 AM
Quote from: jj2007 on October 26, 2013, 12:52:01 AM
Put it under TestA, just for fun ;)
the intention should at best be educational :P
having both of them will illustrate the penalty of manipulating the flags on different CPU's
Yes, that's true. Although it seems it's not the flag setting but rather the "wrong" direction that makes rep stosd slow.
Attached a new version with a modified StackBuffer() macro. Your algo is "rep stosd up" ;-)
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 190/100 cycles
4601 kCycles for 100 * rep stosd
4893 kCycles for 100 * push 0
4890 kCycles for 100 * push edx
2151 kCycles for 100 * StackBuffer (with zeroing)
2141 kCycles for 100 * movaps xmm0
1701 kCycles for 100 * rep stosd up
4592 kCycles for 100 * rep stosd
4891 kCycles for 100 * push 0
4894 kCycles for 100 * push edx
2142 kCycles for 100 * StackBuffer (with zeroing)
2141 kCycles for 100 * movaps xmm0
1697 kCycles for 100 * rep stosd up
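The point about direction can be sketched in C (this sketch is mine, not from the thread): CLD + rep stosd stores ascending through memory, STD + rep stosd stores descending. The result is identical; only the store order differs, and it is the descending order that modern CPUs penalize.

```c
#include <stddef.h>
#include <stdint.h>
#include <assert.h>

/* Ascending fill, like CLD + rep stosd */
static void fill_up(uint32_t *buf, size_t n, uint32_t v)
{
    for (size_t i = 0; i < n; i++)
        buf[i] = v;
}

/* Descending fill, like STD + rep stosd */
static void fill_down(uint32_t *buf, size_t n, uint32_t v)
{
    for (size_t i = n; i-- > 0; )
        buf[i] = v;
}
```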
prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 263/100 cycles
4957 kCycles for 100 * rep stosd
4902 kCycles for 100 * push 0
4931 kCycles for 100 * push edx
2934 kCycles for 100 * StackBuffer (with zeroing)
3006 kCycles for 100 * movaps xmm0
3035 kCycles for 100 * rep stosd up
4956 kCycles for 100 * rep stosd
4982 kCycles for 100 * push 0
4869 kCycles for 100 * push edx
2777 kCycles for 100 * StackBuffer (with zeroing)
2840 kCycles for 100 * movaps xmm0
2823 kCycles for 100 * rep stosd up
4954 kCycles for 100 * rep stosd
4863 kCycles for 100 * push 0
4940 kCycles for 100 * push edx
2799 kCycles for 100 * StackBuffer (with zeroing)
2801 kCycles for 100 * movaps xmm0
2779 kCycles for 100 * rep stosd up
4993 kCycles for 100 * rep stosd
4908 kCycles for 100 * push 0
4940 kCycles for 100 * push edx
2767 kCycles for 100 * StackBuffer (with zeroing)
2811 kCycles for 100 * movaps xmm0
2790 kCycles for 100 * rep stosd up
5074 kCycles for 100 * rep stosd
4911 kCycles for 100 * push 0
5082 kCycles for 100 * push edx
2860 kCycles for 100 * StackBuffer (with zeroing)
2767 kCycles for 100 * movaps xmm0
2779 kCycles for 100 * rep stosd up
5073 kCycles for 100 * rep stosd
4907 kCycles for 100 * push 0
5175 kCycles for 100 * push edx
2826 kCycles for 100 * StackBuffer (with zeroing)
2796 kCycles for 100 * movaps xmm0
2801 kCycles for 100 * rep stosd up
the last 3 are more-or-less the same on a P4
Quote from: dedndave on October 26, 2013, 09:44:49 AM
prescott w/htt
...
the last 3 are more-or-less the same on a P4
Yes, they look similar on older CPUs. The i7s behave quite differently.
For probing only (no zeroing), StackBuffer is more than twice as fast.
P.S.: Jeri's CastAR looks really impressive :t
Northwood w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE2)
++18 of 20 tests valid, loop overhead is approx. 331/100 cycles
4915 kCycles for 100 * rep stosd
4928 kCycles for 100 * push 0
4911 kCycles for 100 * push edx
2141 kCycles for 100 * StackBuffer (with zeroing)
2153 kCycles for 100 * movaps xmm0
2326 kCycles for 100 * rep stosd up
4912 kCycles for 100 * rep stosd
4905 kCycles for 100 * push 0
4901 kCycles for 100 * push edx
2140 kCycles for 100 * StackBuffer (with zeroing)
2141 kCycles for 100 * movaps xmm0
2199 kCycles for 100 * rep stosd up
4907 kCycles for 100 * rep stosd
4892 kCycles for 100 * push 0
4907 kCycles for 100 * push edx
2141 kCycles for 100 * StackBuffer (with zeroing)
2141 kCycles for 100 * movaps xmm0
2212 kCycles for 100 * rep stosd up
My laptop:
AMD A8-3520M APU with Radeon(tm) HD Graphics (SSE3)
loop overhead is approx. 495/100 cycles
4656 kCycles for 100 * rep stosd
8331 kCycles for 100 * push 0
7636 kCycles for 100 * push edx
2845 kCycles for 100 * StackBuffer (with zeroing)
2711 kCycles for 100 * movaps xmm0
2585 kCycles for 100 * rep stosd up
3488 kCycles for 100 * rep stosd
5722 kCycles for 100 * push 0
5240 kCycles for 100 * push edx
1712 kCycles for 100 * StackBuffer (with zeroing)
1650 kCycles for 100 * movaps xmm0
1600 kCycles for 100 * rep stosd up
2200 kCycles for 100 * rep stosd
4188 kCycles for 100 * push 0
4251 kCycles for 100 * push edx
1715 kCycles for 100 * StackBuffer (with zeroing)
1565 kCycles for 100 * movaps xmm0
1580 kCycles for 100 * rep stosd up
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
29 bytes for StackBuffer (with zeroing)
25 bytes for movaps xmm0
17 bytes for rep stosd up
--- ok ---
A question: what about unrolling the xmm loop and using 8 movdqa's to reduce the loop overhead?
Dave.
Quote from: KeepingRealBusy on October 26, 2013, 01:42:11 PM
A question: what about unrolling the xmm loop and using 8 movdqa's to reduce the loop overhead?
Why not ;-)
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 188/100 cycles
4725 kCycles for 100 * rep stosd
2683 kCycles for 100 * HeapAlloc (*8)
2202 kCycles for 100 * StackBuffer (with zeroing)
2887 kCycles for 100 * StackBuffer (unrolled)
2207 kCycles for 100 * movaps xmm0
1746 kCycles for 100 * rep stosd up
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
StackBuffer2b.exe doesn't work with Windows 8.1.
StackBuffer2.exe works OK:
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 579/100 cycles
2712 kCycles for 100 * rep stosd
4887 kCycles for 100 * push 0
4885 kCycles for 100 * push edx
1686 kCycles for 100 * StackBuffer (with zeroing)
923 kCycles for 100 * movaps xmm0
1027 kCycles for 100 * rep stosd up
2609 kCycles for 100 * rep stosd
5312 kCycles for 100 * push 0
4887 kCycles for 100 * push edx
943 kCycles for 100 * StackBuffer (with zeroing)
1989 kCycles for 100 * movaps xmm0
1086 kCycles for 100 * rep stosd up
2608 kCycles for 100 * rep stosd
5588 kCycles for 100 * push 0
5649 kCycles for 100 * push edx
971 kCycles for 100 * StackBuffer (with zeroing)
1654 kCycles for 100 * movaps xmm0
981 kCycles for 100 * rep stosd up
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
29 bytes for StackBuffer (with zeroing)
25 bytes for movaps xmm0
17 bytes for rep stosd up
--- ok ---
Quote from: Siekmanski on October 26, 2013, 05:57:10 PM
StackBuffer2b.exe doesn't work with windows 8.1
Oops, it seems I uploaded a version with a nice little int 3 inside. Try 2c above...
Apologies :redface:
StackBuffer2c :t
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 576/100 cycles
2674 kCycles for 100 * rep stosd
1810 kCycles for 100 * HeapAlloc (*8 )
940 kCycles for 100 * StackBuffer (with zeroing)
3376 kCycles for 100 * StackBuffer (unrolled)
990 kCycles for 100 * movaps xmm0
1804 kCycles for 100 * rep stosd up
2672 kCycles for 100 * rep stosd
1117 kCycles for 100 * HeapAlloc (*8 )
957 kCycles for 100 * StackBuffer (with zeroing)
2613 kCycles for 100 * StackBuffer (unrolled)
1391 kCycles for 100 * movaps xmm0
1054 kCycles for 100 * rep stosd up
2672 kCycles for 100 * rep stosd
1824 kCycles for 100 * HeapAlloc (*8 )
962 kCycles for 100 * StackBuffer (with zeroing)
3380 kCycles for 100 * StackBuffer (unrolled)
934 kCycles for 100 * movaps xmm0
1789 kCycles for 100 * rep stosd up
18 bytes for rep stosd
103 bytes for HeapAlloc (*8 )
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
25 bytes for movaps xmm0
17 bytes for rep stosd up
A bit of a difference in rep stosd...
Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (SSE4)
loop overhead is approx. 328/100 cycles
2449 kCycles for 100 * rep stosd
1025 kCycles for 100 * HeapAlloc (*8 )
951 kCycles for 100 * StackBuffer (with zeroing)
2384 kCycles for 100 * StackBuffer (unrolled)
927 kCycles for 100 * movaps xmm0
955 kCycles for 100 * rep stosd up
Thanks :icon14:
Astonishing that the unrolled version is so much slower...
xorps xmm0, xmm0
ifnb <unrolled>
shr eax, 4+2 ; bufsize/16*4
mov edx, esp ; save current stack pointer
and esp, -16 ; aligned for SSE2
align 4
.Repeat
sub esp, 4*OWORD
movdqa OWORD ptr [esp], xmm0
movdqa OWORD ptr [1*OWORD+esp], xmm0
movdqa OWORD ptr [2*OWORD+esp], xmm0
movdqa OWORD ptr [3*OWORD+esp], xmm0
dec eax
.Until Zero?
else
shr eax, 4 ; /16
mov edx, esp ; save current stack pointer
and esp, -16 ; aligned for SSE2
align 4
.Repeat
sub esp, OWORD
movaps OWORD ptr [esp], xmm0
dec eax
.Until Zero?
endif
Aligning the stack buffer to 64 (one cache line) gives me almost identical times for looped/unrolled.
Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz (SSE4)
loop overhead is approx. 202/100 cycles
945 kCycles for 100 * StackBuffer (with zeroing)
993 kCycles for 100 * StackBuffer (unrolled)
949 kCycles for 100 * StackBuffer (with zeroing)
933 kCycles for 100 * StackBuffer (unrolled)
926 kCycles for 100 * StackBuffer (with zeroing)
933 kCycles for 100 * StackBuffer (unrolled)
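The alignment trick behind sinsi's result can be sketched in C (a sketch of the idea, not code from the thread): "and esp, -16" rounds the stack pointer down to 16 bytes for SSE; using -64 instead rounds it down to one cache line, which is what evened out the looped/unrolled timings.

```c
#include <stdint.h>
#include <assert.h>

/* Round an address down to the given alignment, the C equivalent of
   "and esp, -align". align must be a power of two. */
static uintptr_t align_down(uintptr_t p, uintptr_t align)
{
    return p & ~(align - 1);
}
```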
Quote from: sinsi on October 26, 2013, 07:19:16 PM
Aligning the stack buffer to 64 (one cache line) gives me almost identical times for looped/unrolled.
Same effect for reordering:
and esp, -16 ; aligned for SSE2
align 4
.Repeat
sub esp, 4*OWORD
movaps OWORD ptr [3*OWORD+esp], xmm0
movaps OWORD ptr [2*OWORD+esp], xmm0
movaps OWORD ptr [1*OWORD+esp], xmm0
movaps OWORD ptr [0*OWORD+esp], xmm0
dec eax
.Until Zero?
But the timings are identical, so no need for unrolling.
Jochen,
timings from my machine at home:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
loop overhead is approx. 229/100 cycles
2287 kCycles for 100 * rep stosd
4347 kCycles for 100 * push 0
4314 kCycles for 100 * push edx
830 kCycles for 100 * StackBuffer (with zeroing)
822 kCycles for 100 * movaps xmm0
847 kCycles for 100 * rep stosd up
2290 kCycles for 100 * rep stosd
4321 kCycles for 100 * push 0
4289 kCycles for 100 * push edx
828 kCycles for 100 * StackBuffer (with zeroing)
821 kCycles for 100 * movaps xmm0
859 kCycles for 100 * rep stosd up
2293 kCycles for 100 * rep stosd
4303 kCycles for 100 * push 0
4298 kCycles for 100 * push edx
830 kCycles for 100 * StackBuffer (with zeroing)
812 kCycles for 100 * movaps xmm0
847 kCycles for 100 * rep stosd up
18 bytes for rep stosd
17 bytes for push 0
16 bytes for push edx
29 bytes for StackBuffer (with zeroing)
25 bytes for movaps xmm0
17 bytes for rep stosd up
--- ok ---
Gunther
version 2c on a prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 264/100 cycles
5102 kCycles for 100 * rep stosd
4717 kCycles for 100 * HeapAlloc (*8 )
2851 kCycles for 100 * StackBuffer (with zeroing)
3962 kCycles for 100 * StackBuffer (unrolled)
2835 kCycles for 100 * movaps xmm0
2872 kCycles for 100 * rep stosd up
5145 kCycles for 100 * rep stosd
3681 kCycles for 100 * HeapAlloc (*8 )
2862 kCycles for 100 * StackBuffer (with zeroing)
3950 kCycles for 100 * StackBuffer (unrolled)
2894 kCycles for 100 * movaps xmm0
2844 kCycles for 100 * rep stosd up
5111 kCycles for 100 * rep stosd
3769 kCycles for 100 * HeapAlloc (*8 )
2836 kCycles for 100 * StackBuffer (with zeroing)
3950 kCycles for 100 * StackBuffer (unrolled)
2846 kCycles for 100 * movaps xmm0
2900 kCycles for 100 * rep stosd up
can't beat REP STOSD for simplicity :P
Quote from: dedndave on October 26, 2013, 11:05:51 PM
can't beat REP STOSD for simplicity :P
But for the fast "rep stosd up" you need to write an SEH, which makes it slightly more complicated again :icon_mrgreen:
why SEH ?
i posted code that does STOSD up with no SEH
but, you haven't incorporated it
a little update...
ASSUME FS:Nothing
mov edx,edi
mov edi,esp
mov ecx,esp
sub edi,<NumberOfBytesRequiredPlus3Mod4>
.repeat
push eax
mov esp,fs:[8]
.until edi>=esp
sub ecx,edi
shr ecx,2
xor eax,eax
mov esp,edi
rep stosd
mov edi,edx
ASSUME FS:ERROR
Quote from: dedndave on October 26, 2013, 11:24:40 PM
but, you haven't incorporated it
I've tried to but it crashes ::)
Set useE=1 in the source...
if it crashes, there must be a simple reason - lol
how much memory are you trying to allocate ?
try the attached test code...
virgin 2d prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 254/100 cycles
5184 kCycles for 100 * rep stosd
4122 kCycles for 100 * HeapAlloc (*8 )
2853 kCycles for 100 * StackBuffer (with zeroing)
2868 kCycles for 100 * StackBuffer (unrolled)
2899 kCycles for 100 * rep stosd up
5116 kCycles for 100 * rep stosd
3073 kCycles for 100 * HeapAlloc (*8 )
2839 kCycles for 100 * StackBuffer (with zeroing)
2849 kCycles for 100 * StackBuffer (unrolled)
2862 kCycles for 100 * rep stosd up
5161 kCycles for 100 * rep stosd
3080 kCycles for 100 * HeapAlloc (*8 )
2843 kCycles for 100 * StackBuffer (with zeroing)
2873 kCycles for 100 * StackBuffer (unrolled)
2848 kCycles for 100 * rep stosd up
StackBuffer2d results:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
loop overhead is approx. 193/100 cycles
2341 kCycles for 100 * rep stosd
987 kCycles for 100 * HeapAlloc (*8)
839 kCycles for 100 * StackBuffer (with zeroing)
830 kCycles for 100 * StackBuffer (unrolled)
921 kCycles for 100 * rep stosd up
2345 kCycles for 100 * rep stosd
985 kCycles for 100 * HeapAlloc (*8)
829 kCycles for 100 * StackBuffer (with zeroing)
867 kCycles for 100 * StackBuffer (unrolled)
872 kCycles for 100 * rep stosd up
2339 kCycles for 100 * rep stosd
989 kCycles for 100 * HeapAlloc (*8)
850 kCycles for 100 * StackBuffer (with zeroing)
906 kCycles for 100 * StackBuffer (unrolled)
875 kCycles for 100 * rep stosd up
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
34 bytes for StackBuffer (with zeroing)
44 bytes for StackBuffer (unrolled)
17 bytes for rep stosd up
--- ok ---
Dave,
your ProbeTest works fine under 64 bit.
Gunther
thanks Gunther - whew !
only thing i can think of is Jochen is trying to allocate more than is reserved
or - perhaps he has some other flaw that makes the try/catch thing necessary
(he must have been a C programmer in a previous life :P )
but - i think there is a major flaw in the idea of speed-tests for probing code
once you have committed that memory, it remains committed until you release it and the OS allocates it elsewhere
to overcome this, you might try HeapAlloc
if the OS needs that space for the heap, it should "reset" the amount committed
i don't think altering the value at FS:[8] is a good idea - lol
sounds like a memory leak waiting to happen
ok
the default reserve is supposed to be 1 MB = 1,048,576 (100000h)
i can only allocate up to 1,032,192 (0FC000h) without a crash
that must be why Jochen is having to use SEH
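Dave's numbers can be checked with a little arithmetic (the interpretation of the 16 KB headroom as guard page plus initially committed pages is an assumption of mine, not something the thread measured):

```c
#include <assert.h>

/* Default 1 MB stack reserve vs. the largest allocation that worked
   without a crash; the 4000h-byte difference is plausibly the guard
   page plus the initially committed stack pages. */
enum {
    STACK_RESERVE = 0x100000,               /* 1,048,576 bytes */
    MAX_ALLOC     = 0xFC000,                /* 1,032,192 bytes */
    HEADROOM      = STACK_RESERVE - MAX_ALLOC
};
```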
Quote from: dedndave on October 27, 2013, 12:53:50 AM
only thing i can think of is Jochen is trying to allocate more than is reserved
or - perhaps he has some other flaw that makes the try/catch thing necessary
Dave,
bufsize is 102400 bytes, no big deal. The Try/Catch thing would be needed for the "rep stosd up" algo, simply because it doesn't probe the stack.
Here is your code embedded in the testbed; it doesn't crash any more, but 2 kCycles is a bit fast... some more comments would be nice, or maybe I am just too tired to understand it :(
TestE proc
mov ebx, AlgoLoops-1 ; loop e.g. 100x
mov esi, esp ; check the stack
align 4
.Repeat
mov edx, edi
mov edi, esp
mov ecx, esp
sub edi, (bufsize+3) MOD 4 ;<NumberOfBytesRequiredPlus3Mod4>
.repeat
push eax
ASSUME FS:Nothing
mov esp, fs:[8]
ASSUME FS:ERROR
.until edi>=esp
sub ecx, edi
shr ecx, 2
xor eax, eax
mov esp, edi
rep stosd
mov edi, edx
add esp, (bufsize+3) MOD 4 ; restore stack
dec ebx
.Until Sign?
sub esi, esp
.if !Zero? ; OK
print str$(esi), " STACKDIFF"
exit
.endif
ret
TestE endp
it's not too bad
i probe down the stack by using the TEB.StackLimit value from FS:[8]
then, i use REP STOSD to clear it out
the probe part was discussed at length...
http://masm32.com/board/index.php?topic=1363
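For reference, the layout behind FS:[8] can be sketched as a C struct (field names follow the Win32 NT_TIB structure; the struct itself is a local illustration, not a Windows header). On 32-bit Windows, FS points at the thread's TIB, so FS:[8] is StackLimit, the lowest currently committed stack address, which is why the probe loop re-reads fs:[8] after each push.

```c
#include <stddef.h>
#include <stdint.h>
#include <assert.h>

/* Illustrative prefix of the 32-bit TIB as seen through FS. */
typedef struct TIB32 {
    uint32_t ExceptionList;     /* FS:[0]  SEH chain */
    uint32_t StackBase;         /* FS:[4]  high end of the stack */
    uint32_t StackLimit;        /* FS:[8]  low end of committed stack */
} TIB32;
```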
Thanks, Dave - I had not seen that thread. Now it's clearer...
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 188/100 cycles
4710 kCycles for 100 * rep stosd
2220 kCycles for 100 * HeapAlloc (*8)
2193 kCycles for 100 * StackBuffer (with zeroing)
2192 kCycles for 100 * StackBuffer (unrolled)
4697 kCycles for 100 * dedndave
1738 kCycles for 100 * rep stosd up
This is for slightly modified code, taking account of the need to save & restore the old stack:
.Repeat
mov edx, edi ; save edi
mov edi, esp
mov eax, esp ; save old stack
sub edi, (bufsize+3+4) ;<NumberOfBytesRequiredPlus3Mod4>
and edi, -4 ; aligns new stack
.repeat
push eax ; tickle the guard page
ASSUME FS:Nothing
mov esp, fs:[8] ; limit might be 4k lower now
ASSUME FS:ERROR
.until edi>=esp ; loop until we've got enough
mov esp, edi ; new stack
stosd ; save old stack to [edi]
xchg eax, ecx
push edi ; retval for macro
sub ecx, edi
shr ecx, 2
xor eax, eax
rep stosd
pop eax ; retval for macro
mov edi, edx ; restore edi
; ... code that uses buffer...
pop esp ; restore stack
dec ebx
.Until Sign?
I hope I didn't misunderstand anything - for some time I was thoroughly confused by your NumberOfBytesRequiredPlus3Mod4 ::)
sorry for the confusion - it's just a number that is mod4=0
it could be an immediate - or a value calculated in EAX
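In other words, the value is the requested byte count rounded up to the next multiple of 4, so the dword-clearing loop covers it exactly. One way to compute it, as a C sketch (mine, not from the thread): note that an expression like (bufsize+3) MOD 4, as in the embedded TestE above, yields only the remainder 0..3, a buffer of at most 3 bytes, which would explain the suspiciously fast 2 kCycles.

```c
#include <stdint.h>
#include <assert.h>

/* Round n up to the next multiple of 4 - the intended
   "NumberOfBytesRequiredPlus3Mod4" value. (n+3) MOD 4 would instead
   give just the remainder 0..3. */
static uint32_t round_up4(uint32_t n)
{
    return (n + 3) & ~(uint32_t)3;
}
```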
as for restoring the stack.....
OPTION PROLOGUE:None
OPTION EPILOGUE:None
MyProc PROC parm1:DWORD
push ebx
push esi
push edi ;push/pops on EBX ESI EDI are optional, of course
push ebp
mov esp,ebp
;stack probe code here
;stack clear code here
;use stack space, as required
leave
pop edi
pop esi
pop ebx
ret 4
MyProc ENDP
OPTION PROLOGUE:PrologueDef
OPTION EPILOGUE:EpilogueDef
Quote from: dedndave on October 27, 2013, 12:10:08 PM
as for restoring the stack..... leave
I've tried that but it crashes. If you have working code, please insert into the source :icon14:
Anyway, speed-wise it doesn't look so convincing. By the way, the forum software translates *8 into a smiley - HeapAlloc is actually tested with one eighth of the buffer size, because it's so slow :(
give this a try, my friend
i am anxious to see if it crashes on you :P
it should display the allocation size (F0000), then 0 (cleared OR test result)
i commented it heavily, just for you :biggrin:
Quote from: jj2007 on October 26, 2013, 04:19:12 PM
Quote from: KeepingRealBusy on October 26, 2013, 01:42:11 PM
A question: what about unrolling the xmm loop and using 8 movdqa's to reduce the loop overhead?
Why not ;-)
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 188/100 cycles
4725 kCycles for 100 * rep stosd
2683 kCycles for 100 * HeapAlloc (*8)
2202 kCycles for 100 * StackBuffer (with zeroing)
2887 kCycles for 100 * StackBuffer (unrolled)
2207 kCycles for 100 * movaps xmm0
1746 kCycles for 100 * rep stosd up
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
Jochen,
Using this version, I made some changes. The original movaps test was TestE. I made an unrolled version as TestI, then used some similar code to modify TestE and saved the variants as TestG and TestH. The modifications move the "constant" initializations out of the REPEAT loops and execute them once at the beginning of each test (before the REPEATs). The following are the .lst sections for TestE, TestG, TestH, and TestI (just to check alignments):
align 16
00000190 TestE_s:
= movaps xmm0 NameE equ movaps xmm0 ; assign a descriptive name here
00000190 TestE proc
00000190 BB 00000063 mov ebx, AlgoLoops-1 ; loop e.g. 100x
align 4
.Repeat
00000198 *@C0011:
00000198 8B CC mov ecx, esp
0000019A 8D 84 24 lea eax, [esp-bufsize]
FFFE7000
000001A1 83 E4 F0 and esp, -16 ; needs a reg or local to store original esp
000001A4 0F 57 C0 xorps xmm0, xmm0
; align 4
.Repeat
000001A7 *@C0012:
000001A7 83 EC 10 sub esp, OWORD
000001AA 0F 29 04 24 movaps OWORD ptr [esp], xmm0 ; movaps <1% faster on AMD
.Until esp<=eax
000001AE 3B E0 * cmp esp, eax
000001B0 77 F5 * ja @C0012
000001B2 8B E1 mov esp, ecx
; add esp, bufsize
000001B4 4B dec ebx
.Until Sign?
000001B5 79 E1 * jns @C0011
000001B7 C3 ret
000001B8 TestE endp
000001B8 TestE_endp:
align 16
000001E0 TestG_s:
= movaps xmm0 (down) NameG equ movaps xmm0 (down) ; assign a descriptive name here
000001E0 TestG proc
000001E0 BB 00000063 mov ebx, AlgoLoops-1 ; loop e.g. 100x
000001E5 8B CC mov ecx, esp
000001E7 8B F4 mov esi, esp
000001E9 BA FFFFFFF0 mov edx, -OWORD
000001EE 83 E6 F0 and esi, -16
000001F1 0F 57 C0 xorps xmm0, xmm0
000001F4 8D 86 FFFE7000 lea eax, [esi-bufsize]
align 16
.Repeat
00000200 *@C0017:
00000200 8B E6 mov esp, esi
.Repeat
00000202 *@C0018:
00000202 8D 24 14 lea esp,[esp+edx]
00000205 0F 29 04 24 movaps OWORD ptr [esp], xmm0 ; movaps <1% faster on AMD
.Until esp==eax
00000209 3B E0 * cmp esp, eax
0000020B 75 F5 * jne @C0018
0000020D 4B dec ebx
.Until Sign?
0000020E 79 F0 * jns @C0017
00000210 8B E1 mov esp, ecx
00000212 C3 ret
00000213 TestG endp
00000213 TestG_endp:
align 16
00000220 TestH_s:
= movaps xmm0 (up) NameH equ movaps xmm0 (up) ; assign a descriptive name here
00000220 TestH proc
00000220 BB 00000063 mov ebx, AlgoLoops-1 ; loop e.g. 100x
00000225 8B CC mov ecx, esp
00000227 8D B4 24 lea esi, [esp-bufsize]
FFFE7000
0000022E BA 00000010 mov edx, OWORD
00000233 83 E6 F0 and esi, -16
00000236 0F 57 C0 xorps xmm0, xmm0
00000239 8D 86 00019000 lea eax, [esi+bufsize]
align 16
.Repeat
00000240 *@C001B:
00000240 8B E6 mov esp, esi
.Repeat
00000242 *@C001C:
00000242 0F 29 04 24 movaps OWORD ptr [esp+(0*OWORD)], xmm0 ; movaps <1% faster on AMD
00000246 8D 24 14 lea esp,[esp+edx]
.Until esp==eax
00000249 3B E0 * cmp esp, eax
0000024B 75 F5 * jne @C001C
0000024D 4B dec ebx
.Until Sign?
0000024E 79 F0 * jns @C001B
00000250 8B E1 mov esp, ecx
00000252 C3 ret
00000253 TestH endp
00000253 TestH_endp:
align 16
00000260 TestI_s:
= movaps xmm0 (unrolled) NameI equ movaps xmm0 (unrolled) ; assign a descriptive name here
00000260 TestI proc
00000260 BB 00000063 mov ebx, AlgoLoops-1 ; loop e.g. 100x
00000265 8B CC mov ecx, esp
00000267 8D B4 24 lea esi, [esp-bufsize]
FFFE7000
0000026E BA 00000080 mov edx, (8*OWORD)
00000273 83 E6 F0 and esi, -16
00000276 0F 57 C0 xorps xmm0, xmm0
00000279 8D 86 00019000 lea eax, [esi+bufsize]
.Repeat
0000027F *@C001F:
0000027F 8B E6 mov esp, esi
align 16
.Repeat
00000290 *@C0020:
00000290 0F 29 04 24 movaps OWORD ptr [esp+(0*OWORD)], xmm0 ; movaps <1% faster on AMD
00000294 0F 29 44 24 10 movaps OWORD ptr [esp+(1*OWORD)], xmm0 ; movaps <1% faster on AMD
00000299 0F 29 44 24 20 movaps OWORD ptr [esp+(2*OWORD)], xmm0 ; movaps <1% faster on AMD
0000029E 0F 29 44 24 30 movaps OWORD ptr [esp+(3*OWORD)], xmm0 ; movaps <1% faster on AMD
000002A3 0F 29 44 24 40 movaps OWORD ptr [esp+(4*OWORD)], xmm0 ; movaps <1% faster on AMD
000002A8 0F 29 44 24 50 movaps OWORD ptr [esp+(5*OWORD)], xmm0 ; movaps <1% faster on AMD
000002AD 0F 29 44 24 60 movaps OWORD ptr [esp+(6*OWORD)], xmm0 ; movaps <1% faster on AMD
000002B2 0F 29 44 24 70 movaps OWORD ptr [esp+(7*OWORD)], xmm0 ; movaps <1% faster on AMD
000002B7 8D 24 14 lea esp,[esp+edx]
.Until esp==eax
000002BA 3B E0 * cmp esp, eax
000002BC 75 D2 * jne @C0020
; add esp, bufsize
000002BE 4B dec ebx
.Until Sign?
000002BF 79 BE * jns @C001F
000002C1 8B E1 mov esp, ecx
000002C3 C3 ret
000002C4 TestI endp
000002C4 TestI_endp:
The following are my executions:
AMD A8-3520M APU with Radeon(tm) HD Graphics (SSE3)
loop overhead is approx. 433/100 cycles
5229 kCycles for 100 * rep stosd
3627 kCycles for 100 * HeapAlloc (*8)
3274 kCycles for 100 * StackBuffer (with zeroing)
3278 kCycles for 100 * StackBuffer (unrolled)
3193 kCycles for 100 * movaps xmm0
3118 kCycles for 100 * rep stosd up
2798 kCycles for 100 * movaps xmm0 (down)
2974 kCycles for 100 * movaps xmm0 (up)
2895 kCycles for 100 * movaps xmm0 (unrolled)
3573 kCycles for 100 * rep stosd
2709 kCycles for 100 * HeapAlloc (*8)
2458 kCycles for 100 * StackBuffer (with zeroing)
2481 kCycles for 100 * StackBuffer (unrolled)
2426 kCycles for 100 * movaps xmm0
2218 kCycles for 100 * rep stosd up
2086 kCycles for 100 * movaps xmm0 (down)
2329 kCycles for 100 * movaps xmm0 (up)
2273 kCycles for 100 * movaps xmm0 (unrolled)
2244 kCycles for 100 * rep stosd
1512 kCycles for 100 * HeapAlloc (*8)
1422 kCycles for 100 * StackBuffer (with zeroing)
1403 kCycles for 100 * StackBuffer (unrolled)
1546 kCycles for 100 * movaps xmm0
1448 kCycles for 100 * rep stosd up
1424 kCycles for 100 * movaps xmm0 (down)
1561 kCycles for 100 * movaps xmm0 (up)
1502 kCycles for 100 * movaps xmm0 (unrolled)
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
25 bytes for movaps xmm0
17 bytes for rep stosd up
36 bytes for movaps xmm0 (down)
36 bytes for movaps xmm0 (up)
85 bytes for movaps xmm0 (unrolled)
--- ok ---
The times are interesting. I have attached a zip of my .asm and .exe files.
Dave.
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 579/100 cycles
2673 kCycles for 100 * rep stosd
1437 kCycles for 100 * HeapAlloc (*8 )
1667 kCycles for 100 * StackBuffer (with zeroing)
2611 kCycles for 100 * StackBuffer (unrolled)
1680 kCycles for 100 * movaps xmm0
1027 kCycles for 100 * rep stosd up
1680 kCycles for 100 * movaps xmm0 (down)
957 kCycles for 100 * movaps xmm0 (up)
973 kCycles for 100 * movaps xmm0 (unrolled)
2672 kCycles for 100 * rep stosd
1500 kCycles for 100 * HeapAlloc (*8 )
1687 kCycles for 100 * StackBuffer (with zeroing)
2608 kCycles for 100 * StackBuffer (unrolled)
1681 kCycles for 100 * movaps xmm0
1029 kCycles for 100 * rep stosd up
1699 kCycles for 100 * movaps xmm0 (down)
948 kCycles for 100 * movaps xmm0 (up)
982 kCycles for 100 * movaps xmm0 (unrolled)
2671 kCycles for 100 * rep stosd
1446 kCycles for 100 * HeapAlloc (*8 )
1677 kCycles for 100 * StackBuffer (with zeroing)
2607 kCycles for 100 * StackBuffer (unrolled)
1681 kCycles for 100 * movaps xmm0
1070 kCycles for 100 * rep stosd up
1678 kCycles for 100 * movaps xmm0 (down)
966 kCycles for 100 * movaps xmm0 (up)
994 kCycles for 100 * movaps xmm0 (unrolled)
18 bytes for rep stosd
103 bytes for HeapAlloc (*8 )
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
25 bytes for movaps xmm0
17 bytes for rep stosd up
36 bytes for movaps xmm0 (down)
36 bytes for movaps xmm0 (up)
85 bytes for movaps xmm0 (unrolled)
Results with Dave's (KeepingRealBusy) version:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
loop overhead is approx. 201/100 cycles
2356 kCycles for 100 * rep stosd
943 kCycles for 100 * HeapAlloc (*8)
853 kCycles for 100 * StackBuffer (with zeroing)
2285 kCycles for 100 * StackBuffer (unrolled)
824 kCycles for 100 * movaps xmm0
886 kCycles for 100 * rep stosd up
825 kCycles for 100 * movaps xmm0 (down)
880 kCycles for 100 * movaps xmm0 (up)
1443 kCycles for 100 * movaps xmm0 (unrolled)
2354 kCycles for 100 * rep stosd
967 kCycles for 100 * HeapAlloc (*8)
877 kCycles for 100 * StackBuffer (with zeroing)
2300 kCycles for 100 * StackBuffer (unrolled)
846 kCycles for 100 * movaps xmm0
911 kCycles for 100 * rep stosd up
842 kCycles for 100 * movaps xmm0 (down)
883 kCycles for 100 * movaps xmm0 (up)
839 kCycles for 100 * movaps xmm0 (unrolled)
2948 kCycles for 100 * rep stosd
981 kCycles for 100 * HeapAlloc (*8)
845 kCycles for 100 * StackBuffer (with zeroing)
2286 kCycles for 100 * StackBuffer (unrolled)
865 kCycles for 100 * movaps xmm0
868 kCycles for 100 * rep stosd up
828 kCycles for 100 * movaps xmm0 (down)
844 kCycles for 100 * movaps xmm0 (up)
865 kCycles for 100 * movaps xmm0 (unrolled)
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
29 bytes for StackBuffer (with zeroing)
48 bytes for StackBuffer (unrolled)
25 bytes for movaps xmm0
17 bytes for rep stosd up
36 bytes for movaps xmm0 (down)
36 bytes for movaps xmm0 (up)
85 bytes for movaps xmm0 (unrolled)
--- ok ---
Gunther
Quote from: KeepingRealBusy on October 27, 2013, 02:04:42 PM
The modifications were to move the "constant" initializations out of the REPEAT loops
Dave,
That defeats the purpose of these loops, which is to simulate a complete HeapAlloc/.../HeapFree sequence...
Dave (the dedn),
Your algo is now included in the testbed below. I have improved it so dramatically that you are now morally obliged to donate it to MasmBasic's StackBuffer() :icon_mrgreen:
push edi
push ebp
mov ebp, esp
mov edi, esp
mov ecx, bufsize ; to be replaced with immediate, global, local, reg etc
sub edi, ecx
and edi, -64 ; aligns buffer to a cache line
ASSUME FS:Nothing
.repeat
push eax ; tickle the guard page - limit might be 4k lower now
mov esp, fs:[8]
.until edi>=esp ; loop until we've got enough
ASSUME FS:ERROR
mov esp, edi ; new stack
add ecx, 3 ; bufsize might be badly aligned
shr ecx, 2 ; stosD
xor eax, eax
rep stosd
mov eax, esp ; retval for macro
; ... use the buffer ...
leave
pop edi
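For readers less fluent in MASM, the size arithmetic in the snippet above can be sketched in portable C (a sketch only; the guard-page probing via fs:[8] is Windows-specific and omitted here):

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* `and edi, -64` aligns the buffer start down to a 64-byte cache line. */
static uintptr_t align_down_64(uintptr_t p)
{
    return p & ~(uintptr_t)63;
}

/* `add ecx, 3` / `shr ecx, 2` rounds the byte count up to whole dwords,
   so that rep stosd covers a badly aligned bufsize completely. */
static size_t dwords_for(size_t bytes)
{
    return (bytes + 3) >> 2;
}
```

Aligning down to 64 means the buffer may start up to 63 bytes below esp minus bufsize, which is why the probe loop keeps pushing until edi is above the committed stack limit.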
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 189/100 cycles
4743 kCycles for 100 * rep stosd
2232 kCycles for 100 * HeapAlloc (*8)
2201 kCycles for 100 * StackBuffer (with zeroing)
2208 kCycles for 100 * StackBuffer (unrolled)
1749 kCycles for 100 * dedndave
1746 kCycles for 100 * rep stosd up
4725 kCycles for 100 * rep stosd
1846 kCycles for 100 * HeapAlloc (*8)
2205 kCycles for 100 * StackBuffer (with zeroing)
2204 kCycles for 100 * StackBuffer (unrolled)
1747 kCycles for 100 * dedndave
1746 kCycles for 100 * rep stosd up
4726 kCycles for 100 * rep stosd
1850 kCycles for 100 * HeapAlloc (*8)
2203 kCycles for 100 * StackBuffer (with zeroing)
2203 kCycles for 100 * StackBuffer (unrolled)
1747 kCycles for 100 * dedndave
1746 kCycles for 100 * rep stosd up
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
34 bytes for StackBuffer (with zeroing)
44 bytes for StackBuffer (unrolled)
41 bytes for dedndave <<<<<<<<<<<<<<<< BLOATWARE ALARM!!!
17 bytes for rep stosd up
Okay Jochen, here we go again:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
loop overhead is approx. 268/100 cycles
2404 kCycles for 100 * rep stosd
1003 kCycles for 100 * HeapAlloc (*8)
887 kCycles for 100 * StackBuffer (with zeroing)
905 kCycles for 100 * StackBuffer (unrolled)
831 kCycles for 100 * dedndave
872 kCycles for 100 * rep stosd up
2342 kCycles for 100 * rep stosd
982 kCycles for 100 * HeapAlloc (*8)
859 kCycles for 100 * StackBuffer (with zeroing)
835 kCycles for 100 * StackBuffer (unrolled)
927 kCycles for 100 * dedndave
897 kCycles for 100 * rep stosd up
2346 kCycles for 100 * rep stosd
965 kCycles for 100 * HeapAlloc (*8)
838 kCycles for 100 * StackBuffer (with zeroing)
835 kCycles for 100 * StackBuffer (unrolled)
828 kCycles for 100 * dedndave
873 kCycles for 100 * rep stosd up
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
34 bytes for StackBuffer (with zeroing)
44 bytes for StackBuffer (unrolled)
41 bytes for dedndave
17 bytes for rep stosd up
--- ok ---
Gunther
Thanks, Gunther. The dedndave algo is a clear winner, as expected. Almost ten times as fast as HeapAlloc is a good result :biggrin:
Jochen,
Quote from: jj2007 on October 27, 2013, 09:55:59 PM
... The dedndave algo is a clear winner, as expected. Almost ten times as fast as HeapAlloc is a good result :biggrin:
that's manifestly true. Congrats, Dave. :t
Gunther
version 3 (DD) prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 253/100 cycles
5243 kCycles for 100 * rep stosd
4087 kCycles for 100 * HeapAlloc (*8 )
2834 kCycles for 100 * StackBuffer (with zeroing)
2834 kCycles for 100 * StackBuffer (unrolled)
2905 kCycles for 100 * dedndave
2861 kCycles for 100 * rep stosd up
5121 kCycles for 100 * rep stosd
3114 kCycles for 100 * HeapAlloc (*8 )
2861 kCycles for 100 * StackBuffer (with zeroing)
2842 kCycles for 100 * StackBuffer (unrolled)
2855 kCycles for 100 * dedndave
2873 kCycles for 100 * rep stosd up
5157 kCycles for 100 * rep stosd
3039 kCycles for 100 * HeapAlloc (*8 )
2893 kCycles for 100 * StackBuffer (with zeroing)
2867 kCycles for 100 * StackBuffer (unrolled)
2849 kCycles for 100 * dedndave
2845 kCycles for 100 * rep stosd up
a few words of caution - that apply to all algos...
be sure you leave some space for the OS (stay well under the stack reserve)
try not to REP STOSD with ECX = 0 :lol:
i didn't test for that in my algo, but it could easily be added
shr ecx,2
.if !ZERO?
rep stosd
.endif
oh - and REP STOSD may still not be the fastest way to 0 the memory - that's another test, really
we still have an issue to deal with, as far as timing tests:
once the stack space has been committed the first time, it's already committed on any successive pass
if you really want to know how many cycles it takes, you have to force the OS to "un-commit" before the next timing pass
i think HeapAlloc'ing a large block can do that
it's probably best to separate the 2 operations and optimize each
Dave,
Quote from: dedndave on October 27, 2013, 11:30:34 PM
it's probably best to separate the 2 operations and optimize each
yes, it seems to me that this is true. But that's probably another story and another test.
Gunther
Quote from: dedndave on October 27, 2013, 11:30:34 PM
oh - and REP STOSD may still not be the fastest way to 0 the memory
It is, it is, at least for large buffers and for most CPUs - that's pretty obvious
Quotewe still have an issue to deal with, as far as timing tests:
once the stack space has been committed the first time, it's already committed on any successive pass
if you really want to know how many cycles it takes, you have to force the OS to "un-commit" before the next timing pass
Using StackBuffer will happen somewhere between "proc" and "endp". There are two extreme cases:
1. You use this proc once - then the handful of nanoseconds lost in committing will not matter.
2. You use this proc a million times - then you will not want the OS to uncommit and re-commit that stack space every time you call the proc.
So in effect the timings are extremely valid as they are...
Quotetry not to REP STOSD with ECX = 0 :lol:
See source:
add ecx, 3 ; bufsize might be badly aligned
shr ecx, 2 ; stosD
xor eax, eax
rep stosd
For ecx=0, rep stosd does absolutely nothing... caution, though, passing negative buffer sizes might result in unexpected behaviour :eusa_naughty:
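The wrap-around behind that caution can be shown with a small C sketch (an illustration only, not MasmBasic code): the rounding treats the size as unsigned, so ecx=0 is harmless, but a "negative" size reinterpreted as unsigned produces an enormous dword count.

```c
#include <assert.h>

/* Mirrors `add ecx,3` / `shr ecx,2` on a 32-bit unsigned size;
   the addition wraps modulo 2^32, exactly like the CPU. */
static unsigned dword_count(unsigned bytes)
{
    return (bytes + 3u) >> 2;
}
```

So dword_count(0) is 0 and rep stosd does nothing, but a size of -4 becomes roughly a billion dwords - enough to blow well past any stack reserve.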
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 215/100 cycles
2777 kCycles for 100 * rep stosd
1158 kCycles for 100 * HeapAlloc (*8)
1040 kCycles for 100 * StackBuffer (with zeroing)
1038 kCycles for 100 * StackBuffer (unrolled)
1090 kCycles for 100 * dedndave
1104 kCycles for 100 * rep stosd up
2770 kCycles for 100 * rep stosd
1144 kCycles for 100 * HeapAlloc (*8)
1065 kCycles for 100 * StackBuffer (with zeroing)
1047 kCycles for 100 * StackBuffer (unrolled)
1252 kCycles for 100 * dedndave
1069 kCycles for 100 * rep stosd up
2617 kCycles for 100 * rep stosd
1086 kCycles for 100 * HeapAlloc (*8)
981 kCycles for 100 * StackBuffer (with zeroing)
993 kCycles for 100 * StackBuffer (unrolled)
1037 kCycles for 100 * dedndave
1044 kCycles for 100 * rep stosd up
18 bytes for rep stosd
103 bytes for HeapAlloc (*8)
34 bytes for StackBuffer (with zeroing)
44 bytes for StackBuffer (unrolled)
41 bytes for dedndave
17 bytes for rep stosd up
Quote from: jj2007 on October 28, 2013, 12:26:26 AM
For ecx=0, rep stosd does absolutely nothing...
oh - that's good :P
as i recall, on an 8088, CX = 0 would do 64 KB
Thanks to everybody for testing :icon14:
New version:
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 190/100 cycles
512000 bytes:
23582 kCycles for 100 * rep stosd
10726 kCycles for 100 * HeapAlloc
8653 kCycles for 100 * StackBuffer (with zeroing)
8653 kCycles for 100 * dedndave
8627 kCycles for 100 * rep stosd up (no probing)
To my embarrassment, it seems the bufsize/8 disappeared somewhere, so the HeapAlloc values are for the full buffer size. And they are close to the others.
Try changing line 5: bufsize=102400*6 - you are in for a virtual surprise ;)
deleted
Quote from: nidud on October 28, 2013, 01:45:46 AM
here is a thought... link /stack:102400,102400
A valid thought but
a) it requires more discipline with linker settings, environment variables etc
b) that local buffer is still full of garbage and
c) it is not aligned for use with SSE2
;)
Jochen,
StackBuffer3.exe comes up with that result:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
loop overhead is approx. 193/100 cycles
512000 bytes:
12600 kCycles for 100 * rep stosd
4696 kCycles for 100 * HeapAlloc
4672 kCycles for 100 * StackBuffer (with zeroing)
4538 kCycles for 100 * dedndave
4593 kCycles for 100 * rep stosd up (no probing)
12567 kCycles for 100 * rep stosd
5334 kCycles for 100 * HeapAlloc
5236 kCycles for 100 * StackBuffer (with zeroing)
4937 kCycles for 100 * dedndave
4692 kCycles for 100 * rep stosd up (no probing)
12518 kCycles for 100 * rep stosd
4685 kCycles for 100 * HeapAlloc
4674 kCycles for 100 * StackBuffer (with zeroing)
5286 kCycles for 100 * dedndave
4666 kCycles for 100 * rep stosd up (no probing)
18 bytes for rep stosd
103 bytes for HeapAlloc
54 bytes for StackBuffer (with zeroing)
41 bytes for dedndave
17 bytes for rep stosd up (no probing)
--- ok ---
Gunther
prescott w/htt
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 261/100 cycles
512000 bytes:
26444 kCycles for 100 * rep stosd
22020 kCycles for 100 * HeapAlloc
16433 kCycles for 100 * StackBuffer (with zeroing)
15254 kCycles for 100 * dedndave
15176 kCycles for 100 * rep stosd up (no probing)
26155 kCycles for 100 * rep stosd
17181 kCycles for 100 * HeapAlloc
16086 kCycles for 100 * StackBuffer (with zeroing)
15254 kCycles for 100 * dedndave
15979 kCycles for 100 * rep stosd up (no probing)
26160 kCycles for 100 * rep stosd
17103 kCycles for 100 * HeapAlloc
16196 kCycles for 100 * StackBuffer (with zeroing)
15333 kCycles for 100 * dedndave
15132 kCycles for 100 * rep stosd up (no probing)
--- ok ---
loop overhead is approx. 254/100 cycles
512000 bytes:
26153 kCycles for 100 * rep stosd
22074 kCycles for 100 * HeapAlloc
16154 kCycles for 100 * StackBuffer (with zeroing)
15852 kCycles for 100 * dedndave
15254 kCycles for 100 * rep stosd up (no probing)
26087 kCycles for 100 * rep stosd
16510 kCycles for 100 * HeapAlloc
16647 kCycles for 100 * StackBuffer (with zeroing)
15258 kCycles for 100 * dedndave
15187 kCycles for 100 * rep stosd up (no probing)
26145 kCycles for 100 * rep stosd
16325 kCycles for 100 * HeapAlloc
16303 kCycles for 100 * StackBuffer (with zeroing)
15257 kCycles for 100 * dedndave
15032 kCycles for 100 * rep stosd up (no probing)
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 542/100 cycles
512000 bytes:
14386 kCycles for 100 * rep stosd
5242 kCycles for 100 * HeapAlloc
5160 kCycles for 100 * StackBuffer (with zeroing)
5182 kCycles for 100 * dedndave
5283 kCycles for 100 * rep stosd up (no probing)
14350 kCycles for 100 * rep stosd
5261 kCycles for 100 * HeapAlloc
5202 kCycles for 100 * StackBuffer (with zeroing)
5187 kCycles for 100 * dedndave
5258 kCycles for 100 * rep stosd up (no probing)
14356 kCycles for 100 * rep stosd
5276 kCycles for 100 * HeapAlloc
5216 kCycles for 100 * StackBuffer (with zeroing)
5154 kCycles for 100 * dedndave
5289 kCycles for 100 * rep stosd up (no probing)
18 bytes for rep stosd
103 bytes for HeapAlloc
54 bytes for StackBuffer (with zeroing)
41 bytes for dedndave
17 bytes for rep stosd up (no probing)
deleted
SbTestJ proc uses esi MySize
mov esi, StackBuffer(MySize) ; works like a charm, no linker options needed
; ... use the buffer ...
StackBuffer() ; no args = release the buffer
ret
SbTestJ endp
SbTestN proc uses esi MySize
local buf[MySize+16]:byte ; error A2026: constant expected
local buffer:dword
lea eax,buf
and al,0F0h
add eax,16
mov buffer,eax
mov edi,eax
sub eax,eax
mov ecx,bufsize
shr ecx,2 ; stosd stores dwords, so divide the byte count by 4
rep stosd
; ... use the buffer ...
ret
SbTestN endp
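The align-down-then-step trick in SbTestN (`and al,0F0h` followed by `add eax,16`) can be expressed in C; a sketch, assuming the buffer is over-allocated by 16 bytes as in `buf[MySize+16]`:

```c
#include <assert.h>
#include <stdint.h>

/* Clear the low 4 address bits (align down to 16), then add 16:
   the result is 16-byte aligned and strictly greater than p, so it
   always lands inside a buffer padded with 16 extra bytes. */
static uintptr_t align16_inside(uintptr_t p)
{
    return (p & ~(uintptr_t)15) + 16;
}
```

The unconditional +16 wastes up to 16 bytes, but it avoids a branch and guarantees SSE2-friendly alignment regardless of where the local lands on the stack.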
;)
deleted
Quote from: nidud on October 28, 2013, 07:58:57 AM
If you plan on calling this macro frequently it may be better to set the stack one time
No need for doing that. Dave's fs:[8] loop is extremely clever - fs:[8] holds the current stack limit (the lowest committed stack address) in the 32-bit TIB, so if the stack is already committed, the loop costs just 1 or 2 cycles.
I'm curious: movups is slower than conventional instructions, but it moves wider data, right? push edx stores 4 bytes, while movups stores 8 to 16 bytes - so if the cycle count was 22, it should be divided by 2 or 4 to get the per-byte transfer rate.
Just stumbled over an oddity with HeapAlloc: It gets very, very slow for a small range of bytes requested (Win7-32):
AMD Athlon(tm) Dual Core Processor 4450B (SSE3)
loop overhead is approx. 238/100 cycles
512000 bytes:
19585 kCycles for 100 * rep stosd <<< the reference
512000 bytes:
19662 kCycles for 100 * HeapAlloc <<< so far so good
519168 bytes:
131619 kCycles for 100 * HeapAlloc <<< oops
520192 bytes:
915 kCycles for 100 * HeapAlloc <<< VirtualAlloc kicks in
19565 kCycles for 100 * rep stosd
512000 bytes:
19667 kCycles for 100 * HeapAlloc
519168 bytes:
133256 kCycles for 100 * HeapAlloc
520192 bytes:
932 kCycles for 100 * HeapAlloc
Intel(R) Core(TM) i7-4930K CPU @ 3.40GHz (SSE4)
loop overhead is approx. 575/100 cycles
512000 bytes:
14335 kCycles for 100 * rep stosd
512000 bytes:
5293 kCycles for 100 * HeapAlloc
519168 bytes:
5381 kCycles for 100 * HeapAlloc
520192 bytes:
1314 kCycles for 100 * HeapAlloc
14333 kCycles for 100 * rep stosd
512000 bytes:
5311 kCycles for 100 * HeapAlloc
519168 bytes:
5340 kCycles for 100 * HeapAlloc
520192 bytes:
1314 kCycles for 100 * HeapAlloc
18 bytes for rep stosd
104 bytes for HeapAlloc
XP MCE2005 SP3
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 427/100 cycles
512000 bytes:
34190 kCycles for 100 * rep stosd
512000 bytes:
28714 kCycles for 100 * HeapAlloc
519168 bytes:
23241 kCycles for 100 * HeapAlloc
520192 bytes:
2671 kCycles for 100 * HeapAlloc
34132 kCycles for 100 * rep stosd
512000 bytes:
23103 kCycles for 100 * HeapAlloc
519168 bytes:
23419 kCycles for 100 * HeapAlloc
520192 bytes:
2635 kCycles for 100 * HeapAlloc
Thanks, Marinus & Dave :icon14:
The switch to VirtualAlloc is there, but not the slowdown just below the threshold. Could be Win7-only, or some special feature of my machine ::)
see if you can test a single pass
HeapAlloc may not like being in a x100 loop :P
Quote from: dedndave on October 28, 2013, 11:28:18 PM
see if you can test a single pass
HeapAlloc may not like being in a x100 loop :P
It's quite happy to be in that loop for everything below 500k*1.01 and above 500k*1.016... and switching to e.g. 5 loops doesn't change the pattern. Weird.
that makes me wonder if there are other "holes" in the number line
and - is it specific to your hardware in some way
say, if you had more memory - would it act differently
deleted
i think this is a case where you have to apply some common sense
how is the heap/stack space going to be used in a typical program ?
i can't think of too many cases where you actually waffle back and forth between allocating heap memory and committing stack space
if you do, you should probably re-think your design :P
however, it would still be interesting to see how fast the OS can commit under different conditions
for example: allocate a large heap block, then commit some stack space
from what i have seen (with no heap allocated), the commit loop seems pretty fast
ok, well it doesn't seem to be as fast as i thought
or maybe it's just hard to properly measure something with 1 pass :P
11629 Clock cycles per page
EDIT: a more accurate version - results about the same - lol
deleted
well - that seems counter-intuitive
if you can allocate all available with HeapAlloc, it *should* reset the commit
deleted
i guess it doesn't matter - my way of thinking was wrong, of course
it seems that, once the space has been committed, it stays committed
it simply gets swapped out to the paging file if you try to HeapAlloc(nMaxBytes)
it might work if you create a thread to commit and release stack space, then terminate the thread
EDIT: we really aren't interested in measuring swaps between memory and the page file :lol:
deleted
Quote from: nidud on October 29, 2013, 04:00:41 AM
...However, once the stack is committed (one way or the other) it will be available
as a substitute for HeapAlloc(), and that will save both code space and cycles.
i think that's how you have to look at it, too
btw - i see Mark has a relatively new tool (new version, at least) - called VMMap
http://technet.microsoft.com/en-us/sysinternals/dd535533.aspx
i have to do some reading to interpret what it's showing me :P
Quote from: dedndave on October 29, 2013, 03:30:12 AM
it seems that, once the space has been committed, it stays committed
This is also my interpretation.
IMHO a StackBuffer() macro is best for repeatedly used small local buffers that are bigger than the 2 pages you can have without probing, and smaller than the range of sizes where HeapAlloc becomes competitive. Another advantage is that it avoids heap fragmentation.
Attached a new testbed with sizes 2k ... 512k. Feel free to modify - no MasmBasic needed ;-)
Intel(R) Celeron(R) M CPU 420 @ 1.60GHz (SSE3)
loop overhead is approx. 190/100 cycles
2048 bytes:
103705 cycles for 100 * HeapAlloc
28460 cycles for 100 * StackBuffer (xmm)
48535 cycles for 100 * StackBuffer (rep stosd)
47423 cycles for 100 * dedndave
103956 cycles for 100 * HeapAlloc
28460 cycles for 100 * StackBuffer (xmm)
48544 cycles for 100 * StackBuffer (rep stosd)
47424 cycles for 100 * dedndave
8192 bytes:
185329 cycles for 100 * HeapAlloc
105525 cycles for 100 * StackBuffer (xmm)
128172 cycles for 100 * StackBuffer (rep stosd)
127549 cycles for 100 * dedndave
184025 cycles for 100 * HeapAlloc
105314 cycles for 100 * StackBuffer (xmm)
128170 cycles for 100 * StackBuffer (rep stosd)
127050 cycles for 100 * dedndave
32768 bytes:
547 kCycles for 100 * HeapAlloc
438 kCycles for 100 * StackBuffer (xmm)
440 kCycles for 100 * StackBuffer (rep stosd)
439 kCycles for 100 * dedndave
548 kCycles for 100 * HeapAlloc
438 kCycles for 100 * StackBuffer (xmm)
444 kCycles for 100 * StackBuffer (rep stosd)
437 kCycles for 100 * dedndave
131072 bytes:
2810 kCycles for 100 * HeapAlloc
2808 kCycles for 100 * StackBuffer (xmm)
2230 kCycles for 100 * StackBuffer (rep stosd)
2222 kCycles for 100 * dedndave
2319 kCycles for 100 * HeapAlloc
2808 kCycles for 100 * StackBuffer (xmm)
2225 kCycles for 100 * StackBuffer (rep stosd)
2224 kCycles for 100 * dedndave
524288 bytes:
742 kCycles for 100 * HeapAlloc
12067 kCycles for 100 * StackBuffer (xmm)
8928 kCycles for 100 * StackBuffer (rep stosd)
8977 kCycles for 100 * dedndave
751 kCycles for 100 * HeapAlloc
12305 kCycles for 100 * StackBuffer (xmm)
8921 kCycles for 100 * StackBuffer (rep stosd)
8920 kCycles for 100 * dedndave
104 bytes for HeapAlloc
9 bytes for StackBuffer (xmm)
8 bytes for StackBuffer (rep stosd)
42 bytes for dedndave
33 bytes for MbStackB
16 bytes for MbStackX
prescott w/htt xp mce2005 sp3
Intel(R) Pentium(R) 4 CPU 3.00GHz (SSE3)
loop overhead is approx. 246/100 cycles
2048 bytes:
230072 cycles for 100 * HeapAlloc
59433 cycles for 100 * StackBuffer (xmm)
71350 cycles for 100 * StackBuffer (rep stosd)
71069 cycles for 100 * dedndave
229937 cycles for 100 * HeapAlloc
60252 cycles for 100 * StackBuffer (xmm)
71518 cycles for 100 * StackBuffer (rep stosd)
70532 cycles for 100 * dedndave
8192 bytes:
447586 cycles for 100 * HeapAlloc
233879 cycles for 100 * StackBuffer (xmm)
245398 cycles for 100 * StackBuffer (rep stosd)
245187 cycles for 100 * dedndave
447288 cycles for 100 * HeapAlloc
234628 cycles for 100 * StackBuffer (xmm)
246152 cycles for 100 * StackBuffer (rep stosd)
244349 cycles for 100 * dedndave
32768 bytes:
1304 kCycles for 100 * HeapAlloc
945 kCycles for 100 * StackBuffer (xmm)
925 kCycles for 100 * StackBuffer (rep stosd)
948 kCycles for 100 * dedndave
1274 kCycles for 100 * HeapAlloc
913 kCycles for 100 * StackBuffer (xmm)
932 kCycles for 100 * StackBuffer (rep stosd)
924 kCycles for 100 * dedndave
131072 bytes:
5997 kCycles for 100 * HeapAlloc
3639 kCycles for 100 * StackBuffer (xmm)
3682 kCycles for 100 * StackBuffer (rep stosd)
3704 kCycles for 100 * dedndave
4671 kCycles for 100 * HeapAlloc
3663 kCycles for 100 * StackBuffer (xmm)
3654 kCycles for 100 * StackBuffer (rep stosd)
3651 kCycles for 100 * dedndave
524288 bytes:
2084 kCycles for 100 * HeapAlloc
14688 kCycles for 100 * StackBuffer (xmm)
14708 kCycles for 100 * StackBuffer (rep stosd)
14847 kCycles for 100 * dedndave
2091 kCycles for 100 * HeapAlloc
14649 kCycles for 100 * StackBuffer (xmm)
14950 kCycles for 100 * StackBuffer (rep stosd)
16294 kCycles for 100 * dedndave
StackBuffer6 brings:
Intel(R) Core(TM) i7-3770 CPU @ 3.40GHz (SSE4)
+19 of 20 tests valid, loop overhead is approx. 239/100 cycles
2048 bytes:
71208 cycles for 100 * HeapAlloc
20078 cycles for 100 * StackBuffer (xmm)
40880 cycles for 100 * StackBuffer (rep stosd)
40775 cycles for 100 * dedndave
68901 cycles for 100 * HeapAlloc
47258 cycles for 100 * StackBuffer (xmm)
40798 cycles for 100 * StackBuffer (rep stosd)
40362 cycles for 100 * dedndave
8192 bytes:
111836 cycles for 100 * HeapAlloc
129559 cycles for 100 * StackBuffer (xmm)
122825 cycles for 100 * StackBuffer (rep stosd)
122018 cycles for 100 * dedndave
104378 cycles for 100 * HeapAlloc
55644 cycles for 100 * StackBuffer (xmm)
122728 cycles for 100 * StackBuffer (rep stosd)
122127 cycles for 100 * dedndave
32768 bytes:
262658 cycles for 100 * HeapAlloc
209534 cycles for 100 * StackBuffer (xmm)
196808 cycles for 100 * StackBuffer (rep stosd)
187779 cycles for 100 * dedndave
604 kCycles for 100 * HeapAlloc
477 kCycles for 100 * StackBuffer (xmm)
445 kCycles for 100 * StackBuffer (rep stosd)
443 kCycles for 100 * dedndave
131072 bytes:
1203 kCycles for 100 * HeapAlloc
1090 kCycles for 100 * StackBuffer (xmm)
1178 kCycles for 100 * StackBuffer (rep stosd)
1197 kCycles for 100 * dedndave
1843 kCycles for 100 * HeapAlloc
1108 kCycles for 100 * StackBuffer (xmm)
1159 kCycles for 100 * StackBuffer (rep stosd)
1195 kCycles for 100 * dedndave
524288 bytes:
586 kCycles for 100 * HeapAlloc
6736 kCycles for 100 * StackBuffer (xmm)
5396 kCycles for 100 * StackBuffer (rep stosd)
4757 kCycles for 100 * dedndave
591 kCycles for 100 * HeapAlloc
6740 kCycles for 100 * StackBuffer (xmm)
5389 kCycles for 100 * StackBuffer (rep stosd)
4787 kCycles for 100 * dedndave
104 bytes for HeapAlloc
9 bytes for StackBuffer (xmm)
8 bytes for StackBuffer (rep stosd)
42 bytes for dedndave
33 bytes for MbStackB
16 bytes for MbStackX
--- ok ---
Gunther
OK, thanks to everybody :icon14:
The new StackBuffer() is now implemented in MasmBasic of 30 Oct (more (http://masm32.com/board/index.php?topic=94.msg26592#msg26592)). In the end, rep stosd won the race. Usage examples:
mov sbuf1, StackBuffer(4000h) ; buffer is 16-byte aligned for use with SSE2
invoke GetFileSize, hFile, 0 ; you may use a register to specify the buffer size
mov sbuf2, StackBuffer(eax, nz) ; option nz means "no zeroing" - much faster, of course
...
StackBuffer() ; release all buffers (sb without args = free the buffer)
The nz option performs only the probing, and zeroes the last two bytes of the buffer plus two bytes beyond it. This allows loading e.g. a text file into the buffer while being sure that the end is zero-delimited.
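The zero-delimiting idea can be sketched in C (a hypothetical helper, not the MasmBasic implementation): after reserving two bytes more than the data needs, zeroing the two bytes past the data guarantees a terminator whether the text is later read as 1-byte ANSI or 2-byte UTF-16 characters.

```c
#include <assert.h>
#include <string.h>

/* Copy len bytes of text into buf (which must hold at least len+2
   bytes) and zero the two bytes past the data, so both char and
   wchar_t scans find a terminator. */
static void load_zero_delimited(char *buf, const char *src, size_t len)
{
    memcpy(buf, src, len);
    buf[len] = 0;
    buf[len + 1] = 0;
}
```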
:t
Jochen,
Quote from: jj2007 on October 30, 2013, 11:16:13 AM
The new StackBuffer() is now implemented in MasmBasic of 30 Oct
:t
Gunther