@Jochen
AMD's fpu's was faster before too,I had an Ahtlon and also in the other forum AMD fpu code was faster(D3d code using fpu also,so next cpu was intel core duo,because my interest in SSE,AMD was one step after intel in SSE generation
Intel(R) Core(TM) i5-7200U CPU @ 2.50GHz (SSE4)
82 cycles for 100 * add+adc
239 cycles for 100 * fadd
62 cycles for 100 * paddq aligned
67 cycles for 100 * paddq unaligned
81 cycles for 100 * add+adc
242 cycles for 100 * fadd
63 cycles for 100 * paddq aligned
71 cycles for 100 * paddq unaligned
85 cycles for 100 * add+adc
242 cycles for 100 * fadd
61 cycles for 100 * paddq aligned
70 cycles for 100 * paddq unaligned
83 cycles for 100 * add+adc
241 cycles for 100 * fadd
65 cycles for 100 * paddq aligned
73 cycles for 100 * paddq unaligned
85 cycles for 100 * add+adc
246 cycles for 100 * fadd
62 cycles for 100 * paddq aligned
72 cycles for 100 * paddq unaligned
34 bytes for add+adc
20 bytes for fadd
22 bytes for paddq aligned
25 bytes for paddq unaligned
-