Here is another one, with a FastSqrt added:
FastMath FastSqrt ; define a math function
For_ fct=0.0 To 10.0 Step 0.5
fld fct ; X
fld st
fstp REAL10 ptr [edi]
fsqrt ; Y
fstp REAL10 ptr [edi+REAL10]
add edi, 2*REAL10
Next
FastMath ; -------- done -------------
The speed gain is very modest, though:
1859 cycles for 100 * fsqrt
1398 cycles for 100 * FastSqrt
The tangens is more impressive:
10920 cycles for 100 * fptan
1390 cycles for 100 * FastTan