felipe is correct here, if you get the register list in the masm 64 bit reference section, transient registers are used differently from system or fixed registers and it works in much the same way as the Intel 32 bit ABI, some registers are reserved by the OS and the others are transient.
thanks Hutch
so 32bit use free registers used to change X and Y first avoids 3push/3pops,its push,push,push,push,call bitblt or other imagedrawing call,question if fixed point 16:16 shift to get only integer part
64bit use the right 4 registers to change X,xlow and Y,ylow and shift and call bitblt,few cycles faster!!!