Well, you can't take old 1980s asm code that was written for a 1980s CPU, where shifts were preferred over muls, and expect it to run fast on a P4. And compiler writers are probably skilled in cycle timings too.
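To make the shifts-versus-muls point concrete, here is a rough sketch in C rather than asm. The function names (`times10_shift`, `times10_mul`, `check_times10`) are made up for illustration. The first one spells out the old shift-and-add trick by hand, the second just writes the multiply and leaves the choice of instructions to the compiler. Which one ends up faster depends on the target CPU and the compiler, so take it as an illustration, not a benchmark.

```c
#include <assert.h>
#include <stdint.h>

/* Old-school style: x*10 = x*8 + x*2, built from two shifts and an add. */
uint32_t times10_shift(uint32_t x) {
    return (x << 3) + (x << 1);
}

/* Plain version: the compiler is free to emit a mul, shifts and adds,
   or whatever it considers cheapest for the target CPU. */
uint32_t times10_mul(uint32_t x) {
    return x * 10u;
}

/* Sanity check: both versions agree, including on unsigned wraparound. */
void check_times10(void) {
    for (uint32_t x = 0; x < 100000u; ++x) {
        assert(times10_shift(x) == times10_mul(x));
    }
    assert(times10_shift(0xFFFFFFFFu) == times10_mul(0xFFFFFFFFu));
}
```

Since both give the same result, the only open question is speed, and that has to be measured on the CPU you actually care about.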
Asm programmers keep up to date on what's fastest: they read the Intel manuals and time things, they don't stay stuck in the old DOS days.
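For the "time things" part, a minimal sketch of a timing harness in C could look like the one below. All names here (`kernel_fn`, `mul_by_10`, `shift_by_10`, `time_kernel`) are hypothetical, and it uses the standard `clock()` call, which is coarse. For serious work you would probably use the CPU's cycle counter and repeat the run many times, so treat this only as a starting point.

```c
#include <stdint.h>
#include <time.h>

/* A kernel is any function we want to time. */
typedef uint32_t (*kernel_fn)(uint32_t);

uint32_t mul_by_10(uint32_t x)   { return x * 10u; }
uint32_t shift_by_10(uint32_t x) { return (x << 3) + (x << 1); }

/* Runs f over 0..iters-1 and returns the elapsed CPU time in seconds.
   The checksum is written to *sink so the loop's work is observable
   and cannot simply be thrown away by the optimizer. */
double time_kernel(kernel_fn f, uint32_t iters, uint32_t *sink) {
    uint32_t acc = 0;
    clock_t start = clock();
    for (uint32_t i = 0; i < iters; ++i) {
        acc += f(i);
    }
    clock_t end = clock();
    *sink = acc;
    return (double)(end - start) / CLOCKS_PER_SEC;
}
```

You would call `time_kernel(mul_by_10, n, &a)` and `time_kernel(shift_by_10, n, &b)` with a large `n`, check that `a == b`, and compare the two times. Keep in mind that an optimizing compiler may well turn both kernels into the same machine code, which is kind of the point.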
Well, a <insert favourite language here> programmer can also use SIMT skills to make a program faster.