Don't double-define Half functions on aarch64
What does this implement/fix?
This change fixes a compilation error that occurs when compiling for GPU (CUDA) with an aarch64 (aka Arm64) host. Two sets of Half functions are defined, which conflict with each other. To fix this, the change specifically disables the aarch64 versions during the GPU compile phase.