Clang refuses to compile the code, failing with a `cast from pointer to smaller type uint32` error. Use the 64-bit code path for NV_CPU_AARCH64 as well as for NV_CPU_X86_64.
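
A minimal sketch of the kind of guard this change implies, not the actual patched code. The names `PtrBits`, `pointerToBits`, `bitsToPointer` and `checkRoundTrip` are hypothetical, and the `NV_CPU_*` macros are derived here from compiler-provided macros only so the example stands alone; the real headers define them elsewhere.

```cpp
#include <cassert>
#include <cstdint>

// Assumption: stand-in definitions so the sketch compiles on its own.
#if (defined(__aarch64__) || defined(_M_ARM64)) && !defined(NV_CPU_AARCH64)
#define NV_CPU_AARCH64 1
#endif
#if (defined(__x86_64__) || defined(_M_X64)) && !defined(NV_CPU_X86_64)
#define NV_CPU_X86_64 1
#endif

// Pick an integer type wide enough to hold a pointer on each target.
// Before the change only NV_CPU_X86_64 would select the 64-bit branch,
// so AArch64 fell through to the 32-bit cast that Clang rejects.
#if defined(NV_CPU_X86_64) || defined(NV_CPU_AARCH64)
typedef std::uint64_t PtrBits;  // hypothetical name, 64-bit code path
#else
typedef std::uint32_t PtrBits;  // hypothetical name, 32-bit code path
#endif

// Convert a pointer to its integer representation without truncation.
inline PtrBits pointerToBits(const void* p) {
    return reinterpret_cast<PtrBits>(p);
}

// Convert the integer representation back to a pointer.
inline const void* bitsToPointer(PtrBits bits) {
    return reinterpret_cast<const void*>(bits);
}

// Sanity check: a pointer survives the round trip through PtrBits.
inline void checkRoundTrip() {
    int value = 0;
    assert(bitsToPointer(pointerToBits(&value)) == &value);
}
```

With the `NV_CPU_AARCH64` test removed from the `#if`, an AArch64 build would take the `std::uint32_t` branch and the `reinterpret_cast` in `pointerToBits` would be rejected, which is the failure described above.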