Fix disabling of CUDA Graphs with CPU forces
There is an MPI reduction to determine if any rank has CPU forces, and if so then disable graphs across all ranks. Previously, this was using a combination of MPI_SUM and MPI_BYTE which can cause undefined behaviour. Now fixed to use MPI_INTEGER (using temporary variables).