Most non-CUDA areas of the code were removed to make the example clearer to read.
OpenACC version follows similar principle.
Most non-CUDA areas of the code were removed to make the example clearer to read.
OpenACC version follows similar principle.