Workaround Cuda limits
Kernel launch sizes are restricted in CUDA (but not in OpenCL), this causes problem in Octopus when grids have too many points. This changes work around that.
Edited by Xavier Andrade
Kernel launch sizes are restricted in CUDA (but not in OpenCL), this causes problem in Octopus when grids have too many points. This changes work around that.