Fix integer overflows in GPU code
Description
For large batch sizes, the product of the number of grid points and the number of states in a batch could overflow. Change the integers involved to 8-byte integers.
Also fix the central kernel that computes stencils (operate.cl): use long long
for all indices to avoid integer overflows in that kernel.
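
As an illustration of the failure mode described above, here is a minimal sketch in plain C, not the actual code of this change. The function names `batch_size_64` and `fits_in_int32` and the example sizes are hypothetical, and the sketch assumes the original indices were 4-byte signed integers. It shows why the operands have to be widened before the multiplication rather than after it.

```c
#include <assert.h>
#include <limits.h>
#include <stdint.h>

/* Hypothetical helper: total number of elements in a batch,
 * i.e. grid points times states, computed in 8-byte arithmetic.
 * Both operands are widened BEFORE the multiplication; casting the
 * result of a 4-byte multiplication would be too late. */
int64_t batch_size_64(int32_t np, int32_t nst) {
    assert(np >= 0 && nst >= 0);
    return (int64_t)np * (int64_t)nst;
}

/* Hypothetical helper: returns 1 if the product would still fit into a
 * 4-byte signed integer, 0 if a 4-byte multiplication would overflow. */
int fits_in_int32(int32_t np, int32_t nst) {
    return batch_size_64(np, nst) <= (int64_t)INT32_MAX;
}
```

For example, with 100,000,000 grid points and 32 states per batch the product is 3,200,000,000, which exceeds the 4-byte signed maximum of 2,147,483,647, so `fits_in_int32(100000000, 32)` returns 0 while `batch_size_64` still returns the correct value.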
Edited by Sebastian Ohlmann