OpenCL: add limited support for AMD RDNA devices
OpenCL kernels are broken on AMD RDNA devices because these devices use a 32-wide wavefront (Wave32), while the kernels assume a width of 64. However, we can force Wave64 mode by passing the appropriate compiler flags.
Performance has not been measured and is probably suboptimal. Intended for use in CI only.
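
As a rough illustration of forcing Wave64 through compiler flags, here is a minimal sketch of choosing extra OpenCL build options per device. It is not the code from this change. The helper name `extraBuildOptions` is hypothetical. It assumes that RDNA devices report a name starting with `gfx10`, and that the driver's OpenCL compiler accepts Clang's AMDGPU option `-mwavefrontsize64`. Both assumptions should be checked against the target driver.

```cpp
#include <string>

// Hypothetical helper, for illustration only.
// Returns extra OpenCL build options needed for the given device,
// or an empty string when no special handling is required.
std::string extraBuildOptions(const std::string& vendor, const std::string& deviceName)
{
    // CL_DEVICE_VENDOR on AMD platforms contains "Advanced Micro Devices".
    const bool isAmd = vendor.find("Advanced Micro Devices") != std::string::npos;

    // ASSUMPTION: RDNA devices expose a name with the "gfx10" prefix
    // (e.g. "gfx1030"); other drivers may report marketing names instead.
    const bool isRdna = deviceName.rfind("gfx10", 0) == 0;

    if (isAmd && isRdna)
    {
        // ASSUMPTION: the driver's compiler accepts this Clang AMDGPU flag,
        // which requests 64-wide wavefronts instead of the default Wave32.
        return "-mwavefrontsize64";
    }
    return "";
}
```

The returned string would be appended to the `options` argument of `clBuildProgram` when compiling the kernels for that device.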
Edited by Andrey Alekseenko