CUDA Toolkit 13.0 compatibility fix
Fixes for toolkit 13.0. Tested on both 12.9 and 13.0.
The struct cudaDeviceProp prop;
no longer contains clockRate
and memoryClockRate
for some reason.
These are now retrieved by passing a reference as cudaDeviceGetAttribute(&clockRate, cudaDevAttrClockRate, device_index);
Also had to specify the exact compute architectures in cmake. The "all" or "compute_all" flag seems like an 11.0 feature.