include/petsc/private/matimpl.h · cef0416bfaf3f2eda18a772a528c82211592945c · PETSc / petsc

Add SELLHIP · 773bf0f6

Hong Zhang authored Mar 04, 2024

- The HIP kernels are converted directly from their CUDA counterparts
- AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to facilitate compile-time code optimization
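
To illustrate why a fixed warp size helps the compiler, here is a minimal host-side sketch, not the code from this commit. The macro `SKETCH_WARP_SIZE`, the constant `kWarpSize`, and the function `warp_tree_sum` are hypothetical names introduced only for this example. The sketch assumes a default of 64 when `__HIP_PLATFORM_AMD__` is defined and 32 otherwise.

```cpp
// Hypothetical macro name, not taken from the PETSc sources.
// It can be overridden on the compiler command line.
#ifndef SKETCH_WARP_SIZE
  #if defined(__HIP_PLATFORM_AMD__)
    #define SKETCH_WARP_SIZE 64 // assumed default wavefront width on AMD GPUs
  #else
    #define SKETCH_WARP_SIZE 32 // warp width on NVIDIA GPUs
  #endif
#endif

constexpr int kWarpSize = SKETCH_WARP_SIZE;
static_assert((kWarpSize & (kWarpSize - 1)) == 0, "warp size must be a power of two");

// Host-side stand-in for a warp-level tree reduction.
// WarpSize is a template parameter, so both loop bounds are known at
// compile time and the compiler is free to unroll the loops fully.
template <int WarpSize>
double warp_tree_sum(double (&lane)[WarpSize])
{
  for (int offset = WarpSize / 2; offset > 0; offset /= 2)
    for (int i = 0; i < offset; ++i) lane[i] += lane[i + offset];
  return lane[0]; // lane 0 holds the sum of all original lane values
}
```

With a runtime warp size, the same loops would need bounds read from a variable, which generally prevents full unrolling. Fixing the value per platform trades that flexibility for simpler generated code.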
