Hong Zhang authored
- The HIP kernels are converted directly from their CUDA versions
- AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to facilitate compile-time code optimization
773bf0f6
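For illustration only, here is a minimal host-side C++ sketch of why a fixed warp size helps the compiler. It is not the code from this commit: the macro `SKETCH_WARP_SIZE`, the constant `kWarpSize`, and the function `warp_reduce_sum` are hypothetical names, and the reduction runs over a plain array rather than GPU lanes. It assumes the default is chosen from the `__HIP_PLATFORM_AMD__` platform macro. Because the width is a compile-time constant, the loop trip counts are known and the compiler can unroll them fully.

```cpp
#include <array>
#include <cstddef>

// Hypothetical macro; the commit message does not name the real identifier.
#ifndef SKETCH_WARP_SIZE
#  if defined(__HIP_PLATFORM_AMD__)
#    define SKETCH_WARP_SIZE 64  // default for AMD GPUs, per the commit message
#  else
#    define SKETCH_WARP_SIZE 32  // warp size on NVIDIA GPUs
#  endif
#endif

constexpr std::size_t kWarpSize = SKETCH_WARP_SIZE;
static_assert((kWarpSize & (kWarpSize - 1)) == 0,
              "warp size must be a power of two");

// Host-side stand-in for a warp-level tree reduction.
// Width is a template parameter, so every loop bound is a compile-time
// constant and the compiler is free to unroll both loops.
template <std::size_t Width>
float warp_reduce_sum(std::array<float, Width> lanes) {
  for (std::size_t offset = Width / 2; offset > 0; offset /= 2) {
    for (std::size_t lane = 0; lane < offset; ++lane) {
      lanes[lane] += lanes[lane + offset];  // combine with the partner lane
    }
  }
  return lanes[0];
}
```

A kernel written this way would call `warp_reduce_sum<kWarpSize>(...)`, so the same source serves both vendors and only the constant changes.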