Skip to content
  • Hong Zhang's avatar
    Add SELLHIP · 773bf0f6
    Hong Zhang authored
    - The HIP kernels are converted directly from their CUDA version
    - AMD GPUs and NVIDIA GPUs use different warp sizes. We set the warp size to 64 by default for AMD GPUs to faciliate compile-time code optimization
    773bf0f6