Skip to content

Draft: CUDA Version of LAXLIB and parallel davidson

Pietro Delugas requested to merge pietrodelugas/q-e:laxlib-p_terg_gpu into develop

This MR is a slightly updated version of the M.R. by @sorland and @bonfus, that fixes the distributed iterative diagonalization done by the GPU version of pcegterg and pregterg.

To do:

  • test it before merging
  • more refactoring to replac CUF kernels with openACC
  • implement the matrix distribution in non-contigous blocks (BLACS style)
Edited by Pietro Delugas

Merge request reports