Draft: CUDA Version of LAXLIB and parallel davidson (!1952) · Merge requests · QEF - Quantum ESPRESSO Foundation / q-e

Pietro Delugas requested to merge pietrodelugas/q-e:laxlib-p_terg_gpu into develop Aug 16, 2022

This MR is a slightly updated version of the M.R. by @sorland and @bonfus, that fixes the distributed iterative diagonalization done by the GPU version of pcegterg and pregterg.

To do:

test it before merging
more refactoring to replac CUF kernels with openACC
implement the matrix distribution in non-contigous blocks (BLACS style)

Edited Sep 09, 2022 by Pietro Delugas

Draft: CUDA Version of LAXLIB and parallel davidson

Merge request reports