Skip to content

EXX FFT decomposition

Victor Yu requested to merge vwzyu/q-e:exx_fft into develop

This merge request intends to fix two things:

  • -pd .true. or .false. was not honored in exx.f90 and exx_band.f90, where pencil decomposition is always used.
  • A GPU to CPU memory transfer was at a wrong place in vexx_gamma_gpu.

For a number of test cases, this merge request leads to a performance improvement from 20% to 50%.

Example: SiC supercell containing 256 atoms.

Results are not affected in any way as far as I can tell.

Merge request reports