Avoid zero-filling ptmp in hpsi_dot_v of cegterg.
Zero-filling a large array is an expensive operation, especially when using threads. Zeroing ptmp does not seem to be needed, since the zgemm call that follows appears to take care of it. This aligns the operation with how psi is handled.
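
For reviewers, here is a minimal sketch of why the zeroing should be redundant. It assumes the zgemm call in question uses beta = 0, which is not checked here. The reference BLAS documentation states that C need not be set on entry when beta is zero. `gemm_like` is a hypothetical pure-Python stand-in for zgemm, not the actual routine, and the NaN-filled `ptmp` only simulates an uninitialized work array.

```python
import math


def gemm_like(alpha, a, b, beta, c):
    """Hypothetical stand-in for zgemm: C <- alpha*A*B + beta*C, in place.

    Mirrors the reference BLAS convention: when beta == 0, C is never read.
    """
    m, k, n = len(a), len(b), len(b[0])
    for i in range(m):
        for j in range(n):
            acc = sum(a[i][p] * b[p][j] for p in range(k))
            if beta == 0:
                # Old contents of C are ignored, so garbage cannot leak through.
                c[i][j] = alpha * acc
            else:
                c[i][j] = alpha * acc + beta * c[i][j]
    return c


# Simulate an uninitialized (never zeroed) work array with NaNs.
nan = complex(math.nan, math.nan)
a = [[1 + 1j, 2], [0, 1j]]
b = [[1, 0], [0, 1]]  # identity, so the product equals a
ptmp = [[nan, nan], [nan, nan]]

gemm_like(1, a, b, 0, ptmp)
```

With beta = 0 the NaNs are overwritten and never read. With a nonzero beta they would propagate into the result, so the zeroing can only be dropped if every call that writes ptmp really uses beta = 0.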