a_selinger/feature faster scalable matmatmult
Created by: a_selinger
This optimization speeds up the scalable matrix-matrix multiplication by a factor of about 70.
The scalable implementation now takes roughly the same amount of time as the nonscalable implementation. The speedup was achieved by omitting the extremely slow calculation of apnz_max, the maximum number of nonzeros per row of the result matrix.
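
For context, here is a minimal sketch of why an exact value for the maximum number of nonzeros per row of the product is costly to obtain. It is not the code touched by this change: the function name and the plain-list CSR arguments are hypothetical, chosen only for illustration. It assumes both matrices are given as CSR sparsity patterns (row pointers and column indices) and shows that the exact count requires merging the column patterns of every contributing row of B for every row of A, which is essentially a full symbolic product.

```python
def max_nnz_per_row_of_product(a_indptr, a_indices, b_indptr, b_indices):
    """Exact maximum number of nonzeros in any row of C = A * B.

    A and B are described only by their CSR sparsity patterns.
    Hypothetical helper for illustration, not part of any library.
    """
    max_nnz = 0
    for i in range(len(a_indptr) - 1):
        # Row i of C has a nonzero in column j if some k exists with
        # A[i, k] != 0 and B[k, j] != 0, so merge the patterns of those B rows.
        cols = set()
        for k in a_indices[a_indptr[i]:a_indptr[i + 1]]:
            cols.update(b_indices[b_indptr[k]:b_indptr[k + 1]])
        max_nnz = max(max_nnz, len(cols))
    return max_nnz


# Pattern of A (2 x 3): row 0 has columns {0, 1}, row 1 has column {2}.
a_indptr, a_indices = [0, 2, 3], [0, 1, 2]
# Pattern of B (3 x 2): row 0 -> {0}, row 1 -> {0, 1}, row 2 -> {1}.
b_indptr, b_indices = [0, 1, 3, 4], [0, 0, 1, 1]

print(max_nnz_per_row_of_product(a_indptr, a_indices, b_indptr, b_indices))
```

The work in the inner loop is proportional to the number of scalar multiplications in the product itself, so computing this value up front can cost about as much as the symbolic phase it is meant to support.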