Use numext::fma for sparse x dense dot product.
This change improves the accuracy of SparseQR and gives a small speedup on the example in #2583: run time drops from 170 s to 163 s (Skylake-X, AVX2).
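For readers unfamiliar with the technique, here is a minimal standalone sketch of a sparse x dense dot product accumulated with fused multiply-add. It is not the Eigen code touched by this change: `SparseVec` and `sparse_dense_dot_fma` are hypothetical names made up for illustration, and `std::fma` from `<cmath>` stands in for `numext::fma`. A fused multiply-add rounds `a * b + c` once instead of twice, which is the likely source of the accuracy gain.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical minimal sparse vector: parallel arrays of the nonzero
// positions and their values (not an Eigen type).
struct SparseVec {
  std::vector<std::size_t> indices;
  std::vector<double> values;
};

// Dot product of a sparse vector with a dense vector.
// Each step computes values[k] * dense[indices[k]] + acc with a single
// rounding via std::fma, rather than rounding the product and the sum
// separately.
double sparse_dense_dot_fma(const SparseVec& s, const std::vector<double>& dense) {
  assert(s.indices.size() == s.values.size());
  double acc = 0.0;
  for (std::size_t k = 0; k < s.indices.size(); ++k) {
    assert(s.indices[k] < dense.size());
    acc = std::fma(s.values[k], dense[s.indices[k]], acc);
  }
  return acc;
}
```

The effect shows up when a product nearly cancels the running sum. With `a = 1 + 2^-27` and `b = 1 - 2^-27`, the exact product is `1 - 2^-54`, which rounds to `1.0` as a double, so a separately rounded `a * b - 1` gives `0`, while the fused form keeps the residual `-2^-54`.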
Flame graph before:
Flame graph after:
Fixes #2583
Edited by Charles Schlosser

