Add AVX512 s/dgemm optimizations for compute kernel (2nd try)
This is a follow-up to resolve issues that came up after !860 was merged and then reverted.
It addresses the following:
- Build issue on NEON64
- Rename/move of transpose for trsm
- Rename member in data mapper to something other than incr to avoid shadowing
I'm keeping the commits separate for now to help with readability. I will squash all of them on the final rebase.
Edited by aaraujom