Add numext::fma and missing pmadd implementations.

This fixes the packetmath tests for float16/bfloat16.

Merge request reports

Loading