x86: Optimize svml_s_atanhf16_core_avx512.S (!193)


Noah Goldstein requested to merge users/goldsteinn/svml-optimization-rebased-1 into users/intel/libmvec/master Mar 15, 2022

Optimizations are:
    1. Reduce code size (-58 bytes).
    2. Remove redundant move instructions.
    3. Slightly improve instruction selection/scheduling where
       possible.
    4. Reduce rodata size ([-128, -188] bytes).

Result is roughly a 14% reduction in run time:

        Function,   New Time, Old Time, New / Old
_ZGVeN16v_atanhf,      11.95,   13.879,     0.861
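
The New / Old column can be reproduced from the two timings in the table. The sketch below is only an illustration of that arithmetic, not part of the patch; the helper names are hypothetical, and the timings are copied as reported (the table does not state their units). It also shows that the same numbers can be read two ways: as a reduction in time, or as a gain in throughput.

```python
# Timings copied from the benchmark table above (units as reported there).
NEW_TIME = 11.95
OLD_TIME = 13.879


def time_ratio(new, old):
    """New / Old: values below 1.0 mean the new code is faster."""
    return new / old


def time_reduction_pct(new, old):
    """Percentage of the old run time that was saved."""
    return (1.0 - new / old) * 100.0


def throughput_gain_pct(new, old):
    """Percentage increase in work done per unit time."""
    return (old / new - 1.0) * 100.0


if __name__ == "__main__":
    print(f"New / Old       = {time_ratio(NEW_TIME, OLD_TIME):.3f}")
    print(f"Time reduction  = {time_reduction_pct(NEW_TIME, OLD_TIME):.1f}%")
    print(f"Throughput gain = {throughput_gain_pct(NEW_TIME, OLD_TIME):.1f}%")
```

The ratio rounds to the 0.861 shown in the table, which corresponds to a time reduction just under 14%.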