This patch implements following evex512 version of string functions. evex512 version takes up to 30% less cycle as compared to evex, depending on length and alignment.
- memrchr function using 512 bit vectors.
code size data:
memrchr-evex.o 704 memrchr-evex512.o 503(-29%)
Placeholder function, not used by any processor at the moment.