Experiment with FMV (function multiversioning) for gridding code
This issue was copied from the ASTRON Jira board: RAP-36
Description
WSClean doesn't use separate functions for SSE and AVX2, as AOFlagger does. Hence, optimization of WSClean needs to be done a bit differently. Here we can probably use __attribute__((target_clones(...))), which instructs the compiler to generate a separate version of the same function for each of the target features listed in ... and to select the best available version for the CPU at run time (see for example this document for more information).
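As a rough illustration of the idea, here is a minimal sketch of target_clones applied to a simple accumulation loop. The function AccumulateWeighted and the macro GRID_MULTIVERSION are hypothetical names made up for this example and are not taken from the WSClean code base. The chosen target list ("avx2", "sse4.2", "default") is an assumption as well. The attribute is only enabled on x86-64 Linux with glibc, since it relies on ifunc support; elsewhere the macro expands to nothing and a single version is compiled.

```cpp
#include <cstddef>
#include <vector>

// Enable multiversioning only where the compiler and platform support it.
// GRID_MULTIVERSION is a hypothetical helper macro for this sketch.
#if defined(__x86_64__) && defined(__linux__) && defined(__GLIBC__) && \
    defined(__has_attribute)
#if __has_attribute(target_clones)
#define GRID_MULTIVERSION \
  __attribute__((target_clones("avx2", "sse4.2", "default")))
#endif
#endif
#ifndef GRID_MULTIVERSION
#define GRID_MULTIVERSION
#endif

// Hypothetical stand-in for a gridding inner loop: adds a scaled kernel row
// to a grid row. With target_clones, the compiler emits one clone per listed
// target and a resolver that picks the best clone for the CPU at load time.
GRID_MULTIVERSION
void AccumulateWeighted(float* grid, const float* kernel, float value,
                        std::size_t n) {
  for (std::size_t i = 0; i != n; ++i) {
    grid[i] += kernel[i] * value;
  }
}
```

The caller does not change: it calls AccumulateWeighted as usual, and the dispatch happens behind the scenes. Whether the clones actually differ depends on the compiler being able to auto-vectorize the loop, which is one of the things the experiment would need to check.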
It is very important to have a good way to measure performance. Andre Offringa will provide information on how this can best be done for WSClean.
Acceptance criteria:
- Report on the results
- Sketch of the direction to take
- Follow-up tickets are created in Jira.