Optimization of the batch_get/set_points routines
Description
Optimization of the batch_get/set_points routines.
For full batches and no spinors, further gain is obtained using BLAS directly.
A component test is added to do performance checks.
Benchmark is done for 4 states changing the number of grid points for complex states
News snippet
Code optmimization
Checklist
-
I have checked that my code follows the Octopus coding standards -
I have added tests for all the new features added in this request.