Add OpenMP statements to more loops. This improves the scaling at least up to 4 OpenMP threads, also for hybrid simulations. Results are best for large grids where the overhead of creating threads is comparatively smaller.
Improve OpenMP support.
- I have checked that my code follows the Octopus coding standards