Add memory profiling to batch allocations
Description
Up to now, batch allocations have not been accounted for in the memory profiling because the allocation is done in C to enable aligned memory for CPU runs (better vectorization) and pinned memory for GPU runs (faster transfer speeds). Now, the memory profiling takes these allocations into account.
Checklist
-
I have checked that my code follows the Octopus coding standards -
I have added tests for all the new features added in this request.