Skip to content

Add support for NVTX to aid GPU profiling

Description

Add a wrapper around the NVTX (NVIDIA Tools Extension) profiling functions using a Fortran and C interface to the C functions as defined in the corresponding header. Also modify configure.ac and the cuda m4 macro to add the necessary header file and library.

The NVTX functions are called in profiling_in and profiling_out to create regions around all functions for which profiling is done right now. This facilitates GPU profiling because one has a timeline where it is clear at which time the code is in which function.

News snippet

Add support for NVTX to aid GPU profiling

Checklist

Merge request reports

Loading