Fix event consumption counting for CUDA graphs
Previously, debug build runs with CUDA graphs were crashing due to event consumption counting issues associated with re-marking of events. This change avoids event remarking for PME and PP on same rank, and allows the recessary coordinate availability remarking to allow graphs to be defined and launched through setConsumptionLimits().