Vector reward for Nstep

Reported at https://github.com/ymd-h/cpprb/discussions/7

ValueError: could not broadcast input array from shape (2,1) into shape (1,1)

Assignee Loading
Time tracking Loading