Fix issue with nrm2 function on AMD GPUs
Description
It seems that running several nrm2 kernels in parallel on different streams triggers a race condition on AMD GPUs because it leads to stochastic deviations of the results from the expected one. Fix this by only executing in one stream.