spectrum: speed up beamforming gain calculation in three-gpp-spectrum-propagation-loss-model.cc
Here are the profiles before and after the patch for `lena-lte-comparison-user --simTag=test2-user --trafficScenario=2 --simulator=5GLENA --technology=LTE --numRings=2 --ueNumPergNb=5 --calibration=false --freqScenario=0 --operationMode=FDD --direction=UL --RngRun=1 --RngSeed=1` (from the nr codebase), built with the release profile.
As can be seen, sincos is the main villain (57% of retired, i.e. completed and not aborted, instructions). The caller of all those sincos is none other than `ThreeGppSpectrumPropagationLossModel::CalcBeamformingGain()`.
The reason those sincos calls take almost half of the simulation time (cycles not in halt) is not just that trigonometric functions are slow: the phases passed as arguments are first wrapped to something like [-π/2, π/2] for the best performance. With carrier frequencies in the GHz range and delays of hundreds of nanoseconds, the raw phase -2π·f·τ is hundreds of radians, so this argument reduction is always exercised, and it causes a ton of branch misses, which starve the CPU backend and slow everything down.
Since the delay components of the channel don't change frequently, nor do the frequency bands or the number of clusters, we can in theory cache the computed sincos of the propagation delay phase, eliminating both the trigonometric calls and the branch misses from wrapping. After doing this, we get the following profile.
In terms of speedup, we get about 1.4x (~300s -> ~214s).
P.S.: A previous version of this MR incorrectly claimed a much larger speedup, but the number of events was completely off...
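For illustration, a minimal sketch of the caching idea described above (hypothetical names, not the identifiers from the patch):

```cpp
#include <cmath>
#include <complex>
#include <vector>

// Hypothetical cache of exp(-j*2*pi*f_sb*tau_c) per (subband, cluster).
// Names (subbandFrequencies, clusterDelays) are illustrative only.
std::vector<std::vector<std::complex<double>>>
BuildDelaySincosCache(const std::vector<double>& subbandFrequencies, // subband centers [Hz]
                      const std::vector<double>& clusterDelays)      // per-cluster delay [s]
{
    std::vector<std::vector<std::complex<double>>> cache(subbandFrequencies.size());
    for (std::size_t sb = 0; sb < subbandFrequencies.size(); ++sb)
    {
        cache[sb].reserve(clusterDelays.size());
        for (double tau : clusterDelays)
        {
            const double phase = -2.0 * M_PI * subbandFrequencies[sb] * tau;
            // std::polar evaluates cos/sin once; the result is reused on every
            // gain calculation until the delays or bands change.
            cache[sb].push_back(std::polar(1.0, phase));
        }
    }
    return cache;
}
```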
Activity
added module: spectrum label
assigned to @Gabrielcarvfer
added 1 commit
- 76cacd0d - spectrum: optimize beamforming cluster gain calculation
- Resolved by Gabriel Ferreira
Well, you caught my attention :) There are a few things I don't understand here.
First, the outer loop seems unnecessarily complicated in construction, probably a remnant of the `while`. Could it be simply

```cpp
auto sbit = tempPsd->ConstBandsBegin();
for (auto vit = tempPsd->ValuesBegin(); vit != tempPsd->ValuesEnd(); ++vit, ++sbit)
```
Second, the inner `for` loops all run over the same range, but only modify a single `temp[i]` each iteration. Could these loops be combined (eliminating the need for a vector `temp`)?

```cpp
for (auto vit...)
{
    const double fsb = sbit->fc;
    std::complex<double> subbandsGain(0, 0);
    for (uint16_t cIndex = 0; cIndex < numCluster; ++cIndex)
    {
        const double delay = -2 * M_PI * fsb * channelParams->m_delay[cIndex];
        std::complex<double> temp(std::cos(delay), std::sin(delay));
        temp *= longTerm[cIndex] * doppler[cIndex];
        subbandsGain += temp;
    }
    const double gain = std::norm(subbandsGain);
    *vit *= gain;
}
```
I guess this is essentially reverting to the previous version with a single inner loop. Perhaps you split it to understand which computation line was expensive?
Third, the scaling by `longTerm * doppler` is unmodified by either of these loops, so it could be hoisted outside completely (at the cost of reintroducing the vector `temp`):

```cpp
PhasedArrayModel::ComplexVector longTermDoppler = longTerm * doppler; // element-wise multiplication
MatrixArray<double> twoPiDelay = -2 * M_PI * channelParams->m_delay;  // scalar multiply
for (auto vit...)
{
    MatrixArray<double> fsbDelay = sbit->fc * twoPiDelay;             // scalar multiply
    PhasedArrayModel::ComplexVector temp(numCluster);
    for (uint16_t cIndex = 0; cIndex < numCluster; ++cIndex)
    {
        const double delay = fsbDelay[cIndex];
        temp[cIndex] = std::complex<double>(std::cos(delay), std::sin(delay));
    }
    *vit *= std::norm(DotProduct(temp, longTermDoppler));             // vector dot product
}
```
(This assumes we embellish `MatrixArray` and `ComplexVector` with Expression Templates, or expand out the commented lines. The `DotProduct` is just `std::inner_product(std::begin(temp), std::end(temp), std::begin(longTermDoppler), std::complex<double>(0.0))`.)
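For the record, a self-contained sketch of that `DotProduct` (plain `std::vector` alias; unconjugated product, matching the sum the loop computes):

```cpp
#include <complex>
#include <numeric>
#include <vector>

using ComplexVector = std::vector<std::complex<double>>;

// Plain dot product: sum_c a[c] * b[c]. Note the complex-typed initial
// value; a bare 0.0 would make the accumulator a double and fail to compile.
std::complex<double> DotProduct(const ComplexVector& a, const ComplexVector& b)
{
    return std::inner_product(a.begin(), a.end(), b.begin(),
                              std::complex<double>(0.0));
}
```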
Fourth, the vector `longTerm` is weird:

- It's instantiated (l. 140), but never populated, so it should be empty.
- Its size is tested in an assert (l. 171). Why doesn't this fire?
- It's indexed (above, or in the patch l. 249, or in the original l. 243). How can that work if it's empty?
Edited by Peter Barnes
added performance label
- Resolved by Eduardo Almeida
I have been looking at and testing this for some time. All tests are passing. I checked the `ThreeGppSpectrumPropagationLossModel` logs up to 0.22s (800k lines), and they are matching perfectly.

Tried a few different seeds, but still can't explain the difference in the number of events.
Testing with `lena-lte-comparison-user --simTag=test2-user --trafficScenario=2 --simulator=5GLENA --technology=LTE --numRings=2 --ueNumPergNb=5 --calibration=false --freqScenario=0 --operationMode=FDD --direction=UL --RngRun=1 --RngSeed=1`:

| Seed | Original (A) | Modded (B) | Difference (A-B) |
|------|--------------|------------|------------------|
| 1    | 19883834     | 19818277   | 65557            |
| 2    | 19534947     | 19505701   | 29246            |
| 3    | 19458601     | 19402595   | 56006            |

The maximum error found when comparing the two approaches was about 6.31089e-29...
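The check behind that error figure is a plain elementwise comparison of the resulting PSD values; a hypothetical sketch (not part of the MR):

```cpp
#include <algorithm>
#include <cmath>
#include <cstddef>
#include <vector>

// Max absolute elementwise difference between the PSDs produced by the
// original and the cached implementations (hypothetical helper; assumes
// both vectors have the same length).
double MaxAbsError(const std::vector<double>& original, const std::vector<double>& modded)
{
    double maxErr = 0.0;
    for (std::size_t i = 0; i < original.size(); ++i)
    {
        maxErr = std::max(maxErr, std::abs(original[i] - modded[i]));
    }
    return maxErr; // on the order of 6e-29 in the runs above
}
```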
Edited by Gabriel Ferreira

added 42 commits
- 0754a894...e48ffcf4 - 41 commits from branch nsnam:master
- b42541c2 - spectrum: precompute channel delay sincos
Still a long way to go to get this perfect, but 37% -> 10% of time spent on sincos is a win in my book.

It can probably get to a 2x speedup with improvements to the data layout, since most of the time is now wasted waiting for data to become available.

`std::complex<double>` being 16 bytes wide means my 256 kB L1d cache can fit only 16k values in the best-case scenario. The biggest SpectrumValue I saw had 1-2k bands, and the cached sincos for the doppler component has that many values times the number of clusters (e.g. 16), so we end up flushing the entire L1d cache to compute that last loop.
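Spelled out, that footprint arithmetic (band and cluster counts from above; a sketch, not code from the patch):

```cpp
#include <complex>
#include <cstddef>
#include <cstdio>

int main()
{
    const std::size_t bands = 2000;   // largest SpectrumValue observed above
    const std::size_t clusters = 16;  // e.g. number of clusters
    const std::size_t bytes = bands * clusters * sizeof(std::complex<double>);
    // 2000 * 16 * 16 B = 512000 B ~ 500 KiB: roughly twice a 256 kB L1d,
    // so the cached sincos table alone evicts everything else.
    std::printf("cached sincos footprint: %zu KiB\n", bytes / 1024);
}
```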
Reducing the precision to `std::complex<float>` gives an additional ~1.1x speedup, but I'm not sure how much precision is required, so I'm not touching it for now.

Edited by Gabriel Ferreira

Trying fancy floating-point compression with Zfp by LLNL. If it works, it should give a nice speedup.
Update: didn't work...
Edited by Gabriel Ferreira

While Intel might not have the best processors, they certainly do have the best tools. It clearly points to the memory bottleneck in

```cpp
auto subsbandGain = (doppler.GetValues() * channelParams->m_cachedDelaySincos[i]).sum();
```
Also, the P-core vs E-core memory performance difference shows their strategy is garbage (for our use case) and we should probably pin simulations to performance cores.
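One way to do that pinning (a Linux-only sketch; which logical CPUs are P-cores is machine-specific, so 0-15 below is purely illustrative; `taskset -c 0-15 ./ns3 run ...` achieves the same from the shell):

```cpp
#include <sched.h> // Linux-only: cpu_set_t, sched_setaffinity

// Pin the current process to a core range so the simulation stays on the
// P-cores. Check /sys/devices/cpu_core/cpus for the actual P-core ids;
// 0-15 here is an assumption for illustration.
bool PinToCores(int first, int last)
{
    cpu_set_t set;
    CPU_ZERO(&set);
    for (int cpu = first; cpu <= last; ++cpu)
    {
        CPU_SET(cpu, &set);
    }
    // pid 0 = the calling process
    return sched_setaffinity(0, sizeof(set), &set) == 0;
}
```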
Edited by Gabriel Ferreira

Sidetracking as usual: a friend of mine brought to my attention that this performance difference between the P-cores (Golden Cove) and E-cores (Gracemont) is actually in the load-store part: the E-cores are missing a load unit, load and store scheduling is shared, and the number of registers/queue entries is much smaller.
I didn't notice while profiling, but it is actually in the results.
Edited by Gabriel Ferreira

added 245 commits
- 0f627666...efcbd25d - 244 commits from branch nsnam:master
- 87b6070e - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
added 1 commit
- 1d1899e0 - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
- Resolved by Gabriel Ferreira
So, what is the final performance gain after adding this optimization? Thanks!
added 1 commit
- b21de954 - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
added 1 commit
- 147ffdac - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
For the `cttc-nr-mimo-demo`, ~1.36x with `cttc-nr-mimo-demo --numHPortsGnb=1 --xPolGnb=1 --rankLimit=2 --gnbUeDistance=300 --enableMimoFeedback=1`

Edited by Gabriel Ferreira

mentioned in merge request cttc-lena/nr!111 (merged)
mentioned in merge request !1846 (merged)
added 54 commits
- 147ffdac...fec2c7b4 - 53 commits from branch nsnam:master
- 2a400f4e - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
added 1 commit
- d3e57dd2 - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
added 1 commit
- 8e0a5b13 - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
added 1 commit
- aae8b5a7 - spectrum: cache sincos in ThreeGppSpectrumPropagationLossModel
- Resolved by Gabriel Ferreira