Generate ci breakdown using nsys instead of ncu

Change the profiling tool used to generate Allen breakdown. This makes the Allen pipeline much faster (1h instead of 3h) and avoid timeout on some MR.

FYI @msaur @cagapopo @raaij @dovombru

Merge request reports

Loading