Skip to content

Various code simplifications, leading to speed improvements in HltControlFlowMgr

These changes are purely motivated by speed measurements and usage of flamegraphs. It looks like gcc has a hard time inlining/optimizing in the context of templated + recursive calls (here ControlFlowNode::reserved in particular).

It would be interesting to understand more deeply why these changes are helping.

As a rough number, we get a factor 2 speedup on TurboSprucing with this. And in theory, we could see a 5% gain on overall HLT2 (to be confirmed)

Edited by Sebastien Ponce

Merge request reports

Loading