Skip to content
Snippets Groups Projects

BFieldCache Change the code so as to emit more clang like assembly for gcc

Merged Christos Anastopoulos requested to merge ATLAS-EGamma/athena:inside_ala_Clang_gcc into master

BFieldCache Change the code so as to emit more clang like assembly for gcc

see [https://godbolt.org/z/zK145fPbe]

ping @amete , this seems to improve things a bit.

But is the reason I asked if we had something with "micro-arch" for vtune.

This is usually called every time the propagators needs to access the magnetic field and is hard to accurately see if was an issue or not.

callgrind shows less branches and less mis-prediction (but callgrind emulates an old processor here). clang assumes the branch will be hard to predict and emit assembly for that assumption, gcc not ...

In some sense for this (no just this bit but the full chain) would be good to get some info on if there is something to improve ...

Edited by Christos Anastopoulos

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
Please register or sign in to reply
Loading