Programmatic start/stop of profiling

Currently the profiling starts with a delay so that it doesn't capture the initialization. This is very fiddly and the good value is not uniform across tests. We need a way to programatically start/stop he profiling

In man perf-record there is an example of how to enable events programmatically (look for Example of bash shell script to enable and disable events during measurements). For example, this can be wrapped in an algorithm similar to IntelProfile in Gaudi.

/cc @gunther @chasse