Following from this MR, code was modified to run quicker. Currently the longest step is unrolling the tracks as the dataframe has duplicated nested lists of list:
entry | subentry | Some track variable |
---|---|---|
0 | 0 | [[28.354776, 13.942179, 9.871879, 12.308242], ... |
1 | [[28.354776, 13.942179, 9.871879, 12.308242], ... | |
2 | [[28.354776, 13.942179, 9.871879, 12.308242], ... | |
4 | [[28.354776, 13.942179, 9.871879, 12.308242], ... |
which takes over 8 minutes. Overall, the jet and track information are obtained in 12 minutes. If there is a method to reduce the time it takes to unnest the list and remove duplicates, then that would make the whole procedure run a lot faster.