Draft: Refactor JVT selection interface (!78763) · Merge requests · atlas / athena · GitLab

Snippets Groups Projects

Open William Keaton Balunas requested to merge wbalunas/athena:jvt-250228 into main 2 weeks ago

3 unresolved threads

Currently there is a duplication of functionality between the NNJVT tool, which both computes the NNJVT discriminant and applies cuts on it, and and the JVT selection tools, which apply cuts to select jets. This is suboptimal because it is often desirable to do this things in two different places (e.g. computing the variable when making a derivation, then applying the desired cuts at the analysis stage). This MR:

Removes the application of any cuts and validity ranges from the tool which computes NNJVT, along with associated config variables
Updates the JVT selection tools to also implement the IJetDecorator interface, making it easier to compute selection decision flags on the jets
Adjusts the jet trigger configuration to use this new structure (previously it has been relying on the pass decision made at variable computation time).

Tagging @khoo, @cmorenom, @gotero, @pakontax, @jlsmith.

Activity

ATLAS Robot @atlasbot · 2 weeks ago

Developer
This merge request affects 3 packages:

Reconstruction/Jet/JetJvtEfficiency

Reconstruction/Jet/JetMomentTools

Trigger/TriggerCommon/TriggerMenuMT

Affected files list will not be printed in this case

Adding @sutt ,@miochoa ,@goetz ,@cantel ,@lidiaz as watchers
ATLAS Robot added JetEtmiss Reconstruction Trigger TriggerMenu analysis-review-required main review-pending-level-1 labels 2 weeks ago

added JetEtmiss Reconstruction Trigger TriggerMenu analysis-review-required main review-pending-level-1 labels
William Keaton Balunas @wbalunas · 2 weeks ago

Author Developer

@khoo, since you made some relevant code changes recently: Is the Athena CI sufficient to check that this preserves the intended trigger behavior, or is any additional validation needed (if so, how can this be tested)?

Teng Jian Khoo @khoo · 2 weeks ago

Developer

You could run test_trig_mc_v1Dev_slice_jet_grid.py before/after, and diff the HLTChain.txt file. I'd be reasonably satisfied with that as an initial check, before this is merged. If something needed higher stats to be obvious we'd get it in a weekly HLT reprocessing.

Please register or sign in to reply
Teng Jian Khoo @khoo started a thread on the diff 2 weeks ago

Trigger/TriggerCommon/TriggerMenuMT/python/HLT/Jet/TriggerJetMods.py

         UseTrkAugNN = True,
         NNConfigDir = "JetPileupTag/NNJvt/HLT-2025-02-05",
         NNParamFile = "TrkAugNNJVT.Network.graph.HLT.json",
         NNCutFile = "TrkAugNNJVT.Cuts.HLT.json",

ATLAS Robot @atlasbot · 2 weeks ago

Developer

CI integration tests for projects Athena,AnalysisBase are cancelled because of compilation error(s)

ATLAS Robot @atlasbot · 2 weeks ago

Developer

CI Result FAILURE (hash 584b1c84)

	Athena	AnalysisBase	AthAnalysis
externals
cmake
make
tests

Full details available on this CI monitor view. Check the JIRA CI status board for known problems
Athena: number of compilation errors 1, warnings 1
AnalysisBase: number of compilation errors 1, warnings 2
AthAnalysis: number of compilation errors 0, warnings 1
For experts only: Jenkins output (remote access info)

Lukas Kretschmann @lkretsch · 2 weeks ago

Developer

Please resolve the open threads and take a look at the failing tests. -L1

Lukas Kretschmann added label and removed label 2 weeks ago

added review-user-action-required label and removed review-pending-level-1 label

Nils Erik Krumnack @krumnack · 2 weeks ago

Developer

From what I can tell these changes should be fine for analysis. Some of the changes even seem to anticipate what we'll need for columnar code. Approving for AR.

Nils [Analysis RC]

Nils Erik Krumnack added label and removed label 2 weeks ago

added analysis-review-approved label and removed analysis-review-required label

William Keaton Balunas @wbalunas · 2 weeks ago

Author Developer

Investigating the CI issues, I've realized something that seems a bit strange: Instead of taking as input the jet eta and phi, the NN appears to instead take as input features bin indices for eta and phi, which are dependent on the cut configuration. This would appear to defeat the entire purpose of this MR, because it forces the tool evaluating NNJVT to be aware of the cuts.

Is there some reason why it was done this way in the past? We can change it for the consolidated recommendations, but to retain backward compatibility we'll have to at least keep the option to pass in a cuts config file to the NNJVT tool.

Tagging @gotero, @cmorenom, @fballi, @jdandoy for any comments they might have.
Teng Jian Khoo @khoo · 2 weeks ago

Developer

Yes, the choice to discretise eta/pt for the first iteration was due to limited stats, i.e. that we didn't want to induce overtraining or spurious correlations due to sampling a handful of events in particular eta/pt bins.

We have solved the pipeline issues that limited using a larger training sample, but obviously this is baked into the original version. That said, you could very well move/copy the cut information into the model json, and read it from there just for the older offline version, and leave a more elegant solution for the next iteration.
William Keaton Balunas @wbalunas · 2 weeks ago

Author Developer

OK, good to know. I think it's probably best for now to avoid messing with the model json (we'd really need to release a new version with this structure, which seems more trouble than it's worth) so maybe we're better off pausing this MR until the consolidated recommendations are ready to go out, which can use pt/eta directly instead of needing this binning config. I'll mark it as draft for now and then come back in a few months when that's ready.
Please register or sign in to reply

William Keaton Balunas marked this merge request as draft 2 weeks ago

marked this merge request as draft

Thomas Strebler removed label 2 weeks ago

removed review-user-action-required label

Please register or sign in to reply