Skip to content

detect when a subsystem goes repeatedly into SoftErrorRecovery (ContinouslySoftError LM)

Created by: andreh12

e.g. not more than 3 transitions into SoftErrorRecovery per subsystem in 5 minutes.

counters should be kept per subsystem, information could be stored as timestamps in a queue of length 3 . Implies that a single snapshot is not sufficient to run tests with the corresponding code.

Currently the DAQAggregator may miss the state transition since the Level0 may fix the SoftErrorRecovery before the next snapshot is taken. A potential solution would require the Level0 to export counters how often each subsystem went into SoftErrorRecovery during the current run to a flashlist.