Skip to content

Catch monitoring loop and postprocessing errors in distributed mode

Pieter David requested to merge piedavid/bamboo:drivershouldnotcrash into master

Fixes #32 (closed)

In principle, the driver task should not fail beyond recovery after submission any more: sacct/squeue (status update) errors are caught, as well as postProcess (and some information is printed on how to recover).

Also fixed: status updates were done twice, and highlighted the "cache query results" option more in the user guide.

Merge request reports