Enable set -e in all system test scripts

We should be exiting on any failure in our system tests. I already did this when migrating the tests to Python, but I think it makes sense to do it separately first.