since tbb::task_scheduler_init is now deprecated, lets try a tbb::task_arena for our threadpool
in the scheduler code. Scheduler and moore tests run through locally on avx2/gcc9/opt.
I did not do any relevant throughput tests, so not sure how/whether this changes things.