Performance/kick numba cpp
The kick function in Numba and C++ was written with several parallel regions and excessive array accessing. The code was refactored to offer a higher execution speed while maintaining the functionality.
P.S. I accidentally included the commits from !55 (merged), thus merging !55 (merged) before is required