Fix for HostPrefixSum
All threads resolved!
All threads resolved!
In this commit we moved from _mm_extract_epi32
to _mm_storeu_si32
. This was done because SSE2 is available in all x86_64 processors, whereas SSE4 is not.
When compiling with gcc9 (Ubuntu OS) the _mm_storeu_si32
is not recognized, see here. HostPrefixSum.cpp
has been changed to use _mm_store_ss
instead.
FYI: @dcampora
Edited by Giovanni Bassi
Merge request reports
Activity
added RTA label
- A deleted user
added hlt1-throughput-decreased label
- Resolved by Daniel Hugo Campora Perez
This should not impact performance and be more consistently portable.
assigned to @msaur
added ci-test-triggered label
- [2022-03-18 08:47] Validation started with lhcb-master-mr#4015
- A deleted user
removed hlt1-throughput-decreased label
unassigned @msaur
mentioned in commit 58e6f5e9
mentioned in issue Moore#411 (closed)
Please register or sign in to reply