Use only SSE2 on prefix sum
This MR makes the manually vectorised implementation of the prefix sum use at most SSE2 instructions, which are available on every x86_64 CPU.
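
For illustration only, here is a minimal sketch of what an SSE2-only prefix sum can look like. It is not the code changed in this MR: the function name `prefix_sum_sse2`, the in-place interface, the 32-bit element type and the choice of an inclusive sum are all assumptions. Every intrinsic used comes from `<emmintrin.h>` (SSE2), so nothing beyond the x86_64 baseline is required.

```cpp
#include <cstddef>
#include <cstdint>
#include <emmintrin.h> // SSE2 intrinsics only

// Hypothetical helper (not from the MR): in-place inclusive prefix sum
// over 32-bit unsigned counters, using SSE2 instructions only.
inline void prefix_sum_sse2(std::uint32_t* data, std::size_t n) {
  __m128i carry = _mm_setzero_si128(); // running total, broadcast to all lanes
  std::size_t i = 0;
  for (; i + 4 <= n; i += 4) {
    __m128i x = _mm_loadu_si128(reinterpret_cast<const __m128i*>(data + i));
    // In-register scan: add the vector shifted up by one lane, then by two.
    x = _mm_add_epi32(x, _mm_slli_si128(x, 4));
    x = _mm_add_epi32(x, _mm_slli_si128(x, 8));
    // Add the total of all previous blocks.
    x = _mm_add_epi32(x, carry);
    _mm_storeu_si128(reinterpret_cast<__m128i*>(data + i), x);
    // Broadcast the highest lane, which now holds the new running total.
    carry = _mm_shuffle_epi32(x, _MM_SHUFFLE(3, 3, 3, 3));
  }
  // Scalar tail for the remaining 0 to 3 elements.
  std::uint32_t total = static_cast<std::uint32_t>(_mm_cvtsi128_si32(carry));
  for (; i < n; ++i) {
    total += data[i];
    data[i] = total;
  }
}
```

The block-to-block carry is broadcast with `_mm_shuffle_epi32`, which is itself an SSE2 instruction, so the sketch avoids the SSSE3 and SSE4.1 shuffles and extracts that a wider instruction set would offer.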
Potentially solves MooreAnalysis#30 (closed)
Edited by Daniel Hugo Campora Perez
Activity
mentioned in issue MooreAnalysis#30 (closed)
added RTA label
- A deleted user
added hlt1-throughput-decreased label
removed hlt1-throughput-decreased label
assigned to @peilian
- Resolved by Daniel Hugo Campora Perez
/ci-test
added ci-test-triggered label
- [2022-03-04 21:45] Validation started with lhcb-master-mr#3874
assigned to @rmatev
mentioned in issue Moore#397 (closed)
added 80 commits
- 4ee9fbea...0c4581f9 - 79 commits from branch master
- 7388371f - Use only SSE2 on prefix sum.
- A deleted user
added hlt1-throughput-decreased label
- A deleted user
removed hlt1-throughput-decreased label
unassigned @peilian