Fix for HostPrefixSum

Merged Giovanni Bassi requested to merge prefix_sum_fix into master

This commit replaces _mm_extract_epi32 with _mm_storeu_si32, because SSE2 is available on all x86_64 processors, whereas SSE4.1 (which _mm_extract_epi32 requires) is not.
However, when compiling with gcc9 (Ubuntu OS), _mm_storeu_si32 is not recognized, see here. HostPrefixSum.cpp has therefore been changed to use _mm_store_ss instead.
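
For reference, here is a minimal sketch of how a 32-bit lane can be read out with _mm_store_ss using only SSE/SSE2. It is not the code from HostPrefixSum.cpp: the helper names `low_lane_u32` and `lane_u32` are hypothetical, and the use of `_mm_castsi128_ps`, `_mm_shuffle_epi32` and `std::memcpy` is an assumption about one reasonable way to do it.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>
#include <emmintrin.h> // SSE2 intrinsics (also provides the SSE ones)

// Hypothetical helper: return the lowest 32-bit lane of v without SSE4.1.
inline std::uint32_t low_lane_u32(__m128i v) {
  float tmp;
  // Reinterpret the integer vector as floats (no conversion, bits unchanged)
  // and store only the lowest 32 bits.
  _mm_store_ss(&tmp, _mm_castsi128_ps(v));
  std::uint32_t out;
  std::memcpy(&out, &tmp, sizeof(out)); // copy the raw bits back to an integer
  return out;
}

// Hypothetical helper: return lane N (0..3), similar in effect to
// _mm_extract_epi32(v, N), by first moving lane N into lane 0 with an
// SSE2 shuffle.
template <int N>
inline std::uint32_t lane_u32(__m128i v) {
  static_assert(N >= 0 && N < 4, "lane index must be in [0, 3]");
  return low_lane_u32(_mm_shuffle_epi32(v, N));
}

// Small self-check of the helpers above.
inline void check_lane_helpers() {
  const __m128i v = _mm_set_epi32(40, 30, 20, 10); // lanes 3, 2, 1, 0
  assert(low_lane_u32(v) == 10u);
  assert(lane_u32<3>(v) == 40u);
}
```

Going through a `float` temporary and `std::memcpy` avoids writing to an integer through a `float*`, which would otherwise raise strict-aliasing concerns.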

FYI: @dcampora

Edited by Giovanni Bassi

Merged by Daniel Hugo Campora Perez 3 years ago (Mar 19, 2022 6:42am UTC)

Merge details

  • Changes merged into master with 58e6f5e9.
  • Deleted the source branch.
