Skip to content
Snippets Groups Projects

Optimise fragment offset calculation for GPU batches

Merged Roel Aaij requested to merge optimise_fragment_offsets into master
All threads resolved!

Also add an application that benchmarks fragment offset calculation

The speed of the mep_offsets function improves from 14.3 Hz to 648 Hz (45x speedup). 4 threads are now more than enough to calculate the offsets; 2 would probably still be fine. It seems that EB::get_padding( s, 1 << align ) is very slow.

requires Allen!1127 (merged)

FYI @gligorov @kaaricha

Edited by Roel Aaij

Merge request reports

Loading
Loading

Activity

Filter activity
  • Approvals
  • Assignees & reviewers
  • Comments (from bots)
  • Comments (from users)
  • Commits & branches
  • Edits
  • Labels
  • Lock status
  • Mentions
  • Merge request status
  • Tracking
Please register or sign in to reply
Loading