Optimized mask clustering (!2) · Merge requests · LHCb / Allen · GitLab

Snippets Groups Projects

Merged Daniel Hugo Campora Perez requested to merge optimized_mask_clustering into master 6 years ago

It now finds 0.019493% more clusters (down from 0.07%)
The algorithm should be a tad faster
Added support for CMAKE_BUILD_TYPE option. Available options:
- RelWithDebInfo (default)
- Release
- Debug
EstimateInputSize logic changed for adding candidates. Using masks now.
Removed sp_size from the GPU (it was unused).
Added constant candidate_ks for finding out active pixel numbers in a four-bit number (EstimateInputSize optimization).
Prefix Sum has been optimized, following the strategy of Merrill’s 2–level upsweep/downsweep.
When profiling Clustering alone, or the whole application, performance rate is now more stable and less picky about synchronization shenanigans.
A Handler class has been created, holding the stream, blocks and threads attributes. Any Handler should inherit from it.
Consolidate tracks is on by default now.
Found a good configuration for EstimateInputSize call. 70 kHz on 1080 Ti.

Edited 6 years ago by Daniel Hugo Campora Perez

Activity

Daniel Hugo Campora Perez added 1 commit 6 years ago
added 1 commit

26881b9a - Fixed CMakeLists, optimized EstimateInputSize

Compare with previous version
Daniel Hugo Campora Perez added 1 commit 6 years ago
added 1 commit

1a3bff76 - Optimized Prefix Sum

Compare with previous version
Daniel Hugo Campora Perez changed the description 6 years ago

changed the description
Daniel Hugo Campora Perez added 1 commit 6 years ago
added 1 commit

b3218740 - Found sweet spot for EstimateInputSize parameters

Compare with previous version
Daniel Hugo Campora Perez changed the description 6 years ago

changed the description
Dorothea Vom Bruch merged 6 years ago

merged
Dorothea Vom Bruch mentioned in commit 53d31739 6 years ago

mentioned in commit 53d31739

Please register or sign in to reply