Draft: [RFC] New way to define parameters in Allen

Small update: I have made it compatible with CUDA 12.1, added velo and velo_validation working sequences, and made the small CI sequence run those two. The throughput obtained matches the throughput on master for the VELO, and the physics efficiency matches as well.

added 1 commit

fc0e09a1 - Added first working line.

Compare with previous version

added 1 commit

aa606dc0 - Fixed formatting

Compare with previous version

changed the description

@ahennequ should this be closed once !1840 (merged) is merged?

resolved all threads

I think this MR should be revived as well, in the context of improving memory management and type safety. In particular, this would serve as a basis to add a way of structuring parameters, for instance:

__global__ void velo_search_by_triplet_kernel(
  std::span<const mask_t> dev_event_list,
  std::span<const unsigned> dev_number_of_events,
  std::span<const char> dev_sorted_velo_cluster_container,
  std::span<const unsigned> dev_offsets_estimated_input_size,
  std::span<const unsigned> dev_module_cluster_num,
  std::span<const Velo::Clusters>,
  std::span<Velo::TrackHits> dev_tracks,
  std::span<Velo::TrackletHits> dev_three_hit_tracks,
  std::span<Velo::TrackletHits> dev_tracklets,
  std::span<unsigned> dev_tracks_to_follow,
  std::span<bool> dev_hit_used,
  std::span<unsigned> dev_atomics_velo,
  std::span<unsigned> dev_number_of_velo_tracks,
  std::span<unsigned short> dev_rel_indices,
  const VeloGeometry* dev_velo_geometry,
  const float phi_tolerance,
  const float max_scatter,
  const unsigned max_skipped_modules)

would become something like:

__global__ void velo_search_by_triplet_kernel(
  const EventList event_list,
  const VeloClusters sorted_velo_clusters,
  Velo::TrackHits tracks,
  Velo::TrackletHits three_hit_tracks,
  Velo::TrackletHits dev_tracklets,
  std::span<unsigned> dev_tracks_to_follow,
  std::span<bool> dev_hit_used,
  std::span<unsigned short> dev_rel_indices,
  const VeloGeometry* dev_velo_geometry,
  const float phi_tolerance,
  const float max_scatter,
  const unsigned max_skipped_modules)

The parameters structs that are put in the store and passed between algorithms and kernels would contain shared pointers to the actual buffers, in order to handle data dependencies.

Buffers would not have to be named, only parameters (more gaudi-like).

To be noted that parameters struct could contain both host and device buffers, since host memory is pinned, accessing it (read/write) from device is possible (and even already done in prefix-sums) using zero copy mechanism.

Draft: [RFC] New way to define parameters in Allen

What is this?

named_buffer:

kernel invocations:

Lines:

Changes wrt current Allen:

Activity

Draft: [RFC] New way to define parameters in Allen

What is this?

named_buffer:

kernel invocations:

Lines:

Changes wrt current Allen:

Merge request reports

Activity