Draft: Extend float_v to work with arbitrary precision (!4992) · Merge requests · LHCb / LHCb · GitLab

Snippets Groups Projects

Open Andy Morris requested to merge anmorris-higher-float_v-precision into master 3 weeks ago

3 unresolved threads

Related to !4988 and #386 I'm trying to extend the float_v class to be templated to any floating-point type. This will also require an extension of int_v to allow longs and possibly also mask_v.

Todo:

Extend float_v to allow doubles
Extend int_v to allow longs
Extend mask_v to allow longs
Extend functions elsewhere in LHCb which required float_v, int_v and mask_v arguments/return values to instead use the templated classes

Edited 3 weeks ago by Andy Morris

Approval is optional

Merge blocked: 4 checks failed

Merge request must not be draft.

Unresolved discussions must be resolved.

Pipeline must succeed.

Merge conflicts must be resolved.

Activity

Andy Morris marked the checklist item Extend float_v to allow doubles as completed 3 weeks ago

marked the checklist item Extend float_v to allow doubles as completed
A

Andy Morris @anmorris · 3 weeks ago

Author Developer

@graven @msaur @jonrob I'm looking for the opinion of someone wiser than me. Do you think this MR is a good idea? Or if I finish it is it likely to break everything?

G

Gerhard Raven @graven · 3 weeks ago

Developer

I would also like to know what @ahennequ thinks -- basically, one has to be very careful about the implications of this...

In the end, one must balance accuracy and performance, and where possible, first try to identify and avoid 'bad algorithms' which require an increased 'internal' accuracy to remain numerically stable. Often (but not always) those can be re-formulated in mathematically equivalent ones which have better behavior - and it would be a pity if one had to go to an 'excessive' precision every where when it is only needed in a few places.

Hence I do not think there is an a-priori clear answer, and thus I will refrain from a clearcut opinion, and just acknowledge that this is tricky...

Arthur Marius Hennequin @ahennequ · 3 weeks ago

Developer

One of the design principle of the library is that we have the same number of elements in every vectors of a type in a given backend. This helps write code that feel like scalar.

I see two different ways to tackle this issue:

only allow doubles for the scalar backend

use virtual vectors for doubles (eg for avx2 one double_v would contain 2 4-elements registers to match the size required by the backend (8 elements))

A

Andy Morris @anmorris · 3 weeks ago

Author Developer

it would be a pity if one had to go to an 'excessive' precision every where when it is only needed in a few places

So that was my logic for making this templated, is that we can specify double-precision only when necessary and use floats otherwise. For the use case specifically highlighted in !4988, it turns out that there may be a work-around possible using the values that are already doubles (i.e. |p| and then constraining the PDG mass, roughly as @graven suggested)

For the scalar issue, that's I think why we'd need to also extend int_v and mask_v to allow size-8 elements (i.e. longs), or is this not what you mean @ahennequ?

Arthur Marius Hennequin @ahennequ · 3 weeks ago

Developer

no this not what I meant. You want to add a new type double_v to each backend. But you don't want to modify the existing types since that would result in loosing paralelism in the default case.

For example, avx2 as 3 types:

float_v: 8 x float

int_v: 8 x int

mask_v: 8 x bool

To avoid breaking the logic, the double_v type you want to add should be: 8 x double

The way to do that is to make it as:

class double_v { private: __m256 data{}; __m256 data2{}; };

That way every vector in a given backend have the same number of elements and you don't break the other types, the loop_masks, etc...

Please register or sign in to reply
Software for LHCb added RTA label 3 weeks ago

added RTA label
Gerhard Raven requested review from @ahennequ 3 weeks ago

requested review from @ahennequ
Gerhard Raven @graven started a thread on the diff 3 weeks ago

Kernel/LHCbMath/include/LHCbMath/SIMDWrapper.h

     class int_v;
     class float_v;
     template <typename T, typename = std::enable_if_t<std::is_floating_point_v<T>>>
     template <typename T, typename = std::enable_if_t<std::is_floating_point_v<T>>>
     template <std::floating_point T>

A

Andy Morris @anmorris · 3 weeks ago

Author Developer

@jonrob I think this may be worth discussing at the upcoming RTA WP1 meeting, what do you think?
Christopher Rob Jones @jonrob · 3 weeks ago

Maintainer

Sorry for not seeing this at first. I've added it to the agenda

https://indico.cern.ch/event/1531413/
Please register or sign in to reply

Please register or sign in to reply