
Draft: FullTransformerDecoder

Guillaume Quetant requested to merge pippin_dev into master

This adds the FullTransformerDecoder class, analogous to the FullTransformerEncoder but based on cross-attention rather than self-attention alone.
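
For reviewers less familiar with the distinction, here is a minimal sketch of self-attention versus cross-attention in plain Python. It is not the code in this merge request: the function names (`attention`, `self_attention`, `cross_attention`) are hypothetical, and learned projections, multiple heads, masking, and normalisation are all left out. The only point is that cross-attention takes its keys and values from a second sequence.

```python
import math


def attention(queries, keys, values):
    """Scaled dot-product attention over plain lists of vectors (illustrative only)."""
    dim = len(keys[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(dim).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim) for k in keys]
        # Numerically stable softmax over the scores.
        peak = max(scores)
        weights = [math.exp(s - peak) for s in scores]
        total = sum(weights)
        weights = [w / total for w in weights]
        # Weighted average of the value vectors.
        outputs.append(
            [sum(w * v[j] for w, v in zip(weights, values)) for j in range(len(values[0]))]
        )
    return outputs


def self_attention(x):
    # Queries, keys, and values all come from the same sequence.
    return attention(x, x, x)


def cross_attention(x, context):
    # Queries come from x; keys and values come from a separate context sequence.
    return attention(x, context, context)


# Hypothetical toy inputs: three target vectors attending to two context vectors.
target = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
context = [[0.5, 0.5], [2.0, -1.0]]
decoded = cross_attention(target, context)
```

The output has one vector per target element, and each vector is a weighted average of the context vectors, so the context length does not need to match the target length.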
