Set matmul precision to highest in inference

Minh Tuan Pham requested to merge tp_infer_precision into dev

Set the matmul precision to "highest" during inference so that the output is deterministic. The setting is applied in model.setup(stage).
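
For reviewers who want a concrete picture, here is a minimal sketch of what such a change could look like. It is not the actual diff. It assumes the model follows the PyTorch Lightning `setup(stage)` hook convention and that the precision is set through `torch.set_float32_matmul_precision`. The class name `InferenceModel`, the `set_precision` parameter, and the choice of `"test"` and `"predict"` as the inference stages are hypothetical, introduced only for illustration.

```python
from typing import Callable, Optional


def _torch_set_precision(level: str) -> None:
    # Assumption: the project uses PyTorch. torch is imported lazily so the
    # sketch can be exercised without torch installed.
    import torch

    torch.set_float32_matmul_precision(level)


class InferenceModel:
    """Hypothetical stand-in for the project's model class."""

    def __init__(
        self, set_precision: Callable[[str], None] = _torch_set_precision
    ) -> None:
        # The setter is injectable so the hook can be checked in isolation.
        self._set_precision = set_precision

    def setup(self, stage: Optional[str] = None) -> None:
        # Assumption: "test" and "predict" are the stages treated as inference.
        # Training and validation keep whatever precision was already set.
        if stage in ("test", "predict"):
            self._set_precision("highest")
```

Limiting the change to the inference stages leaves training free to use a faster, lower-precision matmul setting if the project configures one elsewhere.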
