Overview — Transformer Engine (original) (raw)

NVIDIA® Transformer Engine is a library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilization in both training and inference.

These pages contain documentation for Transformer Engine release 2.2 and earlier releases.

The following documents are provided: