πŸ€— Optimum Neuron (original) (raw)

AWS Trainium & Inferentia documentation

AWS Trainium & Inferentia

Hugging Face's logo

Join the Hugging Face community

and get access to the augmented documentation experience

Collaborate on models, datasets and Spaces

Faster examples with accelerated inference

Switch between documentation themes

πŸ€— Optimum Neuron is the interface between the πŸ€— Transformers library and AWS Accelerators including AWS Trainium and AWS Inferentia. It provides a set of tools enabling easy model loading, training and inference on single- and multi-Accelerator settings for different downstream tasks. The list of officially validated models and tasks is available here.