Transformers NeuronX Tutorials — AWS Neuron Documentation (original) (raw)
This document is relevant for: Inf2
, Trn1
Transformers NeuronX Tutorials#
- Hugging Face meta-llama/Llama-2-13b autoregressive sampling on Inf2 & Trn1
- Hugging Face facebook/opt-13b autoregressive sampling on Inf2 & Trn1
- Hugging Face facebook/opt-30b autoregressive sampling on Inf2 & Trn1
- Hugging Face facebook/opt-66b autoregressive sampling on Inf2
This document is relevant for: Inf2
, Trn1