All Our Models | Unsloth Documentation


Last updated 4 days ago


See the tables below for all GGUF, 4-bit and 16-bit models uploaded to Hugging Face.

GGUF models can be run in tools such as Ollama, Open WebUI and llama.cpp.
4-bit and 16-bit models can be used for inference serving or fine-tuning.
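
As a rough illustration of the two points above, the sketch below maps each upload format to the uses this page names for it, and shows one way a 4-bit upload might be loaded for fine-tuning. The names `FORMAT_USES`, `formats_for` and `load_4bit_for_finetuning` are hypothetical helpers introduced here, and `repo_id` is a placeholder for whichever repository you pick from the tables. The loader assumes the `unsloth` package and a supported GPU are available; it is not called in this snippet.

```python
# Upload formats and the uses this page lists for each of them.
FORMAT_USES = {
    "GGUF": {"ollama", "open webui", "llama.cpp"},
    "4-bit": {"inference serving", "fine-tuning"},
    "16-bit": {"inference serving", "fine-tuning"},
}


def formats_for(use: str) -> list[str]:
    """Return the upload formats listed above for a given tool or task."""
    key = use.strip().lower()
    return [fmt for fmt, uses in FORMAT_USES.items() if key in uses]


def load_4bit_for_finetuning(repo_id: str, max_seq_length: int = 2048):
    """Load a 4-bit upload with Unsloth (assumes unsloth and a GPU are present).

    `repo_id` is a placeholder: pass the name of a 4-bit repository
    taken from the table on this page.
    """
    # Imported lazily so that formats_for() works without unsloth installed.
    from unsloth import FastLanguageModel

    # Returns a (model, tokenizer) pair.
    return FastLanguageModel.from_pretrained(
        model_name=repo_id,
        max_seq_length=max_seq_length,
        load_in_4bit=True,
    )


if __name__ == "__main__":
    print(formats_for("Ollama"))       # ['GGUF']
    print(formats_for("fine-tuning"))  # ['4-bit', '16-bit']
```

The import sits inside the loader so that the lookup helper can run on a machine without a GPU; only the actual model load needs the heavier dependencies.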

Here's a table of all our GGUF + 4-bit model uploads:

Model | GGUF | Instruct (4-bit) | Base (4-bit)

Mistral

Qwen2.5-Omni (new)

Original models:

Gemma 2

Phi-3.5

Phi-3

Llama 3

Llava

Llama 2

Qwen2 VL

SmolLM2

TinyLlama

Qwen2

Zephyr SFT

CodeLlama

Yi

Here's a table of all our 16-bit or 8-bit original model uploads:

Model | Instruct | Base

Mistral

Gemma 2

DeepSeek V3

Phi-3.5

Phi-3

Llama 3

Llava

Qwen2 VL

Llama 2

SmolLM2

TinyLlama

Qwen2

Zephyr SFT


R1-0528-Qwen3-8B

Llama 3.3 (70B)

Qwen 2.5 (1.5B)

NeMo 2407 (12B)

Small 2409 (22B)

Pixtral (12B) 2409

Text-to-speech (TTS)

Sesame-CSM (1B)

Whisper Large V3

Spark-TTS (0.5B)

1.6 Mistral (7B)

Mistral Small 2501