Models - Hugging Face

Multimodal

Image-Text-to-Text, Visual Question Answering, Document Question Answering, Video-Text-to-Text, Any-to-Any

Computer Vision

Depth Estimation, Image Classification, Object Detection, Image Segmentation, Text-to-Image, Image-to-Text, Image-to-Image, Image-to-Video, Unconditional Image Generation, Video Classification, Text-to-Video, Zero-Shot Image Classification, Mask Generation, Zero-Shot Object Detection, Text-to-3D, Image-to-3D, Image Feature Extraction, Keypoint Detection

Natural Language Processing

Text Classification, Token Classification, Table Question Answering, Question Answering, Zero-Shot Classification, Translation, Summarization, Feature Extraction, Text Generation, Text2Text Generation, Fill-Mask, Sentence Similarity

Audio

Text-to-Speech, Text-to-Audio, Automatic Speech Recognition, Audio-to-Audio, Audio Classification, Voice Activity Detection

Tabular

Tabular Classification, Tabular Regression, Time Series Forecasting

Reinforcement Learning

Reinforcement Learning, Robotics

Other

Graph Machine Learning

openai/whisper-large-v3-turbo • Automatic Speech Recognition • Updated 6 days ago • 102k • 847

nvidia/NVLM-D-72B • Image-Text-to-Text • Updated 2 days ago • 18.8k • 585

black-forest-labs/FLUX.1-dev • Text-to-Image • Updated Aug 16 • 1.13M • 5.31k

ostris/OpenFLUX.1 • Text-to-Image • Updated 7 days ago • 11.1k • 450

rain1011/pyramid-flow-sd3 • Text-to-Video • Updated about 6 hours ago • 243

apple/DepthPro • Depth Estimation • Updated 1 day ago • 187

jxm/cde-small-v1 • Feature Extraction • Updated 1 day ago • 1.68k • 181

rhymes-ai/Aria • Text Generation • Updated about 22 hours ago • 172 • 168

meta-llama/Llama-3.2-11B-Vision-Instruct • Image-Text-to-Text • Updated 11 days ago • 564k • 608

meta-llama/Llama-3.2-1B • Text Generation • Updated 10 days ago • 203k • 429

stepfun-ai/GOT-OCR2_0 • Image-Text-to-Text • Updated 23 days ago • 199k • 1.01k

black-forest-labs/FLUX.1-schnell • Text-to-Image • Updated Aug 16 • 1.05M • 2.53k

allenai/Molmo-7B-D-0924 • Image-Text-to-Text • Updated 20 minutes ago • 29.4k • 342

meta-llama/Llama-3.2-3B-Instruct • Text Generation • Updated 15 days ago • 281k • 309

meta-llama/Llama-3.1-8B-Instruct • Text Generation • Updated 15 days ago • 3.2M • 2.75k

jasperai/Flux.1-dev-Controlnet-Upscaler • Image-to-Image • Updated 11 days ago • 17.6k • 299

meta-llama/Llama-3.2-1B-Instruct • Text Generation • Updated 15 days ago • 283k • 323

coqui/XTTS-v2 • Text-to-Speech • Updated Dec 11, 2023 • 903k • 1.81k

google/gemma-2-2b-jpn-it • Text Generation • Updated 8 days ago • 8.91k • 97

stabilityai/stable-diffusion-3-medium • Text-to-Image • Updated Aug 12 • 42.5k • 4.41k

Revai/reverb-asr • Automatic Speech Recognition • Updated 2 days ago • 70 • 59

meta-llama/Llama-3.2-11B-Vision • Image-Text-to-Text • Updated 14 days ago • 60.3k • 253

nyanko7/flux-dev-de-distill • Updated 28 days ago • 108

openai/whisper-large-v3 • Automatic Speech Recognition • Updated Aug 12 • 3.94M • 3.54k

StephanST/WALDO30 • Object Detection • Updated 1 day ago • 36

meta-llama/Llama-3.2-3B • Text Generation • Updated 14 days ago • 71.7k • 174

glif/how2draw • Text-to-Image • Updated 17 days ago • 7.92k • 181

abacusai/Dracarys2-72B-Instruct • Text Generation • Updated 10 days ago • 533 • 34

OnomaAIResearch/Illustrious-xl-early-release-v0 • Text-to-Image • Updated 5 days ago • 12.4k • 199

jinaai/jina-embeddings-v3 • Feature Extraction • Updated about 12 hours ago • 221k • 362