Hugging Face – The AI community building the future.

Modalities

3D Audio Document Geospatial Image Tabular Text Time-series Video

Format

json csv parquet imagefolder soundfolder webdataset text arrow

Datasets

418,798

open-thoughts/OpenThoughts3-1.2M Viewer • Updated 1 day ago • 1.2M • 5.79k • 78

fka/awesome-chatgpt-prompts Viewer • Updated Jan 6 • 203 • 22.1k • 7.9k

yandex/yambda Viewer • Updated 4 days ago • 5.31B • 42.8k • 155

a-m-team/AM-DeepSeek-R1-0528-Distilled Preview • Updated 1 day ago • 3.22k • 46

Hcompany/WebClick Viewer • Updated 1 day ago • 1.64k • 4.24k • 45

nvidia/Nemotron-Personas Viewer • Updated 1 day ago • 100k • 962 • 39

open-r1/Mixture-of-Thoughts Viewer • Updated 15 days ago • 699k • 31.3k • 209

openbmb/Ultra-FineWeb Viewer • Updated 4 days ago • 1.29B • 35.9k • 156

PleIAs/common_corpus Viewer • Updated 6 days ago • 470M • 238k • 282

common-pile/comma_v0.1_training_dataset Viewer • Updated 4 days ago • 784M • 8k • 21

MiniMaxAI/SynLogic Viewer • Updated about 16 hours ago • 49.3k • 1.15k • 82

miriad/miriad-5.8M Viewer • Updated 26 days ago • 5.82M • 418 • 16

HuggingFaceFW/fineweb Viewer • Updated Jan 31 • 25B • 371k • 2.19k

microsoft/mediflow Viewer • Updated 11 days ago • 1.84M • 3.08k • 31

gaia-benchmark/GAIA Updated Feb 13 • 11.5k • 366

JokerJan/MMR-VBench Viewer • Updated 6 days ago • 1.26k • 1.53k • 15

Dataseeds/DataSeeds.AI-Sample-Dataset-DSD Viewer • Updated about 12 hours ago • 7.84k • 242 • 12

DeepMount00/italian_conversations Viewer • Updated 3 days ago • 1k • 59 • 12

thivux/phoaudiobook Viewer • Updated 1 day ago • 1.04M • 1.22k • 11

bigai-nlco/ReflectionEvo Viewer • Updated 7 days ago • 437k • 653 • 11

snorkelai/Multi-Turn-Insurance-Underwriting Viewer • Updated 12 days ago • 100 • 1.78k • 20

boltuix/emotions-dataset Viewer • Updated 16 days ago • 131k • 291 • 14

allenai/reward-bench-2 Viewer • Updated 6 days ago • 1.87k • 886 • 18

wikimedia/wikipedia Viewer • Updated Jan 9, 2024 • 61.6M • 75k • 841

HuggingFaceFW/fineweb-edu Viewer • Updated Jan 31 • 3.3B • 129k • 693

Josephgflowers/Finance-Instruct-500k Viewer • Updated Mar 1 • 518k • 1.07k • 79

openai/gsm8k Viewer • Updated Jan 4, 2024 • 17.6k • 501k • 756

bigcode/the-stack Viewer • Updated Apr 13, 2023 • 546M • 13.6k • 822

roneneldan/TinyStories Viewer • Updated Aug 12, 2024 • 2.14M • 25.4k • 678

nvidia/Llama-Nemotron-Post-Training-Dataset Viewer • Updated May 8 • 3.91M • 11.3k • 500