Open Datasets
updated
fka/awesome-chatgpt-prompts
Viewer
•
Updated
•
600
•
25.6k
•
9.51k
Viewer
•
Updated
•
470M
•
47.2k
•
321
Viewer
•
Updated
•
2.2M
•
7.62k
•
385
Matthijs/cmu-arctic-xvectors
Viewer
•
Updated
•
7.93k
•
18.8k
•
62
parler-tts/libritts-r-filtered-speaker-descriptions
Viewer
•
Updated
•
359k
•
251
•
7
Viewer
•
Updated
•
860k
•
11.9k
•
515
alpindale/two-million-bluesky-posts
Viewer
•
Updated
•
2.11M
•
858
•
200
arimalabs/2.3-million-bluesky-posts
Viewer
•
Updated
•
2.37M
•
80
•
5
Viewer
•
Updated
•
70k
•
81.4k
•
216
Viewer
•
Updated
•
1.34M
•
9.63k
•
29
Viewer
•
Updated
•
1.12M
•
5.53k
•
4
parler-tts/libritts_r_filtered
Viewer
•
Updated
•
359k
•
2.86k
•
21
opendiffusionai/cc12m-cleaned
Viewer
•
Updated
•
8.53M
•
422
•
10
Viewer
•
Updated
•
31.4k
•
384
•
22
Preview
•
Updated
•
1.16k
•
7
Viewer
•
Updated
•
61.6M
•
67.7k
•
995
parler-tts/mls-eng-speaker-descriptions
Viewer
•
Updated
•
10.8M
•
408
•
10
Viewer
•
Updated
•
110M
•
2.64k
•
97
Updated
•
200
•
2
Viewer
•
Updated
•
602k
•
13.5k
•
144
Viewer
•
Updated
•
4.48B
•
58.9k
•
707
Viewer
•
Updated
•
1.55k
•
51
•
4
Updated
•
11.6k
•
138
Viewer
•
Updated
•
59.1k
•
3.83k
•
12
keremberke/license-plate-object-detection
Viewer
•
Updated
•
8.83k
•
985
•
33
Updated
•
56
•
8
Viewer
•
Updated
•
98.6k
•
1.59k
•
100
nebius/SWE-agent-trajectories
Viewer
•
Updated
•
80k
•
1.27k
•
67
Viewer
•
Updated
•
3.4k
•
4.95k
•
53
cfahlgren1/react-code-instructions
Viewer
•
Updated
•
74.4k
•
487
•
154
DAMO-NLP-SG/multimodal_textbook
Updated
•
5.19k
•
157
NovaSky-AI/Sky-T1_data_17k
Viewer
•
Updated
•
16.4k
•
195
•
186
Viewer
•
Updated
•
5.45B
•
8.39k
•
435
Viewer
•
Updated
•
546M
•
26.2k
•
896
hoskinson-center/proof-pile
Viewer
•
Updated
•
363k
•
7.75k
•
63
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
•
3.5B
•
297k
•
883
EleutherAI/the_pile_deduplicated
Viewer
•
Updated
•
134M
•
17.7k
•
106
MohamedRashad/multilingual-tts
Viewer
•
Updated
•
25.5k
•
280
•
45
Viewer
•
Updated
•
16.4k
•
77
•
4
facebook/multilingual_librispeech
Viewer
•
Updated
•
1.49M
•
23.1k
•
166
Viewer
•
Updated
•
1.25M
•
16.6k
•
85
Viewer
•
Updated
•
2.77M
•
8.14k
•
111
Fumika/Wikinews-multilingual
Viewer
•
Updated
•
15.2k
•
110
•
7
ayymen/Weblate-Translations
Viewer
•
Updated
•
11.7M
•
4.51k
•
16
Updated
•
17.5k
•
152
Helsinki-NLP/opus_wikipedia
Viewer
•
Updated
•
1.75M
•
516
•
10
Viewer
•
Updated
•
3.59M
•
115
•
1
MLCommons/unsupervised_peoples_speech
Updated
•
36.2k
•
69
HKUSTAudio/Llasa_opensource_speech_data_160k_hours_tokenized
Updated
•
562
•
30
Viewer
•
Updated
•
10k
•
3.43k
•
514
Viewer
•
Updated
•
68.1k
•
146k
•
20
allenai/RLVR-GSM-MATH-IF-Mixed-Constraints
Viewer
•
Updated
•
29.9k
•
1.25k
•
30
allenai/olmo-2-0325-32b-preference-mix
Updated
•
227
•
15
allenai/tulu-3-sft-olmo-2-mixture-0225
Viewer
•
Updated
•
866k
•
890
•
22
Viewer
•
Updated
•
170M
•
46.4k
•
88
Viewer
•
Updated
•
621M
•
36.2k
•
84
Viewer
•
Updated
•
932
•
16.8k
•
504
Congliu/Chinese-DeepSeek-R1-Distill-data-110k
Viewer
•
Updated
•
110k
•
531
•
711
Viewer
•
Updated
•
102k
•
234
•
46
Viewer
•
Updated
•
450k
•
13.1k
•
687
Viewer
•
Updated
•
167M
•
2.23k
•
60