Aviv-anthonnyolime's Collections
Papers
Dataset
Model - Misc
Paper - Multimodal
Audio Dataset
Text-to-image
Omni-model
Audio model
Dataset
Updated 21 days ago
mlfoundations/MINT-1T-HTML • Viewer • Updated Sep 21, 2024 • 623M • 277k • 82
mlfoundations/MINT-1T-ArXiv • Viewer • Updated Sep 19, 2024 • 5.6M • 3.57k • 48
mlfoundations/MINT-1T-PDF-CC-2024-18 • Updated Sep 19, 2024 • 9.81k • 19
mlfoundations/dclm-baseline-1.0-parquet • Viewer • Updated Jul 19, 2024 • 2.73B • 8.4k • 26
HuggingFaceFW/fineweb-edu • Viewer • Updated 23 days ago • 3.3B • 537k • 634
HuggingFaceFW/fineweb • Viewer • Updated 23 days ago • 25B • 367k • 1.98k
jat-project/jat-dataset • Viewer • Updated Feb 16, 2024 • 258M • 361k • 37
HuggingFaceTB/finemath • Viewer • Updated 17 days ago • 48.3M • 12.1k • 286
DAMO-NLP-SG/multimodal_textbook • Updated Jan 11 • 6.33k • 132
fhswf/TinyStoriesV2_cleaned • Viewer • Updated May 23, 2024 • 2.71M • 658 • 8
TurkuNLP/finerweb-10bt • Viewer • Updated Jan 17 • 7.1M • 733 • 5