Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Aviv-anthonnyolime
's Collections
Dataset
Model - Misc
Paper - Multimodal
Audio Dataset
Text-to-image
Omni-model
Audio model
Dataset
updated
5 days ago
Upvote
-
mlfoundations/MINT-1T-HTML
Viewer
•
Updated
Sep 21, 2024
•
623M
•
157k
•
81
mlfoundations/MINT-1T-ArXiv
Viewer
•
Updated
Sep 19, 2024
•
5.6M
•
4.05k
•
48
mlfoundations/MINT-1T-PDF-CC-2024-18
Updated
Sep 19, 2024
•
7.54k
•
19
mlfoundations/dclm-baseline-1.0-parquet
Viewer
•
Updated
Jul 19, 2024
•
2.73B
•
23.6k
•
25
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
7 days ago
•
3.3B
•
481k
•
611
HuggingFaceFW/fineweb
Viewer
•
Updated
7 days ago
•
25B
•
490k
•
1.88k
jat-project/jat-dataset
Viewer
•
Updated
Feb 16, 2024
•
258M
•
514k
•
35
HuggingFaceTB/finemath
Viewer
•
Updated
1 day ago
•
48.3M
•
20.3k
•
276
DAMO-NLP-SG/multimodal_textbook
Updated
27 days ago
•
15.3k
•
132
fhswf/TinyStoriesV2_cleaned
Viewer
•
Updated
May 23, 2024
•
2.71M
•
346
•
8
TurkuNLP/finerweb-10bt
Viewer
•
Updated
21 days ago
•
7.1M
•
427
•
5
Upvote
-
Share collection
View history
Collection guide
Browse collections