Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Aviv-anthonnyolime
's Collections
Dataset
Model - Misc
Paper - Multimodal
Audio Dataset
Text-to-image
Omni-model
Audio model
Dataset
updated
7 days ago
Upvote
-
mlfoundations/MINT-1T-HTML
Viewer
•
Updated
Sep 21, 2024
•
623M
•
204k
•
81
mlfoundations/MINT-1T-ArXiv
Viewer
•
Updated
Sep 19, 2024
•
5.6M
•
4.08k
•
48
mlfoundations/MINT-1T-PDF-CC-2024-18
Updated
Sep 19, 2024
•
7.66k
•
19
mlfoundations/dclm-baseline-1.0-parquet
Viewer
•
Updated
Jul 19, 2024
•
2.73B
•
21.4k
•
25
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
9 days ago
•
3.3B
•
492k
•
613
HuggingFaceFW/fineweb
Viewer
•
Updated
9 days ago
•
25B
•
500k
•
1.9k
jat-project/jat-dataset
Viewer
•
Updated
Feb 16, 2024
•
258M
•
514k
•
35
HuggingFaceTB/finemath
Viewer
•
Updated
3 days ago
•
48.3M
•
20.1k
•
276
DAMO-NLP-SG/multimodal_textbook
Updated
29 days ago
•
15.3k
•
132
fhswf/TinyStoriesV2_cleaned
Viewer
•
Updated
May 23, 2024
•
2.71M
•
369
•
8
TurkuNLP/finerweb-10bt
Viewer
•
Updated
23 days ago
•
7.1M
•
629
•
5
Upvote
-
Share collection
View history
Collection guide
Browse collections