mpasila's Collections
Pre-training dataset prep
Updated 15 days ago
Some datasets I should probably use.
JeanKaddour/minipile • Updated Jun 20, 2023 • 1.01M • 2.11k • 115
wikimedia/wikipedia • Updated Jan 9 • 61.6M • 58.2k • 586
neuralwork/arxiver • Updated 9 days ago • 63.4k • 4.55k • 338
ohsuz/tiny-textbooks-edu • Updated Jun 11 • 3.31M • 41 • 1
ohsuz/tiny-code-textbooks-edu • Updated Jun 11 • 1.84M • 50 • 2