PleIAs's Collections
Common Artifacts
Common Models
Common Corpus
Toxic Commons
Finance Commons
Bad Data Toolbox
OpenCulture
Common Corpus (updated Nov 13)
Largest multilingual pretraining data.
PleIAs/common_corpus • Updated Nov 22 • 397M • 29.8k • 196