HuggingFaceTB/cosmo2-tokenizer
Hugging Face TB Research
Transformers
HuggingFaceTB/cosmo2_training_data_subset_1M
Inference Endpoints
cosmo2-tokenizer
Tokenizer for the training of cosmo2. This tokenizer was trained on 1M samples from:
FineWeb-Edu 70%
Cosmopedia v2 15%
StarCoderData 8%
OpenWebMath 5%
StackOverflow 2%
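To make the mixture concrete, the sketch below converts the listed percentages into approximate per-source sample counts. It assumes the percentages are shares of the 1M training samples, which the card implies but does not state outright. The helper names (`samples_per_source`, `load_cosmo2_tokenizer`) are hypothetical and chosen for illustration only. The loader further assumes the repository can be read with `AutoTokenizer` from the Transformers library, which the page's Transformers tag suggests but this card does not document.

```python
# Mixture as listed on the card, in percent of tokenizer training samples.
MIXTURE_PERCENT = {
    "FineWeb-Edu": 70,
    "Cosmopedia v2": 15,
    "StarCoderData": 8,
    "OpenWebMath": 5,
    "StackOverflow": 2,
}

# The card states the tokenizer was trained on 1M samples.
TOTAL_SAMPLES = 1_000_000


def samples_per_source(mixture, total):
    """Split `total` samples across sources according to integer percentages."""
    if sum(mixture.values()) != 100:
        raise ValueError("mixture percentages must sum to 100")
    return {name: total * pct // 100 for name, pct in mixture.items()}


def load_cosmo2_tokenizer(repo_id="HuggingFaceTB/cosmo2-tokenizer"):
    """Hypothetical helper: fetch the tokenizer from the Hub.

    Assumes `transformers` is installed and the repo is loadable with
    AutoTokenizer; the import is local so the rest of the sketch runs
    without the library or network access.
    """
    from transformers import AutoTokenizer

    return AutoTokenizer.from_pretrained(repo_id)


if __name__ == "__main__":
    for name, count in samples_per_source(MIXTURE_PERCENT, TOTAL_SAMPLES).items():
        print(f"{name}: {count:,} samples")
```

Under that assumption, FineWeb-Edu contributes about 700,000 samples and StackOverflow about 20,000. Actual counts in the training subset may differ if the percentages were measured in tokens or documents of varying size.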