prithivMLmods/SmolLM2_135M_Grpo_Checkpoint
Text Generation
Transformers
Safetensors
openai/gsm8k
English
llama
text-generation-inference
GRPO
conversational
Inference Endpoints
License: apache-2.0
main
Commit History
Update README.md, 13f0063 (verified), prithivMLmods committed 12 days ago
Update README.md, 17f5f78 (verified), prithivMLmods committed 12 days ago
Update README.md, b87f9c2 (verified), prithivMLmods committed 12 days ago
Update README.md, cdd099e (verified), prithivMLmods committed 12 days ago
Update README.md, 937dac8 (verified), prithivMLmods committed 12 days ago
Update README.md, 7a4989e (verified), prithivMLmods committed 12 days ago
Upload SmolLM_x_Grpo.ipynb, 9b868a8 (verified), prithivMLmods committed 12 days ago
Create README.md, 343f3dd (verified), prithivMLmods committed 12 days ago
Upload folder using huggingface_hub, 7797c34 (verified), prithivMLmods committed 12 days ago
Delete final_model, 1dbe979 (verified), prithivMLmods committed 12 days ago
Upload folder using huggingface_hub, 2d43f42 (verified), prithivMLmods committed 12 days ago
initial commit, 61c8346 (verified), prithivMLmods committed 12 days ago