morizon/llm-jp-3-13b-instruct2-grpo-MATH-lighteval_step1000_lora
Transformers
Safetensors
English
text-generation-inference
unsloth
llama
trl
Inference Endpoints
License: apache-2.0
main
Commit History
Update README.md (51e0e51, verified): morizon committed 18 days ago
Trained with Unsloth (88cd2a8, verified): morizon committed 18 days ago
Trained with Unsloth (84f1f1b, verified): morizon committed 18 days ago
Upload README.md with huggingface_hub (26c8a0b, verified): morizon committed 18 days ago
initial commit (9b8131b, verified): morizon committed 18 days ago