Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
RTO-RL
/
Llama3-8B-RewardModel
like
0
Follow
Reinforced Token Optimization
4
Safetensors
HuggingFaceH4/ultrafeedback_binarized
llama
Model card
Files
Files and versions
Community
Train
main
Llama3-8B-RewardModel
Commit History
Update README.md
4c47959
verified
zkshan2002
commited on
8 days ago
Create README.md
f2aee75
verified
zkshan2002
commited on
Oct 11, 2024
initial commit
74f6c7d
verified
zkshan2002
commited on
Oct 11, 2024
initial commit
693f20e
verified
zkshan2002
commited on
Oct 11, 2024