Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
trl-internal-testing
/
rm_hh_1b
like
0
Follow
trl internal testing
14
Text Classification
Transformers
TensorBoard
Safetensors
gpt_neox
trl
reward-trainer
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
db0801c
rm_hh_1b
Commit History
End of training
db0801c
verified
vwxyzjn
commited on
Jun 26
Model save
e06c06c
verified
vwxyzjn
commited on
Jun 26
Training in progress, step 500
4053365
verified
vwxyzjn
commited on
Jun 26
initial commit
73d7af9
verified
vwxyzjn
commited on
Jun 26