Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fengyao1909
/
ppo_tldr
like
0
Text Generation
Transformers
Safetensors
trl-internal-testing/tldr-preference-sft-trl-style
gpt_neox
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
arxiv:
1909.08593
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
ba0ea37
ppo_tldr
Commit History
Training in progress, step 1
ba0ea37
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
0c2f20d
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
df28300
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
464765d
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
e358023
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
c1773fb
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
91ad64b
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
e3e1448
verified
fengyao1909
commited on
12 days ago
Training in progress, step 2
8b18444
verified
fengyao1909
commited on
12 days ago
End of training
2769c63
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
fbccb8d
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
8136d99
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
ae78f46
verified
fengyao1909
commited on
12 days ago
Training in progress, step 1
cbe5955
verified
fengyao1909
commited on
12 days ago
toyppo
cb120e2
verified
fengyao1909
commited on
18 days ago
initial commit
9f5f714
verified
fengyao1909
commited on
18 days ago