rinna/japanese-gpt-neox-3.6b-instruction-ppo
rinna Co., Ltd.
Text Generation
Transformers
PyTorch
Safetensors
Anthropic/hh-rlhf
Japanese
gpt_neox
lm
nlp
text-generation-inference
arxiv:2203.02155
arxiv:1707.06347
arxiv:2404.01657
License: mit
refs/pr/4
Commit History
Adding `safetensors` variant of this model
d179df6
SFconvertbot committed on Aug 24, 2023
update readme
2140541
tianyuz committed on Jun 9, 2023
upload
60ed51e
tianyuz committed on May 30, 2023
initial commit
fc6fad9
tianyuz committed on May 30, 2023