PKU-Alignment
/

beaver-7b-v3.0-cost

Reinforcement Learning

reinforcement-learning-from-human-feedback

Model card Files Files and versions Community

beaver-7b-v3.0-cost

Commit History

Update README.md

6b37df2

XuehaiPan commited on Apr 20

Add beaver-7b-v3.0-cost

435d9f7

XuehaiPan commited on Apr 19

initial commit

7bbbbef
verified

XuehaiPan commited on Apr 19