FDeRubeis
/

araft_trained_dpo

Generated from Trainer

Model card Files Files and versions Community

araft_trained_dpo

1 contributor

History: 6 commits

FDeRubeis's picture

Update Readme.md: add links to ReAct and HotpotQA papers

f2941b0 verified 6 months ago