FDeRubeis/araft_trained_dpo

generated_from_trainer


1 contributor

History: 6 commits

Update Readme.md: add links to ReAct and HotpotQA papers

f2941b0 verified 15 days ago