Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
File size: 135 Bytes
2d9d8e6
 
 
1
2
3
4
version https://git-lfs.github.com/spec/v1
oid sha256:2e9cc3ccaabb82c8557935e86dc33ac987c379e0048d864786d2947c21e692e6
size 4980945440