Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints

Commit History

Update README.md
0b30eab
verified

flydust commited on

Update README.md
f75750c
verified

flydust commited on

End of training
268dd21
verified

flydust commited on

Model save
d52283b
verified

flydust commited on