Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
mNLP-project
/
gpt2-dpo
like
0
Text Generation
Transformers
Safetensors
gpt2
trl
dpo
Generated from Trainer
text-generation-inference
Inference Endpoints
License:
mit
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
gpt2-dpo
/
README.md
Commit History
do test run on scitas with ref_model
9644871
verified
Luca-Engel
commited on
Jun 2
Training in progress, epoch 0
90b9a3d
verified
Luca-Engel
commited on
Jun 2
do test run on scitas with ref_model
0750890
verified
Luca-Engel
commited on
May 29
do test run on scitas with ref_model
edd65f0
verified
Luca-Engel
commited on
May 28
Training in progress, epoch 1
6f96b10
verified
Luca-Engel
commited on
May 28
Update README.md
8f2498d
verified
Luca-Engel
commited on
May 26
do test run on scitas with base gpt mode
a69ad36
verified
Luca-Engel
commited on
May 24
Training in progress, epoch 1
30a3b64
verified
Luca-Engel
commited on
May 23