Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
anakin87
/
gemma-2b-orpo
like
27
Text Generation
Transformers
Safetensors
alvarobartt/dpo-mix-7k-simplified
English
gemma
trl
orpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
arxiv:
2403.07691
License:
gemma-terms-of-use (other)
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
c8b9386
gemma-2b-orpo
Commit History
improve notebook visualization
c8b9386
anakin87
commited on
Mar 26
fix
cd3951b
anakin87
commited on
Mar 25
fixes
189413b
anakin87
commited on
Mar 25
improve readme
ce4ba3c
anakin87
commited on
Mar 25
Upload gemma-2b-orpo.png
159c797
verified
anakin87
commited on
Mar 25
material
4db7146
anakin87
commited on
Mar 25
Update README.md
5cbf999
verified
anakin87
commited on
Mar 25
little change
15a13e0
anakin87
commited on
Mar 25
End of training
7fbb0bb
verified
anakin87
commited on
Mar 24
Training in progress, epoch 2
b6e4162
verified
anakin87
commited on
Mar 24
Training in progress, epoch 2
d241042
verified
anakin87
commited on
Mar 24
Training in progress, epoch 0
3c43e7b
verified
anakin87
commited on
Mar 24
initial commit
dcc2f5c
verified
anakin87
commited on
Mar 24