Columbia-NLP/gemma-2b-zephyr-dpo
Text Generation
Transformers
Safetensors
argilla/dpo-mix-7k
gemma
alignment-handbook
trl
dpo
Generated from Trainer
conversational
Eval Results
text-generation-inference
Inference Endpoints
License: gemma-terms-of-use
gemma-2b-zephyr-dpo/README.md
Commit History
Update README.md (e5f12d3, verified): qywu committed on Apr 12
Update README.md (a6501ce, verified): qywu committed on Apr 12
Update README.md (2955d05, verified): qywu committed on Apr 12
Update README.md (4a35d92, verified): qywu committed on Apr 12
Upload GemmaForCausalLM (ff46885, verified): qywu committed on Apr 12