princeton-nlp/gemma-2-9b-it-DPO
Text Generation
Transformers
Safetensors
princeton-nlp/gemma2-ultrafeedback-armorm
gemma2
alignment-handbook
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
arxiv:2405.14734
arxiv:2310.01377
arxiv:2406.12845
main
Commit History
Update README.md (f646c99, verified): princeton-nlp committed on Jul 18
Update README.md (cb78d4b, verified): princeton-nlp committed on Jul 18
update (b6d53aa): xiamengzhou committed on Jul 17
update (693f340): xiamengzhou committed on Jul 17
update config (b6d0acc): xiamengzhou committed on Jul 16
Upload Gemma2ForCausalLM (1a75e57, verified): princeton-nlp committed on Jul 16
initial commit (ffb1d77, verified): princeton-nlp committed on Jul 16