trl-lib/qwen1.5-1.8b-dpo-cli
Tags: Transformers, Safetensors, Inference Endpoints, arxiv:1910.09700
main
Commit History
Upload tokenizer (0e9484b, verified), committed by ybelkada (HF staff) on Mar 15
Upload model (be448c6, verified), committed by ybelkada (HF staff) on Mar 15
initial commit (5154088, verified), committed by ybelkada (HF staff) on Mar 15