double-ai/DPO_AV_sigmoid_0.15_f137ec203748d4ec_checkpoint-154_2025-01-01_13-53-25_f6eb02e2f56a8f31
Tags: Transformers, Safetensors, Generated from Trainer, trl, dpo, Inference Endpoints, arxiv:2305.18290
Branch: main
Directory: reference_adapter
1 contributor
History: 1 commit
Latest commit: "End of training" by ibenshaul (52a426b, verified), 25 days ago
File                        Scan   Size          Last commit       Updated
adapter_config.json         Safe   805 Bytes     End of training   25 days ago
adapter_model.safetensors   Safe   4.3 GB (LFS)  End of training   25 days ago