Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
wtxfrancise
/
mini_llm_dpo
like
0
Text Generation
Transformers
Safetensors
qwen
custom_code
Model card
Files
Files and versions
Community
Train
Use this model
f8aff71
mini_llm_dpo
/
optimizer.pt
Commit History
DPO LLM
2f76b25
verified
wtxfrancise
commited on
Apr 24, 2024