Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
adamo1139
/
Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3
like
1
Text Generation
Transformers
Safetensors
adamo1139/rawrr_v1
llama
dpo
qlora
unsloth
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3
Commit History
Update README.md
30235f3
verified
adamo1139
commited on
May 27
Update README.md
8248694
verified
adamo1139
commited on
Jan 22
Upload yi-34b-dpo-unsloth-1.py
91a3383
verified
adamo1139
commited on
Jan 22
Upload 5 files
1e20652
verified
adamo1139
commited on
Jan 22
Upload adapter_model.safetensors
3950041
verified
adamo1139
commited on
Jan 22
initial commit
58980f6
verified
adamo1139
commited on
Jan 22