Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
adamo1139
/
Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3
like
1
Text Generation
Transformers
Safetensors
adamo1139/rawrr_v1
llama
dpo
qlora
unsloth
text-generation-inference
Inference Endpoints
4-bit precision
bitsandbytes
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
main
Yi-34B-200K-rawrr1-LORA-DPO-experimental-r3
1 contributor
History:
6 commits
adamo1139
Update README.md
30235f3
verified
6 months ago
.gitattributes
Safe
1.52 kB
initial commit
10 months ago
LICENSE
0 Bytes
initial commit
10 months ago
README.md
Safe
822 Bytes
Update README.md
6 months ago
adapter_config.json
Safe
568 Bytes
Upload 5 files
10 months ago
adapter_model.safetensors
Safe
492 MB
LFS
Upload adapter_model.safetensors
10 months ago
config.json
Safe
1.1 kB
Upload 5 files
10 months ago
special_tokens_map.json
Safe
573 Bytes
Upload 5 files
10 months ago
tokenizer.model
Safe
1.03 MB
LFS
Upload 5 files
10 months ago
tokenizer_config.json
Safe
1 kB
Upload 5 files
10 months ago
yi-34b-dpo-unsloth-1.py
Safe
4.64 kB
Upload yi-34b-dpo-unsloth-1.py
10 months ago