Model Description
Finetuning based on chihoonlee10/T3Q-ko-solar-dpo-v1.0.
Training Method
Using Deepspeed, Accelerate, TRL etc.
Datasets
TBA
- Downloads last month
- 1,198
Finetuning based on chihoonlee10/T3Q-ko-solar-dpo-v1.0.
Using Deepspeed, Accelerate, TRL etc.
TBA