Saxo/Linkbricks-Horizon-AI-Korean-llama3-sft-dpo-8b-base

Text Generation

text-generation-inference

Inference Endpoints

Saxo committed on Jul 2, 2024

Commit ee70e42 • 1 Parent(s): 3fbabe1

Update README.md

Files changed (1)

README.md +1 -2

README.md CHANGED

@@ -17,8 +17,7 @@ pipeline_tag: text-generation
 </div>
-Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained this model with SFT-DPO (8000 Tokens) on 8 H100-60Gs on GCP, using meta-llama/Meta-Llama-3-8B as the base model.
- Accelerate and Deepspeed Zero-3 libraries were used, with Flash Attention set to Disable.
 Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained the meta-llama/Meta-Llama-3-8B base model on 8 H100-60Gs on GCP for 4 hours of instructional training (8000 Tokens).
 Accelerate, Deepspeed Zero-3 libraries were used.

 </div>
+Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained this Korean-language model with SFT-DPO (8000 Tokens) on 8 H100-60Gs on GCP, using meta-llama/Meta-Llama-3-8B as the base model.
 Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained the meta-llama/Meta-Llama-3-8B base model on 8 H100-60Gs on GCP for 4 hours of instructional training (8000 Tokens).
 Accelerate, Deepspeed Zero-3 libraries were used.