Saxo
/

yunsung-llama-2-koen-13b-linkbricks-sft-basic-v1

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

Saxo commited on Mar 12, 2024

Commit

ae4910e

·

verified ·

1 Parent(s): e99991b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ pipeline_tag: text-generation
 # Model Card for Model ID
 AI 와 빅데이터 분석 전문 기업인 Linkbricks의 데이터사이언티스트인 지윤성 박사(Saxo)가 beomi/llama-2-koen-13b 베이스모델을 GCC상의 A100-40G 4개를 통해 4시간 SFT 훈련을 한(2048 Tokens) 인스트럭션 모델.
-Accelerate, Deepspeed Zero-3 라이브러리를 사용했다.
 Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained the beomi/llama-2-koen-13b base model on 4 A100-40Gs on GCC for 4 hours of instructional training (2048 Tokens).
 Accelerate, Deepspeed Zero-3 libraries were used.

 # Model Card for Model ID
 AI 와 빅데이터 분석 전문 기업인 Linkbricks의 데이터사이언티스트인 지윤성 박사(Saxo)가 beomi/llama-2-koen-13b 베이스모델을 GCC상의 A100-40G 4개를 통해 4시간 SFT 훈련을 한(2048 Tokens) 인스트럭션 모델.
+ Accelerate, Deepspeed Zero-3 라이브러리를 사용했으며 Flash Attention 은 Disable  로 설정
 Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, trained the beomi/llama-2-koen-13b base model on 4 A100-40Gs on GCC for 4 hours of instructional training (2048 Tokens).
 Accelerate, Deepspeed Zero-3 libraries were used.