
Model Card for Model ID

This is a Korean-language Mixture of Experts (MoE) model built from eight LLAMA3-8b models. It was trained with SFT-DPO on eight H100-80G GPUs on GCP by Dr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics, using meta-llama/Meta-Llama-3-8B as the base model. The tokenizer is identical to Llama 3's; this version does not extend the Korean vocabulary. The model integrates LLMs specialized for general question answering (chat), medicine, military, Korean-Chinese-Japanese translation, and coding.

Instruction training took 4 hours (8000 tokens) on the eight H100-80G GPUs. The Accelerate and DeepSpeed ZeRO-3 libraries were used.
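The card gives no usage snippet, so the following is only a minimal sketch of how the model might be loaded with the Hugging Face `transformers` library. It assumes the repository `Saxo/Linkbricks-Horizon-AI-Korean-LLAMA3blend-8x8b` loads through `AutoModelForCausalLM`, that `torch`, `transformers`, and `accelerate` are installed, and that enough GPU memory is available. The helper name `generate` and the plain-text prompt format are illustrative assumptions, not something the card documents.

```python
MODEL_ID = "Saxo/Linkbricks-Horizon-AI-Korean-LLAMA3blend-8x8b"


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Hypothetical helper: load the model and complete a plain-text prompt."""
    # Imported lazily so that defining this function does not require a GPU stack.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # the card lists BF16 as the tensor type
        device_map="auto",           # shard across available GPUs (needs accelerate)
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    with torch.no_grad():
        output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)

    # Keep only the newly generated tokens, not the echoed prompt.
    new_tokens = output_ids[0, inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling `generate("대한민국의 수도는 어디인가요?")` would download the full checkpoint on first use. Whether the repository ships a chat template is not stated in the card, which is why the sketch passes a raw prompt instead of chat-formatted messages.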

www.linkbricks.com, www.linkbricks.vc

Format: Safetensors
Model size: 47.5B params
Tensor type: BF16
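As a rough illustration of what these figures imply for deployment, the sketch below estimates the memory needed for the weights alone. It assumes 2 bytes per BF16 parameter and an 80 GB accelerator such as the H100-80G mentioned above; the function names are hypothetical, and activations, KV cache, and framework overhead are not counted.

```python
import math

# Bytes per parameter for a few common tensor types.
BYTES_PER_PARAM = {"bf16": 2, "fp16": 2, "fp32": 4, "int8": 1}


def weight_memory_gb(num_params: float, dtype: str = "bf16") -> float:
    """Memory for the weights only, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 1e9


def min_gpus(num_params: float, dtype: str = "bf16", gpu_mem_gb: float = 80.0) -> int:
    """Lower bound on the GPU count needed just to hold the weights."""
    return math.ceil(weight_memory_gb(num_params, dtype) / gpu_mem_gb)


if __name__ == "__main__":
    params = 47.5e9  # model size listed on the card
    print(f"weights: {weight_memory_gb(params):.1f} GB")
    print(f"minimum 80 GB GPUs: {min_gpus(params)}")
```

Under these assumptions the BF16 weights take about 95 GB, so they do not fit on a single 80 GB GPU and need to be sharded across at least two devices or quantized.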

Dataset used to train Saxo/Linkbricks-Horizon-AI-Korean-LLAMA3blend-8x8b