Model Card for Model ID

AI 와 빅데이터 뢄석 μ „λ¬Έ 기업인 Linkbricks의 λ°μ΄ν„°μ‚¬μ΄μ–Έν‹°μŠ€νŠΈμΈ μ§€μœ€μ„±(Saxo) 이사가
Hermes-3-Llama-3.1-8B 베이슀λͺ¨λΈμ„ μ‚¬μš©ν•΄μ„œ H100-80G 8개λ₯Ό 톡해 μ•½ 25%μ •λ„μ˜ νŒŒλΌλ―Έν„°λ₯Ό ν•œκ΅­μ–΄ CPT(Continued-Pretraining)->SFT->DPO ν•œ ν•œκΈ€ μ–Έμ–΄ λͺ¨λΈ
천만건의 ν•œκΈ€ λ‰΄μŠ€ μ½”νΌμŠ€λ₯Ό κΈ°μ€€μœΌλ‘œ λ‹€μ–‘ν•œ ν…ŒμŠ€ν¬λ³„ ν•œκ΅­μ–΄-쀑ꡭ어-μ˜μ–΄-일본어 ꡐ차 ν•™μŠ΅ 데이터와 μˆ˜ν•™ 및 λ…Όλ¦¬νŒλ‹¨ 데이터λ₯Ό ν†΅ν•˜μ—¬ ν•œμ€‘μΌμ˜ μ–Έμ–΄ ꡐ차 증강 μ²˜λ¦¬μ™€ λ³΅μž‘ν•œ 논리 문제 μ—­μ‹œ λŒ€μ‘ κ°€λŠ₯ν•˜λ„λ‘ ν›ˆλ ¨ν•œ λͺ¨λΈμ΄λ‹€.
-ν† ν¬λ‚˜μ΄μ €λŠ” 단어 ν™•μž₯ 없이 베이슀 λͺ¨λΈ κ·ΈλŒ€λ‘œ μ‚¬μš©
-고객 λ¦¬λ·°λ‚˜ μ†Œμ…œ ν¬μŠ€νŒ… 고차원 뢄석 및 μ½”λ”©κ³Ό μž‘λ¬Έ, μˆ˜ν•™, λ…Όλ¦¬νŒλ‹¨ 등이 κ°•ν™”λœ λͺ¨λΈ
-128k-Context Window
-ν•œκΈ€ Function Call 및 Tool Calling 지원
-Deepspeed Stage=3, rslora 및 BAdam Layer Mode μ‚¬μš©
-ollama run benedict/linkbricks-hermes3-llama3.1-8b-korean-advanced-q4
-ollama run benedict/linkbricks-hermes3-llama3.1-8b-korean-advanced-q8

Finetuned by Mr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company specializing in AI and big data analytics
about 25% of total parameters Korean CPT(Continued-Pretraining)->SFT->DPO training model based on Hermes-3-Llama-3.1-8B through 8 H100-80Gs as a Korean language model
It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems.
-Tokenizer uses the base model without word expansion
-Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing, math and decision making
-128k-Context Window
-Support for Korean Functioncall and Tool Calling
-Deepspeed Stage=3, use rslora and BAdam Layer Mode


www.linkbricks.com, www.linkbricks.vc

Downloads last month
700
Safetensors
Model size
8.03B params
Tensor type
BF16
Β·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for Saxo/Linkbricks-Horizon-AI-Korean-Advanced-8B

Quantized
(41)
this model
Finetunes
2 models
Quantizations
1 model

Datasets used to train Saxo/Linkbricks-Horizon-AI-Korean-Advanced-8B

Spaces using Saxo/Linkbricks-Horizon-AI-Korean-Advanced-8B 3