
The license is cc-by-nc-sa-4.0.

๐Ÿปโ€โ„๏ธDopeorNope/COKAL-ko-v1-70B๐Ÿปโ€โ„๏ธ


Model Details

Model Developers DopeorNope (Seungyoo Lee)

Hello! This is COKAL-ko-v1-70B, a model released to support the growth of the open-source community.

Unlike most existing model cards, this card was originally written in Korean.

The model was built as a Korean-specialized model from the start, and it is released this way so that a wide range of people can use it comfortably.

์ด ๋ชจ๋ธ์˜ ๋ฒ ์ด์Šค ๋ชจ๋ธ์€, beomi๋‹˜์˜ beomi/llama-2-ko-70b์ž…๋‹ˆ๋‹ค.

๋ฒ ์ด์Šค ๋ชจ๋ธ์— ์–‘์žํ™” ํ•˜์—ฌ์„œ ํƒ€๊ฒŸ๋ชจ๋“ˆ ๊ฐ€๋Šฅํ•œ ๋Œ€๋ถ€๋ถ„ ๋Š˜๋ฆฌ๊ณ  ์•ฝ 15,000,000๊ฐœ์˜ ํ•œ๊ตญ์–ด ํ† ํฐ์„ ๋จน์˜€์Šต๋‹ˆ๋‹ค.

Training took about one week, and tuning used 8 A100 GPUs.

No benchmarks have been run separately, but it is worth trying the model out.

There are not many large Korean models, which is why I tried this and released it openly.

The model focuses on broadening general knowledge as much as possible, so performance may vary depending on the specific task you target.

Thank you.

Implementation Code


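from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

# Hugging Face Hub ID of the model (named model_id to avoid shadowing the built-in dir)
model_id = "DopeorNope/COKAL-ko-v1-70B"

# Load the weights in half precision; device_map="auto" (requires the accelerate
# package) places the layers across the available devices
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained(model_id)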
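
As a rough guide to why the snippet above loads in float16 and spreads layers across devices, the following sketch estimates the memory needed for the weights alone. It uses the 69.2B parameter count listed for this model; the helper name weight_memory_gb is hypothetical, and the estimate ignores activations, the KV cache, and framework overhead.

```python
NUM_PARAMS = 69.2e9  # parameter count listed for this model

# Storage cost per parameter for a few common precisions
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1, "4-bit": 0.5}


def weight_memory_gb(num_params, bytes_per_param):
    """Memory for the weights only, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


for dtype, size in BYTES_PER_PARAM.items():
    print(f"{dtype:>8}: {weight_memory_gb(NUM_PARAMS, size):.1f} GB")
```

In float16 the weights alone come to about 138 GB, more than a single 80 GB A100 holds, so sharding across several GPUs with device_map="auto" (or loading in a lower precision) is needed in practice.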

Safetensors: model size 69.2B params, tensor type F32