
Model Details

Model Developers: Seungyoo Lee (DopeorNope)

This model is a new architecture built on Mistral Base and has 10.7B parameters. (It is not based on Solar or Synatra.)

It was pretrained on about 1.5B tokens, but it is still experimental and will be retrained and released again in the future.

It is uploaded here for testing purposes.

The model supports a context length of up to 32k. A more carefully designed version will be uploaded in the future.

Safetensors
Model size: 10.8B params
Tensor type: F32
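
As a rough guide to what the listed size and tensor type imply for memory, the sketch below estimates the footprint of the raw weights only. It assumes the standard widths of 4 bytes per F32 parameter and 2 bytes per F16/BF16 parameter; the helper name `weight_memory_gb` is hypothetical, and the half-precision figures are an illustration, not something this card states. Activations and the KV cache for long contexts are not included.

```python
# Bytes per parameter for common tensor types (standard widths).
BYTES_PER_PARAM = {"F32": 4, "F16": 2, "BF16": 2}


def weight_memory_gb(num_params: float, tensor_type: str = "F32") -> float:
    """Approximate size of the raw weights in decimal gigabytes (1 GB = 1e9 bytes).

    Hypothetical helper for illustration; it ignores activations,
    KV cache, and any framework overhead.
    """
    return num_params * BYTES_PER_PARAM[tensor_type] / 1e9


if __name__ == "__main__":
    num_params = 10.8e9  # parameter count listed on this card
    for tensor_type in ("F32", "F16"):
        size = weight_memory_gb(num_params, tensor_type)
        print(f"{tensor_type}: ~{size:.1f} GB of weights")
```

Under these assumptions the F32 checkpoint needs roughly 43 GB for the weights alone, so loading in a 16-bit type (if the weights tolerate it) would halve that.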

Collection including DopeorNope/Mistralopithecus-v0.1-10.8B