squarelike's picture
Update README.md
c6a992a
metadata
language:
  - ko
tags:
  - pytorch
  - causal-lm
license: llama2
pipeline_tag: text-generation

llama-2-ko-story-7b

llama-2-koen-story-13b๋Š” beomi/llama-2-koen-13b๋ฅผ ๊ธฐ๋ฐ˜์œผ๋กœ ํ•œ๊ธ€ ์†Œ์„ค raw ๋ฐ์ดํ„ฐ๋ฅผ ํ•™์Šต์‹œํ‚จ ๊ธฐ๋ฐ˜ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.

ํ•™์Šต ๋ฐ์ดํ„ฐ

llama-2-koen-story-13b๋Š” ์•ฝ 167MB์˜ ํ•œ๊ธ€ ์†Œ์„ค ๋ง๋ญ‰์น˜๋กœ ํ•™์Šต๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์ฃผ์š” ๋ฐ์ดํ„ฐ์…‹์€ ๋‹ค์Œ๊ณผ ๊ฐ™์Šต๋‹ˆ๋‹ค.

Source Size (MB) Link
ํ•œ๊ธ€ ์†Œ์„ค ๋ง๋ญ‰์น˜ 115.0
๊ณต์œ ๋งˆ๋‹น ํ•œ๊ตญ ๊ณ ์ „ ๋ฌธํ•™ ๋ง๋ญ‰์น˜ 53.0 https://gongu.copyright.or.kr/

ํ•™์Šต

llama-2-koen-story-13b๋Š” beomi/llama-2-koen-13b์—์„œ qlora๋กœ ์ถ”๊ฐ€ ํ•™์Šต๋˜์—ˆ์Šต๋‹ˆ๋‹ค.

  • lora_alpha: 16
  • lora_dropout: 0.05
  • lora_r: 32
  • target_modules: q_proj, v_proj
  • epoch: 3
  • learning_rate: 3e-4