Edit model card

Model Card for Model ID

Model Description

midm-bitext-S-7B-inst-v1 ๋ฏธ์„ธ ํŠœ๋‹

ํ•ด๋‹น ๋ชจ๋ธ์€ ๋„ค์ด๋ฒ„ ์˜ํ™” ๋ฆฌ๋ทฐ ๋ฐ์ดํ„ฐ์…‹์ธ NSMC์— ๋Œ€ํ•ด KT-AI/midm-bitext-S-7B-inst-v1์„ ๋ฏธ์„ธํŠœ๋‹ํ•œ ๋ชจ๋ธ์ž…๋‹ˆ๋‹ค.

์˜ํ™” ๋ฆฌ๋ทฐ ํ…์ŠคํŠธ๋ฅผ ํ”„๋กฌํ”„ํŠธ์— ํฌํ•จํ•˜์—ฌ ๋ชจ๋ธ์— ์ž…๋ ฅ์‹œ,'๊ธ์ •' ๋˜๋Š” '๋ถ€์ •' ์ด๋ผ๊ณ  ์˜ˆ์ธก ํ…์ŠคํŠธ๋ฅผ ์ง์ ‘ ์ƒ์„ฑํ•ฉ๋‹ˆ๋‹ค.

๊ฒฐ๊ณผ์ ์œผ๋กœ, ์ •ํ™•๋„ 90.0%๋ฅผ ๊ฐ€์ง€๋Š” ๋ชจ๋ธ์„ ์™„์„ฑํ–ˆ์Šต๋‹ˆ๋‹ค.

Train, Test ๋ฐ์ดํ„ฐ์…‹

ํ•ด๋‹น ๋ชจ๋ธ์€ NSMC์˜ train ๋ฐ์ดํ„ฐ์˜ ์ƒ์œ„ 2,000๊ฐœ์˜ ์ƒ˜ํ”Œ์„ ํ•™์Šต์— ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค.

ํ•ด๋‹น ๋ชจ๋ธ์€ NSMC์˜ test ๋ฐ์ดํ„ฐ์˜ ์ƒ์œ„ 1,000๊ฐœ์˜ ์ƒ˜ํ”Œ์„ ํ‰๊ฐ€์— ์‚ฌ์šฉํ–ˆ์Šต๋‹ˆ๋‹ค.

Training_step_loss

image/png

Confusion_Matrix

image/png

Accuracy_Classification_Report

image/png

Training procedure

The following bitsandbytes quantization config was used during training:

  • quant_method: bitsandbytes
  • load_in_8bit: False
  • load_in_4bit: True
  • llm_int8_threshold: 6.0
  • llm_int8_skip_modules: None
  • llm_int8_enable_fp32_cpu_offload: False
  • llm_int8_has_fp16_weight: False
  • bnb_4bit_quant_type: nf4
  • bnb_4bit_use_double_quant: False
  • bnb_4bit_compute_dtype: bfloat16

Framework versions

  • PEFT 0.7.0
Downloads last month
1
Unable to determine this modelโ€™s pipeline type. Check the docs .

Adapter for

Dataset used to train seojin0128/hw-midm-7B-nsmc