---
license: apache-2.0
language:
- en
- ko
tags:
- korean
- translation
- english
- llama
- koalpaca
- polyglot
- translator
---
[![Hits](https://hits.seeyoufarm.com/api/count/incr/badge.svg?url=https%3A%2F%2Fhuggingface.co%2Fgyupro%2FKoalpaca-Translation-KR2EN&count_bg=%2379C83D&title_bg=%23555555&icon=&icon_color=%23E7E7E7&title=hits&edge_flat=false)](https://hits.seeyoufarm.com)
This is a Korean -> English translator.
The training data is the colloquial (spoken-language) translation dataset from AIHUB. [AIHUB](https://aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=71265)
Only the data in the Train folder was used. The code used to preprocess the dataset is available [here](https://github.com/gyupro/Koalpaca-Translation-KR2EN/blob/main/make_dataset.ipynb).
[KO_TO_EN](https://drive.google.com/file/d/12qNXQ3SPKLHGa3PuuAFQD2NI3-qV15Xa/view?usp=sharing): source Korean, target English, 1.2 million sentence pairs
[EN_TO_KO](https://drive.google.com/file/d/1pzgN2PvKfY5cgWR0V9cE1J6JukrrDcRc/view?usp=sharing): source English, target Korean, 1.2 million sentence pairs
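
Both files are described as source/target sentence pairs. The snippet below is a minimal sketch of reading such pairs and reversing their direction. The on-disk layout is not documented here, so the JSON Lines format, the `source` and `target` keys, the helper names `iter_pairs` and `swap_direction`, and the two sample sentences are all assumptions made for illustration; check the preprocessing notebook for the actual format.

```python
import json
from typing import Iterable, Iterator, Tuple

Pair = Tuple[str, str]


def iter_pairs(lines: Iterable[str]) -> Iterator[Pair]:
    """Yield (source, target) tuples from JSON Lines text.

    Assumption: each non-empty line is a JSON object with hypothetical
    "source" and "target" keys. The real files may be laid out differently.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        yield record["source"], record["target"]


def swap_direction(pairs: Iterable[Pair]) -> Iterator[Pair]:
    """Turn source->target pairs into target->source pairs."""
    for source, target in pairs:
        yield target, source


# Illustrative sample records, not taken from the dataset.
sample = [
    '{"source": "안녕하세요.", "target": "Hello."}',
    '{"source": "감사합니다.", "target": "Thank you."}',
]

ko_to_en = list(iter_pairs(sample))
en_to_ko = list(swap_direction(ko_to_en))

print(ko_to_en[0])
print(en_to_ko[0])
```

To read from a downloaded file instead of the in-memory sample, pass an open file handle, for example `iter_pairs(open(path, encoding="utf-8"))`, where `path` is wherever the file was saved.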
For more details, see [GITHUB](https://github.com/gyupro/Koalpaca-Translation-KR2EN).