VARCO_Arena / guide_mds /input_jsonls_kr.md
sonsus's picture
others
c2ba4d5

A newer version of the Streamlit SDK is available: 1.41.1

Upgrade

[KR] ์ง‘์–ด๋„ฃ์„ jsonl ํŒŒ์ผ ๊ฐ€์ด๋“œ

๋น„๊ตํ•  ๋ชจ๋ธ์ด ๋‹ค์„ฏ ๊ฐœ๋ผ๋ฉด ๋‹ค์„ฏ ๊ฐœ์˜ .jsonl ํŒŒ์ผ์„ ์—…๋กœ๋“œํ•˜์„ธ์š”.

  • ๐Ÿ’ฅ๋ชจ๋“  jsonl ์€ ๊ฐ™์€ ์ˆ˜์˜ ํ–‰์„ ๊ฐ€์ ธ์•ผํ•ฉ๋‹ˆ๋‹ค.
  • ๐Ÿ’ฅmodel_id ํ•„๋“œ๋Š” ํŒŒ์ผ๋งˆ๋‹ค ๋‹ฌ๋ผ์•ผํ•˜๋ฉฐ ํŒŒ์ผ ๋‚ด์—์„œ๋Š” ์œ ์ผํ•ด์•ผํ•ฉ๋‹ˆ๋‹ค.

jsonl ํ•„์ˆ˜ ํ•„๋“œ

  • ๊ฐœ๋ณ„

    • model_id: ํ‰๊ฐ€๋ฐ›๋Š” ๋ชจ๋ธ์˜ ์ด๋ฆ„์ž…๋‹ˆ๋‹ค. (์งง๊ฒŒ ์“ฐ๋Š” ๊ฒƒ ์ถ”์ฒœ)
    • generated: ๋ชจ๋ธ์ด testset instruction ์— ์ƒ์„ฑํ•œ ์‘๋‹ต์„ ๋„ฃ์œผ์„ธ์š”.
  • ๋ฒˆ์—ญํ‰๊ฐ€ ํ”„๋กฌํ”„ํŠธ ์‚ฌ์šฉ์‹œ (translation_pair. streamlit_app_local/user_submit/mt/llama5.jsonl ์—์„œ ์˜ˆ์‹œ ๋ณผ ์ˆ˜ ์žˆ์Œ)

    • source_lang: input language (e.g. Korean, KR, kor, ...)
    • target_lang: output language (e.g. English, EN, ...)
  • ๊ณตํ†ต ๋ถ€๋ถ„ (๋ชจ๋“  ํŒŒ์ผ์— ๋Œ€ํ•ด ๊ฐ™์•„์•ผ ํ•จ)

    • instruction: ๋ชจ๋ธ์— ์ง‘์–ด๋„ฃ๋Š” testset instruction ํ˜น์€ input์— ํ•ด๋‹นํ•˜๋Š” ๋ฌด์–ธ๊ฐ€์ž…๋‹ˆ๋‹ค.
    • task: ์ „์ฒด ๊ฒฐ๊ณผ๋ฅผ subset์œผ๋กœ ๊ทธ๋ฃน์ง€์–ด์„œ ๋ณด์—ฌ์ค„ ๋•Œ ์‚ฌ์šฉ๋ฉ๋‹ˆ๋‹ค. evaluation prompt๋ฅผ ํ–‰๋ณ„๋กœ ๋‹ค๋ฅด๊ฒŒ ์‚ฌ์šฉํ•˜๊ณ  ์‹ถ์„ ๋•Œ ํ™œ์šฉ๋  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

๊ฐ jsonl ํŒŒ์ผ์€ ์•„๋ž˜์ฒ˜๋Ÿผ ์ƒ๊ฒผ์Šต๋‹ˆ๋‹ค.

# model1.jsonl
{"model_id": "๋ชจ๋ธ1", "task": "๊ธธ ๋ฌป๊ธฐ", "instruction": "์–ด๋””๋กœ ๊ฐ€์•ผํ•˜์˜ค", "generated": "์ €๊ธฐ๋กœ์š”"}
{"model_id": "๋ชจ๋ธ1", "task": "์‚ฐ์ˆ˜", "instruction": "1+1", "generated": "2"} # ๊ธธ ๋ฌป๊ธฐ์™€ ์‚ฐ์ˆ˜์˜ ๊ฒฝ์šฐ ๋‹ค๋ฅธ ํ‰๊ฐ€ ํ”„๋กฌํ”„ํŠธ๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์‹ถ์„ ์ˆ˜ ์žˆ๊ฒ ์ฃ ?

# model2.jsonl -* model1.jsonl๊ณผ `instruction`์€ ๊ฐ™๊ณ  `generated`, `model_id` ๋Š” ๋‹ค๋ฆ…๋‹ˆ๋‹ค!
{"model_id": "๋ชจ๋ธ2", "task": "๊ธธ ๋ฌป๊ธฐ", "instruction": "์–ด๋””๋กœ ๊ฐ€์•ผํ•˜์˜ค", "generated": "ํ•˜์ด"}
{"model_id": "๋ชจ๋ธ2", "task": "์‚ฐ์ˆ˜", "instruction": "1+1", "generated": "3"}

...
..

์˜ˆ๋ฅผ ๋“ค์–ด, ํ•œ๊ฐ€์ง€ ๋ชจ๋ธ์— ๋Œ€ํ•ด ๋‹ค๋ฅธ ํ”„๋กฌํ”„ํŒ…์„ ์‹œ๋„ํ•˜์—ฌ ๋‹ค๋ฅธ ์ƒ์„ฑ๋ฌธ์„ ์–ป์—ˆ๊ณ  ์ด๋ฅผ ๋น„๊ตํ•˜๊ณ  ์‹ถ์€ ๊ฒฝ์šฐ๋ฅผ ์ƒ๊ฐํ•ด๋ด…์‹œ๋‹ค. ์ด ๋•Œ ํ‰๊ฐ€๋ฐ›์„ testset์€ ๊ฐ™์œผ๋ฏ€๋กœ instruction์€ ๋ชจ๋‘ ๊ฐ™๊ณ  ํ”„๋กฌํ”„ํŒ…์— ๋”ฐ๋ผ generated๋Š” ๋‹ฌ๋ผ์ง€๊ฒ ์ฃ ? model_id ๋Š” "prompt1", "prompt2" ๋“ฑ ์ทจํ–ฅ์— ๋งž๊ฒŒ ์ ์–ด์ฃผ์‹œ๋ฉด ๋ฉ๋‹ˆ๋‹ค.