Spaces:
Running
Running
| """ | |
| λ°μ΄ν°μ λ€μ΄λ‘λ ν UI μ»΄ν¬λνΈ | |
| πΎ λ°μ΄ν°μ λ€μ΄λ‘λ νμ UIμ λ‘μ§μ κ΄λ¦¬ν©λλ€. | |
| """ | |
| import gradio as gr | |
| import pandas as pd | |
| def create_dataset_tab(): | |
| """λ°μ΄ν°μ λ€μ΄λ‘λ ν UI μμ±""" | |
| # λ°μ΄ν°μ 미리보기 λ‘λ (μ΄κΈ°ν μ ν λ²λ§) | |
| try: | |
| dev_preview_data = pd.read_csv("data/public/ko-freshqa_2025_dev.csv").head(5) | |
| test_preview_data = pd.read_csv("data/public/ko-freshqa_2025_test.csv").head(5) | |
| except Exception as e: | |
| print(f"β οΈ λ°μ΄ν°μ 미리보기 λ‘λ μ€ν¨: {e}") | |
| dev_preview_data = pd.DataFrame() | |
| test_preview_data = pd.DataFrame() | |
| gr.Markdown(""" | |
| ### Ko-FreshQA Dataset | |
| - μ΄ λ°μ΄ν°μ λ° λ¦¬λ보λλ [FreshQA](https://github.com/freshllms/freshqa)μμ μκ°μ λ°μ λ§λ€μ΄μ‘μ΅λλ€. | |
| - fact type(fast changing, slow changing, never changing), μ μ μ μ ν¨μ±, 10κ°μ λλ©μΈμ λ°λΌ λλλ μ§λ¬Έλ€μ ν΅ν΄ νκ΅μ΄ μ§μκ³Ό κ΄λ ¨λ LLMμ μ΅μ μ±μ νλ¨ν μ μμ΅λλ€. | |
| - κ²μ¦ λ° νκ°μ νμν λ°μ΄ν°μ μ μ£ΌκΈ°μ μΌλ‘ μ λ°μ΄νΈν μμ μ λλ€. | |
| <br> | |
| ### Ko-FreshQA λ°μ΄ν°μ μ μλμ κ°μ νΉμ§μ κ°μ§κ³ μμ΅λλ€. | |
| - **fact type** | |
| - μκ°μ νλ¦μ λ°λ₯Έ λ΅λ³μ λ³λ κ°λ₯μ±μ λ°λΌ μ§λ¬Έμ μλμ μΈ κ°μ§λ‘ λΆλ₯λ©λλ€. | |
| - **fast changing** : λ΅λ³μ΄ λ³΄ν΅ 1λ λλ κ·Έ μ΄λ΄μ λ³νλ μ§λ¬Έ | |
| - **slow changing** : λ΅λ³μ΄ λͺ λ μ κ±Έμ³ λ³νλ μ§λ¬Έ | |
| - **never changing** : μμ¬μ μ¬κ±΄, μ§μ€κ³Ό κ°μ΄ λ΅λ³μ΄ κ±°μ λ³νμ§ μλ μ§λ¬Έ | |
| - **μ μ μ ν¨μ±** | |
| - **false premise (T/F)** : μ§λ¬Έμ ν¬ν¨λ μ μ μμ²΄κ° μλͺ»λμ΄ μμΌλ©΄ True, μ μ μ λ¬Έμ κ° μμΌλ©΄ False | |
| - **one/multi hop** | |
| - λ΅λ³μ μμ±νκΈ° μν΄ νμν μΆλ‘ μ κ°μμ λ°λΌ μ§λ¬Έμ one hop, multi hopμΌλ‘ λΆλ₯ν©λλ€. | |
| - **λλ©μΈ** | |
| - λͺ¨λ μ§λ¬Έκ³Ό λλ΅μ λ€μ λλ©μΈ μ€ νλλ‘ λΆλ₯λ©λλ€. | |
| - μ μΉ, μ€ν¬μΈ , μ°μ, λ μ¨, μΈκ³, κ²½μ , μ¬ν, IT/κ³Όν, μν/λ¬Έν, UNK | |
| - **λλ¨Έμ§ λ©ν μ 보** | |
| - **effective year** : μ§λ¬Έμ λ΅λ³μ΄ λ§μ§λ§μΌλ‘ λ³κ²½λ μ°λ | |
| - **next review** : μμλλ λ€μ κ²ν λ μ§ | |
| - **source** : μ§λ¬Έ/λ΅λ³μ λν μ 보λ₯Ό μ°Ύμ μ μλ μΆμ² | |
| <br> | |
| """) | |
| with gr.Column(elem_classes=["leaderboard-group"]): | |
| with gr.Row(): | |
| with gr.Column(): | |
| gr.Markdown("### π§ͺ DEV λ°μ΄ν°μ (κ°λ°/κ²μ¦μ©)") | |
| gr.Markdown(""" | |
| **Dev set**: 550μ | |
| - λͺ¨λΈ κ°λ° λ° κ²μ¦μ μν΄ μ¬μ©ν μ μμ΅λλ€. | |
| - μ λ΅μ λΉλ‘―νμ¬ λͺ¨λ λ©νλ°μ΄ν°κ° μ 곡λ©λλ€. | |
| """) | |
| # DEV λ°μ΄ν°μ λ€μ΄λ‘λ λ²νΌ | |
| dev_download_btn = gr.DownloadButton( | |
| "πΎ DEV λ°μ΄ν°μ λ€μ΄λ‘λ", | |
| value="data/public/ko-freshqa_2025_dev.csv", | |
| variant="primary", | |
| size="lg" | |
| ) | |
| # DEV λ°μ΄ν°μ 미리보기 | |
| dev_preview = gr.DataFrame( | |
| value=lambda: pd.read_csv("data/public/ko-freshqa_2025_dev.csv").head(5), | |
| interactive=False, | |
| label="" | |
| ) | |
| with gr.Column(): | |
| gr.Markdown("### π― TEST λ°μ΄ν°μ (μ΅μ’ νκ°μ©)") | |
| gr.Markdown(""" | |
| **Test set**: 3,000κ° | |
| - 리λ보λ μ μΆμ μν νκ°μ© λ°μ΄ν°μ μ λλ€. | |
| - model_responseλ₯Ό μ±μμ μ μΆν΄μ£ΌμΈμ. | |
| """) | |
| # TEST λ°μ΄ν°μ λ€μ΄λ‘λ λ²νΌ | |
| test_download_btn = gr.DownloadButton( | |
| "πΎ TEST λ°μ΄ν°μ λ€μ΄λ‘λ", | |
| value="data/public/ko-freshqa_2025_test.csv", | |
| variant="primary", | |
| size="lg" | |
| ) | |
| # TEST λ°μ΄ν°μ 미리보기 | |
| test_preview = gr.DataFrame( | |
| value=lambda: pd.read_csv("data/public/ko-freshqa_2025_test.csv").head(5), | |
| interactive=False, | |
| label="" | |
| ) | |
| # λ€μ΄λ‘λ μλ΄ λ©μμ§ | |
| gr.Markdown(""" | |
| <br> | |
| ### π‘ λ€μ΄λ‘λ μλ΄ | |
| - μμ λ€μ΄λ‘λ λ²νΌμ ν΄λ¦νλ©΄ λΈλΌμ°μ μμ μλμΌλ‘ νμΌ λ€μ΄λ‘λκ° μμλ©λλ€. | |
| - **DEV λ°μ΄ν°μ **μ λͺ¨λΈ κ°λ° λ° κ²μ¦μ©μΌλ‘ μ¬μ©νμΈμ. | |
| - **TEST λ°μ΄ν°μ **μ μ΅μ’ νκ° λ° λ¦¬λ보λ μ μΆμ©μΌλ‘ μ¬μ©νμΈμ. | |
| - λ€μ΄λ‘λλ νμΌμ **CSV νμ**, **UTF-8 μΈμ½λ©**μΌλ‘ μ μ₯λ©λλ€. | |
| <br> | |
| """) | |
| # License & References | |
| gr.Markdown(""" | |
| ### π License & References | |
| - λ³Έ λ°μ΄ν°μ μ **CC-BY-ND-NC 4.0 (μ μμνμ Β· λ³κ²½ κΈμ§ Β· λΉμ리)** λΌμ΄μ μ€λ‘ μ 곡λ©λλ€. | |
| - μ΄ λ¦¬λ보λλ IITPμ **βμμ±ν μΈμ΄λͺ¨λΈμ μ§μκ°λ₯μ±κ³Ό μκ°μ νλ¦μ λ°λ₯Έ μ΅μ μ± λ°μμ μν νμ΅ λ° νμ© κΈ°μ κ°λ°β** μ¬μ μ μ§μμ λ°μ μ μλμμ΅λλ€. | |
| - μ΄ μμ€ν μ FreshLLMs νλ‘μ νΈμ **FreshQA λ°μ΄ν°μ κ³Ό νκ° λ°©λ²λ‘ **μ κΈ°λ°μΌλ‘ ꡬμΆλμμ΅λλ€. | |
| - μλ³Έ FreshQAλ λ§ν¬λ₯Ό μ°Έκ³ ν΄ μ£ΌμΈμ. π https://github.com/freshllms/freshqa | |
| - λ°μ΄ν°μ λ° λ¦¬λ보λμ κ΄νμ¬ λ¬Έμμ¬νμ΄ μμ κ²½μ°, taehwan.oh@upstage.aiλ‘ μ°λ½ν΄μ£ΌμΈμ. | |
| """) | |