Spaces:

THUIR
/

AEOLLM

Running

陈俊杰 commited on Sep 3, 2024

Commit

03d0a26

1 Parent(s): 9f4f414

task

Files changed (1) hide show

app.py CHANGED Viewed

@@ -107,7 +107,9 @@ elif page == "Methodology":
 elif page == "Datasets":
     st.header("Answer Generation")
     st.markdown("""
-We randomly sampled **100 instances** from **each** dataset as the question set and selected **7 different LLMs** to generate answers, forming the answer set. As a result, each dataset produced 700 instances, totaling **2,800 instances across the four datasets**.
 """)
     st.header("Human Annotation")
     st.markdown("""
@@ -212,7 +214,7 @@ elif page == "Data and File format":
 elif page == "Submit":
     st.header("Submit")
     st.markdown("""
-We will be following a similar format as the ones used by most **TREC submissions**: White space is used to separate columns. The width of the columns in the format is not important, but it is important to have exactly five columns per line with at least one space between the columns.
 **taskId  questionId  answerId  score  rank**
@@ -229,11 +231,10 @@ We will be following a similar format as the ones used by most **TREC submission
 🔗 An example of the submission file content is [here](https://huggingface.co/spaces/THUIR/AEOLLM/blob/main/baseline_example/output/baseline1_chatglm3_6B.txt).
     """)
 elif page == "LeaderBoard":
-    st.header("LeaderBoard")
     # # 描述
     st.markdown("""
 <p class='main-text'>
-NTCIR-18 Automatic Evaluation Methods of LLMs (AEOLLM) Leaderboard.
 </p>
     """, unsafe_allow_html=True)
     df = {

 elif page == "Datasets":
     st.header("Answer Generation")
     st.markdown("""
+We randomly sampled **100 instances** from **each** dataset as the question set and selected **7 different LLMs** to generate answers, forming the answer set.
+As a result, each dataset produced 700 instances, totaling **2,800 instances across the four datasets**.
 """)
     st.header("Human Annotation")
     st.markdown("""
 elif page == "Submit":
     st.header("Submit")
     st.markdown("""
+We will be following a similar format as the ones used by most **TREC submissions**: white space is used to separate columns. The width of the columns in the format is not important, but it is important to have exactly five columns per line with at least one space between the columns.
 **taskId  questionId  answerId  score  rank**
 🔗 An example of the submission file content is [here](https://huggingface.co/spaces/THUIR/AEOLLM/blob/main/baseline_example/output/baseline1_chatglm3_6B.txt).
     """)
 elif page == "LeaderBoard":
     # # 描述
     st.markdown("""
 <p class='main-text'>
+🏆 NTCIR-18 Automatic Evaluation Methods of LLMs (AEOLLM) task Leaderboard.
 </p>
     """, unsafe_allow_html=True)
     df = {