陈俊杰
commited on
Commit
·
b0e4c3b
1
Parent(s):
932ed5c
time
Browse files
app.py
CHANGED
@@ -286,10 +286,11 @@ Please feel free to contact us! 😉
|
|
286 |
</p>""",unsafe_allow_html=True)
|
287 |
elif page == "References":
|
288 |
st.header("References")
|
289 |
-
st.markdown("""
|
|
|
290 |
[2] Chang Y, Wang X, Wang J, et al. A survey on evaluation of large language models. <a href="https://dl.acm.org/doi/pdf/10.1145/3641289">pdf</a><br />
|
291 |
[3] Chan C M, Chen W, Su Y, et al. Chateval: Towards better llm-based evaluators through multi-agent debate. <a href="https://arxiv.org/pdf/2308.07201">pdf</a><br />
|
292 |
[4] Li R, Patel T, Du X. Prd: Peer rank and discussion improve large language model based evaluations. <a href="https://arxiv.org/pdf/2307.02762">pdf</a><br />
|
293 |
[5] Chu Z, Ai Q, Tu Y, et al. Pre: A peer review based large language model evaluator. <a href="https://arxiv.org/pdf/2401.15641">pdf</a></p>
|
294 |
-
""")
|
295 |
|
|
|
286 |
</p>""",unsafe_allow_html=True)
|
287 |
elif page == "References":
|
288 |
st.header("References")
|
289 |
+
st.markdown("""
|
290 |
+
<p class='main-text'>[1] Mao R, Chen G, Zhang X, et al. GPTEval: A survey on assessments of ChatGPT and GPT-4. <a href="https://arxiv.org/pdf/2308.12488">pdf</a><br />
|
291 |
[2] Chang Y, Wang X, Wang J, et al. A survey on evaluation of large language models. <a href="https://dl.acm.org/doi/pdf/10.1145/3641289">pdf</a><br />
|
292 |
[3] Chan C M, Chen W, Su Y, et al. Chateval: Towards better llm-based evaluators through multi-agent debate. <a href="https://arxiv.org/pdf/2308.07201">pdf</a><br />
|
293 |
[4] Li R, Patel T, Du X. Prd: Peer rank and discussion improve large language model based evaluations. <a href="https://arxiv.org/pdf/2307.02762">pdf</a><br />
|
294 |
[5] Chu Z, Ai Q, Tu Y, et al. Pre: A peer review based large language model evaluator. <a href="https://arxiv.org/pdf/2401.15641">pdf</a></p>
|
295 |
+
""",unsafe_allow_html=True)
|
296 |
|