Spaces:
Running
Running
model_name,method,rouge1,rougeL,semantic_sim,LCS(character),LCS(word),ACS(word),Levenshtein Distance,Minhash Similarity,MMLU,MT-Bench,Blocklisted rougeL,In-Domain rougeL,Efficiency | |
llama2-70b-chat-hf_books_rag,vanilla,0.99009900990099,0.99009900990099,0.9815567135810852,741.0,160.0,160.0,1451.0,0.984375,0.619,7.1,0.156,0.161,1.00 | |
llama2-70b-chat-hf_books_rag,sys_prompt_bing,0.99009900990099,0.99009900990099,0.9815567135810852,741.0,160.0,160.0,1513.0,0.984375,0.614,7.2,0.136,0.144,1.00 | |
llama2-70b-chat-hf_books_rag,top_k_3,0.84251968503937,0.8346456692913385,0.9569202065467834,695.0,158.0,158.0,1577.0,0.796875,0.361,4.8,0.145,0.146,0.99 | |
llama2-70b-chat-hf_books_rag,memfree_6,0.8925081433224755,0.8794788273615635,0.9494256973266602,349.0,71.0,140.0,1506.0,0.78125,0.619,6.6,0.152,0.160,0.99 | |