lmzheng commited on
Commit
de56c35
1 Parent(s): 5070998

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -39,6 +39,8 @@ See more details in the "Training Details of Vicuna Models" section in the appen
39
 
40
  ## Evaluation
41
 
 
 
42
  Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
43
 
44
  ## Difference between different versions of Vicuna
 
39
 
40
  ## Evaluation
41
 
42
+ ![Evaluation Results](https://github.com/lm-sys/lm-sys.github.io/blob/main/public/images/webdata/vicuna_v1.5_eval.png?raw=true)
43
+
44
  Vicuna is evaluated with standard benchmarks, human preference, and LLM-as-a-judge. See more details in this [paper](https://arxiv.org/pdf/2306.05685.pdf) and [leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard).
45
 
46
  ## Difference between different versions of Vicuna