hanbin commited on
Commit
cb5bb5e
1 Parent(s): 49dca7c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +5 -0
README.md CHANGED
@@ -66,6 +66,11 @@ Simplify your answer as much as possible.
66
  {Instruction} [/INST]
67
  ```
68
 
 
 
 
 
 
69
  ## Citation
70
  ```
71
  @misc{yuan2024advancing,
 
66
  {Instruction} [/INST]
67
  ```
68
 
69
+ ## Evaluation
70
+ - Eurus, both the 7B and 70B variants, achieve the best overall performance among open-source models of similar sizes. Eurus even outperforms specialized models in corresponding domains in many cases. Notably, Eurus-7B outperforms baselines that are 5× larger, and Eurus-70B achieves better performance than GPT-3.5 Turbo.
71
+ - Preference learning with UltraInteract can further improve performance, especially in math and the multi-turn ability.
72
+ <img src="./figures/main_exp.png" alt="stats" style="zoom: 40%;" />
73
+
74
  ## Citation
75
  ```
76
  @misc{yuan2024advancing,