ptrdvn commited on
Commit
06e7cf7
β€’
1 Parent(s): 65972c2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -0
README.md CHANGED
@@ -62,11 +62,13 @@ We achieve the following MT-Bench scores across 6 languages:
62
  | **Russian** πŸ‡·πŸ‡Ί | NaN | 8.19 | 8.28 | 7.94 |
63
  | **Chinese** πŸ‡¨πŸ‡³ | NaN | 7.11 | 6.97 | 7.55 |
64
  | **English** πŸ‡ΊπŸ‡Έ | 7.98 | 7.73 | 7.92 | 8.26 |
 
65
 
66
  We observe minimal degredation of Llama 3's English ability while achieving best-in-class multilingual abilities compared to the top rated 7B model ([Nexusflow/Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)) on the [Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).
67
 
68
  [Here is our evaluation script.](https://drive.google.com/file/d/15HPn7452t8LbTD9HKSl7ngYYWnsoOG08/view?usp=sharing)
69
 
 
70
  # Training data
71
 
72
  We train on three sources of data to create this model:
 
62
  | **Russian** πŸ‡·πŸ‡Ί | NaN | 8.19 | 8.28 | 7.94 |
63
  | **Chinese** πŸ‡¨πŸ‡³ | NaN | 7.11 | 6.97 | 7.55 |
64
  | **English** πŸ‡ΊπŸ‡Έ | 7.98 | 7.73 | 7.92 | 8.26 |
65
+ (Note the Russian scores exclude code, reasoning and math problems due to not having any translated reference answers for these questions.)
66
 
67
  We observe minimal degredation of Llama 3's English ability while achieving best-in-class multilingual abilities compared to the top rated 7B model ([Nexusflow/Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)) on the [Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).
68
 
69
  [Here is our evaluation script.](https://drive.google.com/file/d/15HPn7452t8LbTD9HKSl7ngYYWnsoOG08/view?usp=sharing)
70
 
71
+
72
  # Training data
73
 
74
  We train on three sources of data to create this model: