ptrdvn committed · Commit 06e7cf7 (verified) · 1 Parent(s): 65972c2

Update README.md

Files changed (1)
  1. README.md +2 -0
README.md CHANGED
@@ -62,11 +62,13 @@ We achieve the following MT-Bench scores across 6 languages:
  | **Russian** 🇷🇺 | NaN | 8.19 | 8.28 | 7.94 |
  | **Chinese** 🇨🇳 | NaN | 7.11 | 6.97 | 7.55 |
  | **English** 🇺🇸 | 7.98 | 7.73 | 7.92 | 8.26 |
+ (Note: the Russian scores exclude code, reasoning, and math problems because we do not have translated reference answers for these questions.)
 
  We observe minimal degradation of Llama 3's English ability while achieving best-in-class multilingual abilities compared to the top-rated 7B model ([Nexusflow/Starling-LM-7B-beta](https://huggingface.co/Nexusflow/Starling-LM-7B-beta)) on the [Chatbot Arena Leaderboard](https://chat.lmsys.org/?leaderboard).
 
  [Here is our evaluation script.](https://drive.google.com/file/d/15HPn7452t8LbTD9HKSl7ngYYWnsoOG08/view?usp=sharing)
 
+
  # Training data
 
  We train on three sources of data to create this model: