zoeywin committed on
Commit
7c800f1
1 Parent(s): 28bf6be

eval for 1.2

  1. README.md +9 -1
README.md CHANGED
@@ -29,7 +29,15 @@ The current version is **1.2**; congrats to our team on the new release!
 
 Previous versions remain available in the repository. New models will be released for everyone once our team's training and validation process is complete :)
 
-## Prompting
+## Evaluation
+
+| Model | Avg | ARC | HS | MMLU | TQA |
+|-----------------------|--------|-------|-------|--------|-------|
+| **Shining Valiant 1.2** | 74.17 | 72.95 | 87.88 | 70.97 | 64.88 |
+| Llama 2 | 67.35 | 67.32 | 87.33 | 69.83 | 44.92 |
+| Llama 2 Chat | 66.80 | 64.59 | 85.88 | 63.91 | 52.80 |
+
+## Prompting Guide
 Shining Valiant uses the same prompt format as Llama 2 Chat - feel free to use your existing prompts and scripts!
 A few examples of different formats:
 