doberst commited on
Commit
5ddbd55
1 Parent(s): 9075c24

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -3
README.md CHANGED
@@ -21,15 +21,14 @@ Average of 2 Test Runs with 1 point for correct answer, 0.5 point for partial co
21
  --**Accuracy Score**: **82.25** correct out of 100
22
  --Not Found Classification: 40.0%
23
  --Boolean: 61.25%
24
- --Math/Logic: 8.75%
25
  --Complex Questions (1-5): 1 (Low)
26
  --Summarization Quality (1-5): 2 (Coherent, extractive)
27
  --Hallucinations: No hallucinations observed in test runs.
28
 
29
  For test run results (and good indicator of target use cases), please see the files ("core_rag_test" and "answer_sheet" in this repo).
30
 
31
- --As a reference point, this model shows substantial improvements in results, compared with the BLING 1.0B Pythia, with fine-tuning and the base training substantially the same.
32
- --The model's ability to follow instructions and answer detailed questions improves dramatically from 1.0B -> 1.4B parameters.
33
 
34
 
35
  ### Model Description
 
21
  --**Accuracy Score**: **82.25** correct out of 100
22
  --Not Found Classification: 40.0%
23
  --Boolean: 61.25%
24
+ --Math/Logic: 8.75%
25
  --Complex Questions (1-5): 1 (Low)
26
  --Summarization Quality (1-5): 2 (Coherent, extractive)
27
  --Hallucinations: No hallucinations observed in test runs.
28
 
29
  For test run results (and good indicator of target use cases), please see the files ("core_rag_test" and "answer_sheet" in this repo).
30
 
31
+ --As a reference point, this model shows substantial improvements in results, compared with the BLING 1.0B Pythia, with fine-tuning and the base training substantially the same. The model's ability to follow instructions and answer detailed questions improves dramatically from 1.0B -> 1.4B parameters.
 
32
 
33
 
34
  ### Model Description