Commit 26d1bc5 by bleysg
1 parent: fc8b1b0

Update README.md

Files changed (1)
  1. README.md +1 -2
README.md CHANGED
@@ -88,8 +88,7 @@ Our average performance for BigBench-Hard: 0.488
 
  Average for AGIEval: 0.447
 
- In the Orca paper, they measured their score relative to Vicuna on these evals.
- We have done the same and have found our score averages to **~103%** of the total performance that was shown in the Orca paper, using the same evaluation methods as outlined in the paper.
+ We find our score averages to **~103%** of the total performance that was shown in the Orca paper, using the same evaluation methods as outlined in the paper.
 
  So we are surpassing Orca performance with <20% of the dataset size and <1/10th the training budget!
 