Text Generation
Transformers
PyTorch
English
llama
Eval Results
text-generation-inference
Inference Endpoints
Pankaj Mathur commited on
Commit
965ed2a
1 Parent(s): 36aabdb

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -26,11 +26,11 @@ Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](htt
26
  |||||
27
  |:------:|:-------------:|:-------------:|:---------:|
28
  |**Task**|**Metric**|**Value**|**Stderr**|
29
- |*arc_challenge*|acc_norm|0.5572|0.0145|
30
- |*hellaswag*|acc_norm|0.7964|0.0040|
31
  |*mmlu*|acc_norm|0.4969|0.035|
32
- |*truthfulqa_mc*|mc2|0.5231|0.0158|
33
- |*Total Average*|acc_norm|0.5933|0.0114|
34
 
35
 
36
 
 
26
  |||||
27
  |:------:|:-------------:|:-------------:|:---------:|
28
  |**Task**|**Metric**|**Value**|**Stderr**|
29
+ |*arc_challenge*|acc_norm|0.5478|0.0145|
30
+ |*hellaswag*|acc_norm|0.7023|0.0040|
31
  |*mmlu*|acc_norm|0.4969|0.035|
32
+ |*truthfulqa_mc*|mc2|0.44|0.0158|
33
+ |*Total Average*|acc_norm|0.54675|0.0114|
34
 
35
 
36