Ludwig Stumpp commited on
Commit
a011af1
1 Parent(s): 12a4ec3

Add stablelm results on HellaSwag and WinoGrande zero-shot

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -55,8 +55,8 @@ https://huggingface.co/spaces/ludwigstumpp/llm-leaderboard
55
  | [palm-2-l](https://ai.google/static/documents/palm2techreport.pdf) | Google Research | no | | | | [0.868](https://ai.google/static/documents/palm2techreport.pdf) | | | [0.869](https://ai.google/static/documents/palm2techreport.pdf) | | | | [0.861](https://ai.google/static/documents/palm2techreport.pdf) | | [0.830](https://ai.google/static/documents/palm2techreport.pdf) | |
56
  | [palm-2-l-instruct](https://ai.google/static/documents/palm2techreport.pdf) | Google Research | no | | | | | | | | | | | | | | [0.909](https://ai.google/static/documents/palm2techreport.pdf) |
57
  | [replit-code-v1-3b](https://huggingface.co/replit/replit-code-v1-3b) | Replit | yes | | | | | [0.219](https://twitter.com/amasad/status/1651019556423598081/photo/2) | | | | | | | | | |
58
- | [stablelm-base-alpha-7b](https://huggingface.co/stabilityai/stablelm-base-alpha-7b) | Stability AI | yes | | | [0.533](https://www.mosaicml.com/blog/mpt-7b) | | | [0.533](https://www.mosaicml.com/blog/mpt-7b) | | [0.251](https://www.mosaicml.com/blog/mpt-7b) | | [0.049](https://www.mosaicml.com/blog/mpt-7b) | | | | |
59
- | [stablelm-tuned-alpha-7b](https://huggingface.co/stabilityai/stablelm-tuned-alpha-7b) | Stability AI | no | [858](https://lmsys.org/blog/2023-05-03-arena/) | | | | | | | | | | | | | |
60
  | [starcoder-base-16b](https://huggingface.co/bigcode/starcoderbase) | BigCode | yes | | | | | [0.304](https://drive.google.com/file/d/1cN-b9GnWtHzQRoE7M7gAEyivY0kl4BYs/view) | | | | | | | | | |
61
  | [starcoder-16b](https://huggingface.co/bigcode/starcoder) | BigCode | yes | | | | | [0.336](https://drive.google.com/file/d/1cN-b9GnWtHzQRoE7M7gAEyivY0kl4BYs/view) | | | | | | | | | |
62
  | [vicuna-13b](https://huggingface.co/lmsys/vicuna-13b-delta-v0) | Lmsys.org | no | [1169](https://lmsys.org/blog/2023-05-03-arena/) | | | | | | | | | | | | | |
 
55
  | [palm-2-l](https://ai.google/static/documents/palm2techreport.pdf) | Google Research | no | | | | [0.868](https://ai.google/static/documents/palm2techreport.pdf) | | | [0.869](https://ai.google/static/documents/palm2techreport.pdf) | | | | [0.861](https://ai.google/static/documents/palm2techreport.pdf) | | [0.830](https://ai.google/static/documents/palm2techreport.pdf) | |
56
  | [palm-2-l-instruct](https://ai.google/static/documents/palm2techreport.pdf) | Google Research | no | | | | | | | | | | | | | | [0.909](https://ai.google/static/documents/palm2techreport.pdf) |
57
  | [replit-code-v1-3b](https://huggingface.co/replit/replit-code-v1-3b) | Replit | yes | | | | | [0.219](https://twitter.com/amasad/status/1651019556423598081/photo/2) | | | | | | | | | |
58
+ | [stablelm-base-alpha-7b](https://huggingface.co/stabilityai/stablelm-base-alpha-7b) | Stability AI | yes | | | [0.412](https://gpt4all.io/reports/GPT4All_Technical_Report_3.pdf) | | | [0.533](https://www.mosaicml.com/blog/mpt-7b) | | [0.251](https://www.mosaicml.com/blog/mpt-7b) | | [0.049](https://www.mosaicml.com/blog/mpt-7b) | | [0.501](https://gpt4all.io/reports/GPT4All_Technical_Report_3.pdf) | | |
59
+ | [stablelm-tuned-alpha-7b](https://huggingface.co/stabilityai/stablelm-tuned-alpha-7b) | Stability AI | no | [858](https://lmsys.org/blog/2023-05-03-arena/) | | [0.536](https://gpt4all.io/reports/GPT4All_Technical_Report_3.pdf) | | | | | | | | | [0.548](https://gpt4all.io/reports/GPT4All_Technical_Report_3.pdf) | | |
60
  | [starcoder-base-16b](https://huggingface.co/bigcode/starcoderbase) | BigCode | yes | | | | | [0.304](https://drive.google.com/file/d/1cN-b9GnWtHzQRoE7M7gAEyivY0kl4BYs/view) | | | | | | | | | |
61
  | [starcoder-16b](https://huggingface.co/bigcode/starcoder) | BigCode | yes | | | | | [0.336](https://drive.google.com/file/d/1cN-b9GnWtHzQRoE7M7gAEyivY0kl4BYs/view) | | | | | | | | | |
62
  | [vicuna-13b](https://huggingface.co/lmsys/vicuna-13b-delta-v0) | Lmsys.org | no | [1169](https://lmsys.org/blog/2023-05-03-arena/) | | | | | | | | | | | | | |