Ludwig Stumpp
# llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
Visit the interactive leaderboard at TODO.
### Leaderboard
| Model Name | [Chatbot Arena Elo (lmsys)](https://lmsys.org/blog/2023-05-03-arena/) |
| --------------------------------------------------------------------------------------------------------------------- | ---------------------------------------------------------------------- |
| [alpaca-13b](https://crfm.stanford.edu/2023/03/13/alpaca.html) | 1008 |
| [chatglm-6b](https://chatglm.cn/blog) | 985 |
| [dolly-v2-12b](https://www.databricks.com/blog/2023/04/12/dolly-first-open-commercially-viable-instruction-tuned-llm) | 944 |
| [fastchat-t5-3b](https://huggingface.co/lmsys/fastchat-t5-3b-v1.0) | 951 |
| [koala-13b](https://bair.berkeley.edu/blog/2023/04/03/koala/) | 1082 |
| [llama-13b](https://ai.facebook.com/blog/large-language-model-llama-meta-ai/) | 932 |
| [stablelm-tuned-alpha-7b](https://github.com/stability-AI/stableLM) | 858 |
| [vicuna-13b](https://lmsys.org/blog/2023-03-30-vicuna/) | 1169 |
| [oasst-pythia-12b](https://open-assistant.io/) | 1065 |
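
To help read the Elo column, here is a minimal Python sketch that ranks the models above and converts a rating gap into an expected win rate. It assumes the standard Elo expected-score formula (base 10, scale 400) and ignores ties; whether Chatbot Arena computes its ratings in exactly this way is an assumption, and `expected_win_probability` is a hypothetical helper name, not part of this repository.

```python
# Elo ratings copied from the leaderboard table above.
ratings = {
    "alpaca-13b": 1008,
    "chatglm-6b": 985,
    "dolly-v2-12b": 944,
    "fastchat-t5-3b": 951,
    "koala-13b": 1082,
    "llama-13b": 932,
    "stablelm-tuned-alpha-7b": 858,
    "vicuna-13b": 1169,
    "oasst-pythia-12b": 1065,
}


def expected_win_probability(rating_a, rating_b):
    """Standard Elo expected score of A against B (assumed: base 10, scale 400)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))


# Sort models from highest to lowest rating.
ranked = sorted(ratings.items(), key=lambda item: item[1], reverse=True)
for name, elo in ranked:
    print(f"{name:<24} {elo}")

# Expected win rate of the top-rated model against the runner-up.
p = expected_win_probability(ratings["vicuna-13b"], ratings["koala-13b"])
print(f"vicuna-13b vs koala-13b: {p:.2f}")
```

Under this formula, two models with equal ratings have an expected win rate of 0.5, and a larger rating gap pushes the expected win rate of the higher-rated model toward 1.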