Update README.md
README.md
@@ -46,6 +46,8 @@ The Tamil LLaMA models have been enhanced and tailored specifically with an exte
 
 Benchmarking was done using [LLM-Autoeval](https://github.com/mlabonne/llm-autoeval) on an RTX 3090 on [runpod](https://www.runpod.io/).
 
+> **Note:** Discrepancies have been observed between the Open LLM Leaderboard scores and those obtained from local runs using the LM Eval Harness with identical configurations. The results reported here are based on our own benchmarking. To replicate these findings, you can use LLM-Autoeval or run [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) locally with the configurations described on the Open LLM Leaderboard's About page.
+
 | Benchmark | Llama 2 Chat | Tamil Llama v0.2 Instruct | Telugu Llama Instruct | Malayalam Llama Instruct |
 |---------------|--------------|---------------------------|-----------------------|--------------------------|
 | ARC Challenge (25-shot) | 52.9 | **53.75** | 52.47 | 52.82 |
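For reference, a minimal sketch of the local replication path the added note describes, using lm-evaluation-harness's Python API (v0.4+). The model ID and batch size below are assumptions for illustration, not the exact configuration used for the scores above:

```python
# Sketch: reproduce an ARC Challenge (25-shot) run locally with
# lm-evaluation-harness, matching the Open LLM Leaderboard setup.
import lm_eval

results = lm_eval.simple_evaluate(
    model="hf",  # Hugging Face transformers backend
    # Assumed model ID for the Tamil Llama v0.2 Instruct column:
    model_args="pretrained=abhinand/tamil-llama-7b-instruct-v0.2",
    tasks=["arc_challenge"],
    num_fewshot=25,  # 25-shot, per the Open LLM Leaderboard's About page
    batch_size=8,    # assumption; tune to fit your GPU
)

# Per-task metrics (e.g. acc / acc_norm) for comparison with the table.
print(results["results"]["arc_challenge"])
```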