Commit
•
57f73d8
1
Parent(s):
e39d68d
Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ widget:
|
|
20 |
|
21 |
The concept: 8-bit quantized version of [mGPT-13B](https://huggingface.co/ai-forever/mGPT-13B), an LLM released by AI-Forever / Sberbank AI in 2022-2023.
|
22 |
|
23 |
-
On the GPT scale, it is
|
24 |
|
25 |
My goal is to evaluate this on Hindi and Indonesian tasks, where there are fewer autoregressive language models in this size range.
|
26 |
|
|
|
20 |
|
21 |
The concept: 8-bit quantized version of [mGPT-13B](https://huggingface.co/ai-forever/mGPT-13B), an LLM released by AI-Forever / Sberbank AI in 2022-2023.
|
22 |
|
23 |
+
On the GPT scale, it is between the # of parameters for GPT-2 and GPT-3, but comparison is tricky after training on 60+ languages.
|
24 |
|
25 |
My goal is to evaluate this on Hindi and Indonesian tasks, where there are fewer autoregressive language models in this size range.
|
26 |
|