Update README.md
Browse files
README.md
CHANGED
@@ -16,11 +16,11 @@ Polyglot-Ko is a series of large-scale Korean autoregressive language models mad
|
|
16 |
|----------------------|----------------------------------------------------------------------------------------------------------------------------------------|
|
17 |
| \\(n_{parameters}\\) | 3,809,974,272 |
|
18 |
| \\(n_{layers}\\) | 32 |
|
19 |
-
| \\(d_{model}\\) |
|
20 |
| \\(d_{ff}\\) | 12,288 |
|
21 |
| \\(n_{heads}\\) | 24 |
|
22 |
| \\(d_{head}\\) | 128 |
|
23 |
-
| \\(n_{ctx}\\) |
|
24 |
| \\(n_{vocab}\\) | 30,003 / 30,080 |
|
25 |
| Positional Encoding | [Rotary Position Embedding (RoPE)](https://arxiv.org/abs/2104.09864) |
|
26 |
| RoPE Dimensions | [64](https://github.com/kingoflolz/mesh-transformer-jax/blob/f2aa66e0925de6593dcbb70e72399b97b4130482/mesh_transformer/layers.py#L223) |
|
|
|
16 |
|----------------------|----------------------------------------------------------------------------------------------------------------------------------------|
|
17 |
| \\(n_{parameters}\\) | 3,809,974,272 |
|
18 |
| \\(n_{layers}\\) | 32 |
|
19 |
+
| \\(d_{model}\\) | 3,072 |
|
20 |
| \\(d_{ff}\\) | 12,288 |
|
21 |
| \\(n_{heads}\\) | 24 |
|
22 |
| \\(d_{head}\\) | 128 |
|
23 |
+
| \\(n_{ctx}\\) | 2,048 |
|
24 |
| \\(n_{vocab}\\) | 30,003 / 30,080 |
|
25 |
| Positional Encoding | [Rotary Position Embedding (RoPE)](https://arxiv.org/abs/2104.09864) |
|
26 |
| RoPE Dimensions | [64](https://github.com/kingoflolz/mesh-transformer-jax/blob/f2aa66e0925de6593dcbb70e72399b97b4130482/mesh_transformer/layers.py#L223) |
|