Update README.md
Browse files
README.md
CHANGED
@@ -12,7 +12,7 @@ dataset: HuggingFaceFW/fineweb-edu
|
|
12 |
|
13 |
<!-- Provide a quick summary of what the model is/does. -->
|
14 |
|
15 |
-
This is a Dual-Attention Transformer Language Model, trained on the `fineweb-edu` dataset. The model is
|
16 |
|
17 |
|
18 |
## Model Details
|
|
|
12 |
|
13 |
<!-- Provide a quick summary of what the model is/does. -->
|
14 |
|
15 |
+
This is a Dual-Attention Transformer Language Model, trained on the `fineweb-edu` dataset. The model is 1.27B parameters.
|
16 |
|
17 |
|
18 |
## Model Details
|