Update README.md
Browse files
README.md
CHANGED
@@ -45,8 +45,9 @@ language:
|
|
45 |
|
46 |
# Salamandra Model Card
|
47 |
|
48 |
-
Salamandra
|
49 |
-
|
|
|
50 |
|
51 |
To visit the model cards of other Salamandra versions, please refer to the [Model Index](#model-index).
|
52 |
|
@@ -59,7 +60,7 @@ Along with the open weights, all training scripts and configuration files are ma
|
|
59 |
|
60 |
### Description
|
61 |
|
62 |
-
Transformer-based decoder-only language model that has been pre-trained on 7.8 trillion tokens of highly curated data.
|
63 |
The pre-training corpus contains text in 35 European languages and code.
|
64 |
|
65 |
### Hyperparameters
|
|
|
45 |
|
46 |
# Salamandra Model Card
|
47 |
|
48 |
+
Salamandra is a highly multilingual model pre-trained from scratch that comes in three different
|
49 |
+
sizes — 2B, 7B and 40B parameters — with their respective base and instruction-tuned variants.
|
50 |
+
This model card corresponds to the 7B instructed version.
|
51 |
|
52 |
To visit the model cards of other Salamandra versions, please refer to the [Model Index](#model-index).
|
53 |
|
|
|
60 |
|
61 |
### Description
|
62 |
|
63 |
+
Transformer-based decoder-only language model that has been pre-trained from scratch on 7.8 trillion tokens of highly curated data.
|
64 |
The pre-training corpus contains text in 35 European languages and code.
|
65 |
|
66 |
### Hyperparameters
|