Update README.md
Browse files
README.md
CHANGED
@@ -25,7 +25,7 @@ language:
|
|
25 |
|
26 |
This model is a <b>causal</b> language model for the <b>Italian</b> language, based on a GPT-like <b>[1]</b> architecture (more specifically, the model has been obtained by modifying Meta's XGLM architecture <b>[2]</b> and exploiting its 564M checkpoint).
|
27 |
|
28 |
-
It is a foundation model, pre-trained for causal language modeling, so it is mainly suitable for basic natural language generation, and you will have to fine-tune it in order to use it on more specific downstream tasks.
|
29 |
|
30 |
<h3>Quick usage</h3>
|
31 |
|
|
|
25 |
|
26 |
This model is a <b>causal</b> language model for the <b>Italian</b> language, based on a GPT-like <b>[1]</b> architecture (more specifically, the model has been obtained by modifying Meta's XGLM architecture <b>[2]</b> and exploiting its 564M checkpoint).
|
27 |
|
28 |
+
The model has ~354M parameters and a vocabulary of 50.335 tokens. It is a foundation model, pre-trained for causal language modeling, so it is mainly suitable for basic natural language generation, and you will have to fine-tune it in order to use it on more specific downstream tasks.
|
29 |
|
30 |
<h3>Quick usage</h3>
|
31 |
|