Update README.md
Browse files
README.md
CHANGED
@@ -11,7 +11,7 @@ datasets:
|
|
11 |
# Cedille AI
|
12 |
Cedille is a project to bring large language models to non-English languages.
|
13 |
|
14 |
-
##
|
15 |
Anna is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) codebase.
|
16 |
|
17 |
Anna was trained on German text with a similar methodology to [Boris](https://huggingface.co/Cedille/fr-boris), our French model. We started training from GPT-J, which has been trained on [The Pile](https://pile.eleuther.ai/). As a consequence the model still has good performance in English language. Anna makes use of the unmodified GPT-2 tokenizer.
|
|
|
11 |
# Cedille AI
|
12 |
Cedille is a project to bring large language models to non-English languages.
|
13 |
|
14 |
+
## de-anna
|
15 |
Anna is a 6B parameter autoregressive language model based on the GPT-J architecture and trained using the [mesh-transformer-jax](https://github.com/kingoflolz/mesh-transformer-jax) codebase.
|
16 |
|
17 |
Anna was trained on German text with a similar methodology to [Boris](https://huggingface.co/Cedille/fr-boris), our French model. We started training from GPT-J, which has been trained on [The Pile](https://pile.eleuther.ai/). As a consequence the model still has good performance in English language. Anna makes use of the unmodified GPT-2 tokenizer.
|