BramVanroy
commited on
Commit
•
18ca134
1
Parent(s):
18e5cda
Update README.md
Browse files
README.md
CHANGED
@@ -30,7 +30,7 @@ inference: false
|
|
30 |
> [!TIP]
|
31 |
> 🚀 Looking for the fast GGUF version? You can find it, and how to use it with `ollama` (command line) or LM Studio (interface), [here](https://huggingface.co/BramVanroy/fietje-2b-GGUF).
|
32 |
|
33 |
-
This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
|
34 |
|
35 |
## Model description
|
36 |
|
|
|
30 |
> [!TIP]
|
31 |
> 🚀 Looking for the fast GGUF version? You can find it, and how to use it with `ollama` (command line) or LM Studio (interface), [here](https://huggingface.co/BramVanroy/fietje-2b-GGUF).
|
32 |
|
33 |
+
This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia (around 15%)) and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
|
34 |
|
35 |
## Model description
|
36 |
|