Commit 18ca134 by BramVanroy (1 parent: 18e5cda)

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -30,7 +30,7 @@ inference: false
  > [!TIP]
  > 🚀 Looking for the fast GGUF version? You can find it, and how to use it with `ollama` (command line) or LM Studio (interface), [here](https://huggingface.co/BramVanroy/fietje-2b-GGUF).
 
- This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
+ This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia (around 15%)) and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
 
  ## Model description
 