BramVanroy
/

fietje-2

Text Generation

alignment-handbook

text-generation-inference

Model card Files Files and versions Community

BramVanroy commited on Apr 26

Commit

18ca134

•

1 Parent(s): 18e5cda

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -30,7 +30,7 @@ inference: false
 > [!TIP]
 > 🚀 Looking for the fast GGUF version? You can find it, and how to use it with `ollama` (command line) or LM Studio (interface), [here](https://huggingface.co/BramVanroy/fietje-2b-GGUF).
-This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
 ## Model description

 > [!TIP]
 > 🚀 Looking for the fast GGUF version? You can find it, and how to use it with `ollama` (command line) or LM Studio (interface), [here](https://huggingface.co/BramVanroy/fietje-2b-GGUF).
+This model is an adapted version of [microsoft/phi-2](https://huggingface.co/microsoft/phi-2), finetuned for Dutch text generation. It was continue-pretrained on 28B Dutch tokens, which includes the full Dutch component of Wikipedia (around 15%)) and supplemented with Dutch tokens from CulturaX. A newer version of this dataset can be found [here](https://huggingface.co/datasets/BramVanroy/wikipedia_culturax_dutch), which also describes the filtering that took place.
 ## Model description