BramVanroy
commited on
Commit
·
6d18a4e
1
Parent(s):
87e000e
Update README.md
Browse files
README.md
CHANGED
@@ -27,10 +27,30 @@ This model is intended as a (poor) baseline for Dutch generative LLMs. It by no
|
|
27 |
Trained on the synthetic [BramVanroy/alpaca-dolly-dutch](https://huggingface.co/datasets/BramVanroy/alpaca-dolly-dutch) instruction dataset.
|
28 |
Therefore, commercial use of this model is forbidden. The model is intended for research purposes only.
|
29 |
|
|
|
|
|
|
|
30 |
## Training procedure
|
31 |
|
32 |
Trained with LoRA and merged before upload. The adapters are in the `adapters` branch.
|
33 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
34 |
### Training hyperparameters
|
35 |
|
36 |
The following hyperparameters were used during training:
|
|
|
27 |
Trained on the synthetic [BramVanroy/alpaca-dolly-dutch](https://huggingface.co/datasets/BramVanroy/alpaca-dolly-dutch) instruction dataset.
|
28 |
Therefore, commercial use of this model is forbidden. The model is intended for research purposes only.
|
29 |
|
30 |
+
- [Dolly 15k](https://huggingface.co/datasets/BramVanroy/dolly-15k-dutch) (translated to Dutch)
|
31 |
+
- [Alpaca cleaned](https://huggingface.co/datasets/BramVanroy/alpaca-cleaned-dutch) (translated to Dutch)
|
32 |
+
|
33 |
## Training procedure
|
34 |
|
35 |
Trained with LoRA and merged before upload. The adapters are in the `adapters` branch.
|
36 |
|
37 |
+
Prompt template (where the input is optional and can be left out):
|
38 |
+
|
39 |
+
```
|
40 |
+
Hieronder staat een instructie `Instruction` die een taak beschrijft, gecombineerd met een invoer `Input` die verdere context biedt. Schrijf een antwoord na `Response:` dat het verzoek op de juiste manier voltooit of beantwoordt.
|
41 |
+
|
42 |
+
### Instruction:
|
43 |
+
{instruction}
|
44 |
+
|
45 |
+
### Input:
|
46 |
+
{input}
|
47 |
+
|
48 |
+
### Response:
|
49 |
+
{response}
|
50 |
+
```
|
51 |
+
|
52 |
+
The loss was only calculated on the response prediction.
|
53 |
+
|
54 |
### Training hyperparameters
|
55 |
|
56 |
The following hyperparameters were used during training:
|