Update README.md
Browse files
README.md
CHANGED
@@ -1,19 +1,25 @@
|
|
1 |
---
|
2 |
library_name: transformers
|
3 |
-
|
4 |
-
|
|
|
|
|
|
|
|
|
5 |
---
|
6 |
-
## Model description
|
7 |
|
|
|
8 |
|
9 |
-
This is an early v0.1 version release of our Instruct finetuned model from https://huggingface.co/Finnish-NLP/llama-7b-finnish
|
10 |
-
Model was trained for 2 epochs using 11014 samples and for this release we chose checkpoint at 2500/4048 steps.
|
11 |
|
12 |
-
|
13 |
-
|
14 |
-
|
15 |
-
|
16 |
-
|
|
|
|
|
|
|
|
|
17 |
|
18 |
|
19 |
|
|
|
1 |
---
|
2 |
library_name: transformers
|
3 |
+
license: apache-2.0
|
4 |
+
tags:
|
5 |
+
- finnish
|
6 |
+
- llama
|
7 |
+
inference: true
|
8 |
+
pipeline_tag: text-generation
|
9 |
---
|
|
|
10 |
|
11 |
+
# Llama-7b-instruct-v0.1 for Finnish
|
12 |
|
|
|
|
|
13 |
|
14 |
+
This is an early v0.1 version release of our Instruct finetuned model from https://huggingface.co/Finnish-NLP/llama-7b-finnish \
|
15 |
+
Model was trained for 2 epochs using 11014 samples and for this release we chose checkpoint at 2500/4048 steps.
|
16 |
+
|
17 |
+
For finetuning we used mix of the following datasets: \
|
18 |
+
LIMA from https://github.com/TurkuNLP/finnish-instructions \
|
19 |
+
Dolly from https://github.com/TurkuNLP/finnish-instructions \
|
20 |
+
OASST from https://github.com/TurkuNLP/finnish-instructions \
|
21 |
+
Heavily filtered version of Ultrachat https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized/viewer/default/train_sft + deepl translations by writing \
|
22 |
+
samples to file and uploading to deepl.com to filetranslation and then parsinig the translated files back to samples
|
23 |
|
24 |
|
25 |
|