RASMUS commited on
Commit
475f7cb
1 Parent(s): 8d0cd59

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +16 -10
README.md CHANGED
@@ -1,19 +1,25 @@
1
  ---
2
  library_name: transformers
3
- language:
4
- - fi
 
 
 
 
5
  ---
6
- ## Model description
7
 
 
8
 
9
- This is an early v0.1 version release of our Instruct finetuned model from https://huggingface.co/Finnish-NLP/llama-7b-finnish
10
- Model was trained for 2 epochs using 11014 samples and for this release we chose checkpoint at 2500/4048 steps.
11
 
12
- For finetuning we used mix of following datasources:
13
- LIMA from https://github.com/TurkuNLP/finnish-instructions
14
- Dolly from https://github.com/TurkuNLP/finnish-instructions
15
- OASST from https://github.com/TurkuNLP/finnish-instructions
16
- Heavily filtered version of Ultrachat + deepl translations from https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized/viewer/default/train_sft
 
 
 
 
17
 
18
 
19
 
 
1
  ---
2
  library_name: transformers
3
+ license: apache-2.0
4
+ tags:
5
+ - finnish
6
+ - llama
7
+ inference: true
8
+ pipeline_tag: text-generation
9
  ---
 
10
 
11
+ # Llama-7b-instruct-v0.1 for Finnish
12
 
 
 
13
 
14
+ This is an early v0.1 version release of our Instruct finetuned model from https://huggingface.co/Finnish-NLP/llama-7b-finnish \
15
+ Model was trained for 2 epochs using 11014 samples and for this release we chose checkpoint at 2500/4048 steps.
16
+
17
+ For finetuning we used mix of the following datasets: \
18
+ LIMA from https://github.com/TurkuNLP/finnish-instructions \
19
+ Dolly from https://github.com/TurkuNLP/finnish-instructions \
20
+ OASST from https://github.com/TurkuNLP/finnish-instructions \
21
+ Heavily filtered version of Ultrachat https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized/viewer/default/train_sft + deepl translations by writing \
22
+ samples to file and uploading to deepl.com to filetranslation and then parsinig the translated files back to samples
23
 
24
 
25