# Llama-7b-instruct-v0.1 for Finnish

- This is an early v0.1 release of our instruct-finetuned model based on https://huggingface.co/Finnish-NLP/llama-7b-finnish
- The model was trained for 2 epochs on 11014 samples; for this release we chose the checkpoint at step 2500/4048.
- Future DPO/SFT+DPO variants are in the pipeline.

For finetuning we used a mix of the following datasets:
- LIMA from https://github.com/TurkuNLP/finnish-instructions
- Dolly from https://github.com/TurkuNLP/finnish-instructions
- OASST from https://github.com/TurkuNLP/finnish-instructions
- A heavily filtered version of Ultrachat (https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized/viewer/default/train_sft) plus DeepL translations, produced by writing samples to a file, uploading it to deepl.com for file translation, and then parsing the translated files back into samples
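The DeepL round-trip described above (write samples to a file, translate the file, parse it back) can be sketched as follows. This is a minimal illustration, not the authors' actual script: the separator string and function names are hypothetical, chosen only so that each sample survives the file round-trip intact.

```python
# Hypothetical sketch of the file-based translation round-trip.
# SEP is an assumed delimiter; any marker the translator leaves
# untouched would work the same way.
SEP = "\n### SAMPLE ###\n"

def write_samples(samples, path):
    """Serialize samples into one delimiter-separated text file
    suitable for uploading to a file-translation service."""
    with open(path, "w", encoding="utf-8") as f:
        f.write(SEP.join(samples))

def read_samples(path):
    """Parse a (translated) file back into individual samples
    by splitting on the same delimiter."""
    with open(path, encoding="utf-8") as f:
        return f.read().split(SEP)
```

For example, writing a list of samples and reading the file straight back returns the original list, which is the property the parsing step relies on after translation.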