VAGOsolutions
/

SauerkrautLM-Mixtral-8x7B-Instruct

Text Generation

Mixture of Experts

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

DavidGF commited on Dec 16, 2023

Commit

ee923ac

·

1 Parent(s): 257dc55

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -55,7 +55,7 @@ Aligned with **DPO**
 SauerkrautLM-Mixtral-8x7B-Instruct was trained with mix of German data augmentation and translated data.
 Aligned through **DPO** with our **new German SauerkrautLM-DPO dataset** based on parts of the SFT SauerkrautLM dataset
-as chosen answers and [Sauerkraut-7b-HerO](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-HerO) as rejected answers. Added with additional augmented Parts of the Ultrafeedback Dataset [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized) and [argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo)
 We found, that only a simple translation of training data can lead to unnatural German phrasings.
 Data augmentation techniques were used to grant grammatical, syntactical correctness and a more natural German wording in our training data.

 SauerkrautLM-Mixtral-8x7B-Instruct was trained with mix of German data augmentation and translated data.
 Aligned through **DPO** with our **new German SauerkrautLM-DPO dataset** based on parts of the SFT SauerkrautLM dataset
+as chosen answers and [Sauerkraut-7b-HerO](https://huggingface.co/VAGOsolutions/SauerkrautLM-7b-HerO) as rejected answers. Added with additional **translated Parts of the [HuggingFaceH4/ultrafeedback_binarized](https://huggingface.co/datasets/HuggingFaceH4/ultrafeedback_binarized)** and **[argilla/distilabel-math-preference-dpo](https://huggingface.co/datasets/argilla/distilabel-math-preference-dpo).**
 We found, that only a simple translation of training data can lead to unnatural German phrasings.
 Data augmentation techniques were used to grant grammatical, syntactical correctness and a more natural German wording in our training data.