vistagi
/

Mixtral-8x7b-v0.1-sft

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

eelxpeng commited on Feb 18

Commit

fb517dd

•

1 Parent(s): 959aafd

Update README.md

Files changed (1) hide show

README.md +14 -1

README.md CHANGED Viewed

@@ -4,4 +4,17 @@ datasets:
 - HuggingFaceH4/ultrachat_200k
 language:
 - en
----

 - HuggingFaceH4/ultrachat_200k
 language:
 - en
+---
+# Introduction
+This model vistagi/Mixtral-8x7b-v0.1-sft is trained with Ultrachat-200K dataset through supervised finetuning using Mixtral-8x7b-v0.1 as the baseline model.
+The training is done with bfloat16 precision using LoRA.
+## Details
+Used Librarys
+- torch
+- deepspeed
+- pytorch lightning
+- transformers
+- peft