vistagi
/

Mixtral-8x7b-v0.1-dpo

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

eelxpeng commited on Feb 18

Commit

9eb8bc4

•

1 Parent(s): ebc9bd5

Create README.md

Files changed (1) hide show

README.md +18 -0

README.md ADDED Viewed

	@@ -0,0 +1,18 @@

+---
+license: apache-2.0
+datasets:
+- HuggingFaceH4/ultrafeedback_binarized
+language:
+- en
+---
+# Introduction
+This model vistagi/Mixtral-8x7b-v0.1-sft is trained with Ultrachat-200K dataset through supervised finetuning using Mixtral-8x7b-v0.1 as the baseline model. The training is done with bfloat16 precision using LoRA.
+## Details
+Used Librarys
+- torch
+- deepspeed
+- pytorch lightning
+- transformers
+- peft