teknium committed on
Commit
fe31207
1 Parent(s): 61fa08e

Update README.md

Files changed (1)
  1. README.md +4 -2
README.md CHANGED
@@ -1,5 +1,5 @@
 ---
-base_model: upstage/SOLAR-10.7B-v1.0
+base_model: mistralai/Mixtral-8x7B-v0.1
 tags:
 - Mixtral
 - instruct
@@ -18,7 +18,7 @@ language:
 - en
 ---
 
-# Nous Hermes 2 - Solar 10.7B
+# Nous Hermes 2 - Mixtral 8x7B-DPO
 
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6317aade83d8d2fd903192d9/qVRnEDL_BUEWulUvWBD95.png)
 
@@ -28,6 +28,8 @@ Nous Hermes 2 Mixtral 7bx8 DPO is the new flagship Nous Research model trained o
 
 The model was trained on over 1,000,000 entries of primarily GPT-4 generated data, as well as other high quality data from open datasets across the AI landscape, achieving state of the art performance on a variety of tasks.
 
+This is the SFT + DPO version of Mixtral Hermes 2; we will also be providing an SFT-only version so people can find which works best for them.
+
 # Table of Contents
 1. [Example Outputs](#example-outputs)
 2. [Benchmark Results](#benchmark-results)
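
For context, a minimal usage sketch (not part of the commit) showing how the DPO model this card describes could be loaded with the standard transformers API. The repo id is an assumption inferred from the model name; verify it on the Hub before use.

```python
# Minimal sketch, not from the commit: load the model this card describes
# via the standard transformers API. The repo id below is an assumption
# inferred from the model name.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
# device_map="auto" shards the 8x7B weights across available devices
# (requires the accelerate package).
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

prompt = "Write a haiku about mixture-of-experts models."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```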