Vezora/Mistral-29b-Merge-Base

Vezora committed on Nov 3, 2023

Commit 5626e8e · 1 Parent(s): b5c6898

Update README.md

Files changed (1)

README.md +24 -0

@@ -1,3 +1,27 @@
 ---
 license: apache-2.0
 ---
+<!DOCTYPE html>
+<img src="https://imgur.com/a/3HUIVxJ" width="300">
+# Mistral 29b: A New Base Model
+The objective of this model is to serve as a new base model for Mistral 28b.
+This model currently produces incoherent output and needs to be fine-tuned, either with QLoRA (with the adapter attached to every layer) or, better yet, with a full fine-tune.
+## Model Creation
+The model was created by stacking (merging) four models, Dolphin, Zephyr, MetaMath, and Speechless Code, into a single model.
+## Useful Resources
+- Model Merging (MergeKit) (https://github.com/cg123/mergekit)
+## Source Models
+- Dolphin2.1-mistral-7b by Eric Hartford (https://huggingface.co/ehartford/dolphin-2.1-mistral-7b)
+- Zephyr-7b-beta by HuggingFace (https://huggingface.co/HuggingFaceH4/zephyr-7b-beta)
+- MetaMath-Mistral-7B by meta-math (https://huggingface.co/meta-math/MetaMath-Mistral-7B)
+- speechless-code-mistral-7b-v1.0 by uukuguy (https://huggingface.co/uukuguy/speechless-code-mistral-7b-v1.0)
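
The README does not include the merge recipe itself. As a rough illustration only, the sketch below builds a MergeKit-style `passthrough` configuration that stacks the four source models end to end. The model order, the full `[0, 32]` layer range for each model, and the `float16` dtype are all assumptions made for this example and may not match what the author actually used; `build_passthrough_config` and `to_yaml` are hypothetical helper names.

```python
# Hypothetical sketch: model order, layer ranges, and dtype are assumptions,
# not the author's actual merge recipe.
SOURCE_MODELS = [
    "ehartford/dolphin-2.1-mistral-7b",
    "HuggingFaceH4/zephyr-7b-beta",
    "meta-math/MetaMath-Mistral-7B",
    "uukuguy/speechless-code-mistral-7b-v1.0",
]
LAYERS_PER_MODEL = 32  # Mistral-7B has 32 decoder layers


def build_passthrough_config(models, layers_per_model=LAYERS_PER_MODEL):
    """Return a MergeKit-style passthrough config as a plain dict."""
    slices = [
        {"sources": [{"model": name, "layer_range": [0, layers_per_model]}]}
        for name in models
    ]
    return {"slices": slices, "merge_method": "passthrough", "dtype": "float16"}


def to_yaml(config):
    """Render the config as YAML text using only the standard library."""
    lines = ["slices:"]
    for piece in config["slices"]:
        lines.append("  - sources:")
        for src in piece["sources"]:
            start, end = src["layer_range"]
            lines.append(f"      - model: {src['model']}")
            lines.append(f"        layer_range: [{start}, {end}]")
    lines.append(f"merge_method: {config['merge_method']}")
    lines.append(f"dtype: {config['dtype']}")
    return "\n".join(lines) + "\n"


config = build_passthrough_config(SOURCE_MODELS)

# Count how many decoder layers the stacked model would have under these assumptions.
total_layers = sum(
    src["layer_range"][1] - src["layer_range"][0]
    for piece in config["slices"]
    for src in piece["sources"]
)

yaml_text = to_yaml(config)
print(yaml_text)
print(f"stacked decoder layers: {total_layers}")
```

Saving the printed YAML to a file and passing it to MergeKit's `mergekit-yaml` command would be the usual next step. Because a passthrough merge only concatenates layers without retraining them, the result is expected to need fine-tuning before it is useful, which is consistent with the note in the README.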