Wukong-0.1-Mistral-7B-v0.2

Join Our Discord! https://discord.gg/cognitivecomputations

Wukong-0.1-Mistral-7B-v0.2 is a dealigned chat finetune of the original fantastic Mistral-7B-v0.2 model by the Mistral team.

This model was trained on the teknium OpenHermes-2.5 dataset, code datasets from Multimodal Art Projection (https://m-a-p.ai), and the Dolphin dataset from Cognitive Computations (https://erichartford.com/dolphin) 🐬

This model was trained for 3 epochs on four 4090s.
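To try the model locally, something like the following sketch may work. It assumes the `transformers`, `torch`, and `accelerate` packages are installed and that the weights live under the repository ID shown on this page. The card does not document a prompt format, so the prompt is passed as a plain string; the `generate` helper and its `max_new_tokens` default are illustrative choices, not settings from the authors.

```python
# Minimal loading sketch (assumptions: transformers, torch, and accelerate installed).
MODEL_ID = "RESMPDEV/Wukong-0.1-Mistral-7B-v0.2"  # repository ID as shown on the model page


def generate(prompt: str, max_new_tokens: int = 128) -> str:
    """Load the model in BF16 and return a completion for `prompt`.

    Hypothetical helper: the prompt is sent as plain text because the card
    does not state which chat template the model expects.
    """
    # Imported lazily so this file can be imported without the heavy dependencies.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID,
        torch_dtype=torch.bfloat16,  # matches the BF16 tensor type of the checkpoint
        device_map="auto",           # requires accelerate; places weights on available devices
    )

    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the full checkpoint on first use):
# print(generate("Explain what a fine-tune is in one sentence."))
```

The helper reloads the model on every call to keep the sketch short; for repeated use, load the tokenizer and model once and reuse them.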

Example Outputs

TBD


Safetensors: model size 7.24B params, tensor type BF16
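As a rough guide to what these numbers mean for hardware, the weight footprint can be estimated from the parameter count and the bytes per parameter. The sketch below covers the weights only; activations, KV cache, and framework overhead are not included, and the function name is made up for illustration.

```python
# Back-of-the-envelope weight memory estimate (weights only, no runtime overhead).
PARAMS = 7.24e9  # parameter count reported for the checkpoint

# Storage size per parameter for common tensor types.
BYTES_PER_PARAM = {"bf16": 2, "fp32": 4}


def weight_bytes(n_params: float, dtype: str = "bf16") -> float:
    """Return the number of bytes needed to store n_params in the given dtype."""
    return n_params * BYTES_PER_PARAM[dtype]


bf16_gb = weight_bytes(PARAMS) / 1e9     # decimal gigabytes
bf16_gib = weight_bytes(PARAMS) / 2**30  # binary gibibytes

print(f"BF16 weights: {bf16_gb:.2f} GB ({bf16_gib:.2f} GiB)")
```

At 2 bytes per parameter this comes to about 14.5 GB, so the BF16 checkpoint needs at least that much accelerator or host memory before any generation overhead.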


RESMPDEV/Wukong-0.1-Mistral-7B-v0.2
