Edit model card
YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Quantization made by Richard Erkhov.

Github

Discord

Request more models

mistral-7b-zephyr-sft - bnb 4bits

Original model description:

license: mit library_name: transformers datasets: - HuggingFaceH4/deita-10k-v0-sft base_model: mistralai/Mistral-7B-v0.1

Visualize in Weights & Biases

Mistral 7B Zephyr SFT V2

The Zephyr SFT recipe applied on top of Mistral 7B (new recipe with chatML format)

Model description

  • Model type: A 7.2B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
  • Language(s) (NLP): Primarily English
  • Finetuned from model: mistralai/Mistral-7B-v0.1

Recipe

We trained using the alignment handbook recipe and logging to W&B

Visit the W&B workspace here

Compute provided by Lambda Labs - 8xA100 80GB node

Downloads last month
1
Safetensors
Model size
3.86B params
Tensor type
F32
FP16
U8
Inference API
Input a message to start chatting with RichardErkhov/wandb_-_mistral-7b-zephyr-sft-4bits.
This model can be loaded on Inference API (serverless).