RichardErkhov
/

wandb_-_mistral-7b-zephyr-sft-4bits

Text Generation

Inference Endpoints

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Edit model card

YAML Metadata Warning: empty or missing yaml metadata in repo card (https://huggingface.co/docs/hub/model-cards#model-card-metadata)

Quantization made by Richard Erkhov.

Request more models

mistral-7b-zephyr-sft - bnb 4bits

Model creator: https://huggingface.co/wandb/
Original model: https://huggingface.co/wandb/mistral-7b-zephyr-sft/

Original model description:

license: mit library_name: transformers datasets: - HuggingFaceH4/deita-10k-v0-sft base_model: mistralai/Mistral-7B-v0.1

Mistral 7B Zephyr SFT V2

The Zephyr SFT recipe applied on top of Mistral 7B (new recipe with chatML format)

Model description

Model type: A 7.2B parameter GPT-like model fine-tuned on a mix of publicly available, synthetic datasets.
Language(s) (NLP): Primarily English
Finetuned from model: mistralai/Mistral-7B-v0.1

Recipe

We trained using the alignment handbook recipe and logging to W&B

Visit the W&B workspace here

Compute provided by Lambda Labs - 8xA100 80GB node

Downloads last month: 1

Safetensors

Model size

3.86B params

Tensor type

F32

·

FP16

·

U8

·