Aryanne
/

Mistral-3B-Instruct-v0.2-init

Text Generation

Transformers

Safetensors

GGUF

mistral

conversational

text-generation-inference

Model card Files Files and versions Community

Edit model card

Info

This is the model mistralai/Mistral-7B-Instruct-v0.2 which I cut all the intermediate(feed_forward_length) size with 14336 down to 3072, resulting in a ~2.81B model.

It's necessary to pre-train this model, cause at the moment is generating just gibberish.

Downloads last month: 218

Safetensors

Model size

2.81B params

Tensor type

BF16