Info

This is mistralai/Mistral-7B-Instruct-v0.2 with the intermediate (feed-forward) size cut from 14336 down to 3072 throughout, resulting in a ~2.81B-parameter model.
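
As a sanity check on the ~2.81B figure, the parameter count can be reproduced with a little arithmetic. The sketch below is only an illustration: the function name `count_params` is made up here, and the architecture values other than the intermediate size (hidden size 4096, 32 layers, 32 attention heads, 8 key/value heads, vocabulary of 32000, untied input and output embeddings) are assumed from the published Mistral-7B-Instruct-v0.2 configuration rather than stated in this card.

```python
def count_params(intermediate_size,
                 hidden_size=4096,    # assumed from the Mistral-7B config
                 num_layers=32,       # assumed
                 num_heads=32,        # assumed
                 num_kv_heads=8,      # assumed (grouped-query attention)
                 vocab_size=32000):   # assumed
    """Count weights of a Mistral-style decoder (no bias terms)."""
    head_dim = hidden_size // num_heads
    kv_dim = num_kv_heads * head_dim

    # Attention: q_proj and o_proj are hidden x hidden,
    # k_proj and v_proj are hidden x kv_dim.
    attention = 2 * hidden_size * hidden_size + 2 * hidden_size * kv_dim

    # Gated MLP: gate_proj, up_proj and down_proj each hold
    # hidden_size * intermediate_size weights.
    mlp = 3 * hidden_size * intermediate_size

    # Two RMSNorm weight vectors per layer.
    norms = 2 * hidden_size

    per_layer = attention + mlp + norms

    # Untied token embedding and output head, plus the final norm.
    embeddings = 2 * vocab_size * hidden_size
    final_norm = hidden_size

    return num_layers * per_layer + embeddings + final_norm


original = count_params(14336)
reduced = count_params(3072)
print(f"original: {original / 1e9:.2f}B")  # original: 7.24B
print(f"reduced:  {reduced / 1e9:.2f}B")   # reduced:  2.81B
```

Under these assumptions, only the MLP term changes between the two calls, so the whole reduction comes from the smaller feed-forward projections; the attention and embedding weights are the same size in both models.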

This model needs further pre-training before it is usable: at the moment it generates only gibberish.

Model size: 2.81B params (Safetensors)
Tensor type: BF16
Inference API (serverless) has been turned off for this model (Aryanne/Mistral-3B-Instruct-v0.2-init).