Mistral-7B-Instruct-v0.3-Q4_K_M (GGUF)

This repository contains the Mistral-7B-Instruct-v0.3 model in GGUF format with Q4_K_M quantization.

Model Information

  • Base Model: Mistral-7B-Instruct-v0.3
  • Original Creator: Mistral AI
  • License: Apache 2.0
  • Format: GGUF (Quantized)
  • Quantization: Q4_K_M (4-bit quantization, medium quality)
  • Model Size: ~4.1 GB

About This Model

Mistral 7B Instruct v0.3 is a instruction-tuned large language model developed by Mistral AI. This version has been quantized to GGUF format for efficient inference with llama.cpp and compatible frameworks.

Key Features

  • 7 billion parameters
  • Optimized for instruction-following tasks
  • Supports extended vocabulary
  • Apache 2.0 licensed (commercial use allowed)

Usage

This model can be used with:

  • llama.cpp
  • Ollama
  • LM Studio
  • text-generation-webui
  • Any GGUF-compatible inference engine

Example with llama.cpp:

./main -m Mistral-7B-Instruct-v0.3-Q4_K_M.gguf -p "Your prompt here" -n 512

Example with Python (llama-cpp-python):

from llama_cpp import Llama

llm = Llama(model_path="Mistral-7B-Instruct-v0.3-Q4_K_M.gguf")
output = llm("Q: What is the capital of France? A:", max_tokens=256)
print(output)

Quantization Details

Q4_K_M quantization provides:

  • Good balance between size and quality
  • ~4-bit average quantization
  • Suitable for most use cases
  • Recommended for systems with limited VRAM

License

This model is licensed under Apache License 2.0.

You are free to:

  • โœ… Use commercially
  • โœ… Modify and distribute
  • โœ… Use privately
  • โœ… Patent use

Attribution Required: You must give appropriate credit to Mistral AI, provide a link to the license, and indicate if changes were made.

See the Apache 2.0 License for full details.

Attribution

Original model developed by Mistral AI.

Citation

@article{mistral7b,
  title={Mistral 7B},
  author={Mistral AI Team},
  year={2023}
}

Disclaimer

This model is provided "as is" without warranty of any kind. See the Apache 2.0 license for details.

Downloads last month
11
GGUF
Model size
7B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for ARAVINDS2022002/Mistral-7B-Instruct-v0.3-Q4_K_M

Quantized
(246)
this model