---
license: other
license_name: mrl
license_link: https://mistral.ai/licenses/MRL-0.1.md
language:
  - en
  - fr
  - de
  - es
  - it
  - pt
  - zh
  - ja
  - ru
  - ko
---

This is mistralai/Mistral-Large-Instruct-2407, converted to GGUF and quantized to q8_0. All tensors, including the embedding and output tensors, are q8_0.

The model is split into shards no larger than 7 GB using the `llama-gguf-split` CLI utility from llama.cpp. This makes it less painful to resume the download if it is interrupted.
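As a rough sketch of how such a split is produced and consumed (filenames here are hypothetical, not the actual shard names in this repo), `llama-gguf-split` writes numbered shards and llama.cpp can load them by pointing at the first one:

```shell
# Split a single GGUF file into shards of at most 7 GB each.
# Produces model-q8_0-00001-of-0000N.gguf, model-q8_0-00002-of-0000N.gguf, ...
llama-gguf-split --split --split-max-size 7G Mistral-Large-Instruct-2407-q8_0.gguf model-q8_0

# To use the model, pass only the first shard; llama.cpp finds the rest
# automatically as long as all shards sit in the same directory.
llama-cli -m model-q8_0-00001-of-00005.gguf -p "Hello"

# Optionally merge the shards back into one file.
llama-gguf-split --merge model-q8_0-00001-of-00005.gguf Mistral-Large-Instruct-2407-q8_0.gguf
```

No merge step is required for inference; the shards can be used directly.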

GGUFv3