LFM2.5-8B-A1B-Base GGUF

This repository contains an imatrix-assisted Q4_K_M GGUF quantization of LiquidAI/LFM2.5-8B-A1B-Base.

Files

  • LFM2.5-8B-A1B-Base-Q4_K_M.gguf: quantized GGUF model
  • LFM2.5-8B-A1B-Base-imatrix.gguf: importance matrix used during quantization

Build Notes

The source safetensors file was verified against the upstream SHA-256:

9a57e48f4f70c56d4a2c7718de5cd89bee5c0402ee74b4e5da3ec519192b5c77
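The same check can be repeated locally. The sketch below is a minimal example using only Python's standard library; it is not the script used for this build, and the helper names and the path in the usage note are hypothetical.

```python
import hashlib

# Upstream SHA-256 quoted in the build notes above.
EXPECTED_SHA256 = "9a57e48f4f70c56d4a2c7718de5cd89bee5c0402ee74b4e5da3ec519192b5c77"


def sha256_of_file(path, chunk_size=1 << 20):
    """Hash a file in 1 MiB chunks so large weights never sit fully in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True when the file's digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

Calling `matches_expected("model.safetensors")` on the downloaded source file (the filename here is an assumption) should return True before conversion starts.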

The conversion used llama.cpp. The local copy of the converter had to be adjusted so that the LFM2.5 pre-tokenizer hash maps to llama.cpp's existing lfm2 tokenizer identifier.
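To illustrate the idea behind that mapping: as far as I understand it, the llama.cpp converter identifies a pre-tokenizer by hashing the token IDs that the tokenizer produces for a fixed probe text, then looking that hash up in a list of known values. The sketch below mimics that lookup in isolation. The function names, the table, and the token IDs are all made up for illustration, and the real LFM2.5 hash is not reproduced here because this document does not state it.

```python
import hashlib


def pre_tokenizer_hash(token_ids):
    """Hash the string form of a token-ID list (illustrative fingerprint)."""
    return hashlib.sha256(str(token_ids).encode()).hexdigest()


def resolve_pre_tokenizer(token_ids, table):
    """Look up the pre-tokenizer name for a fingerprint, or fail loudly."""
    key = pre_tokenizer_hash(token_ids)
    if key not in table:
        raise KeyError(f"unrecognized pre-tokenizer hash: {key}")
    return table[key]


# Made-up token IDs standing in for the tokenizer's output on the probe text.
toy_ids = [101, 7592, 2088, 102]

# Hypothetical table entry: map the new fingerprint to the existing name.
table = {pre_tokenizer_hash(toy_ids): "lfm2"}

print(resolve_pre_tokenizer(toy_ids, table))
```

An unknown fingerprint raises an error instead of silently picking a default, which is the situation that makes a manual mapping necessary.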

The imatrix calibration run completed, with partial coverage reported for a few MoE expert tensors in block 9. This can occur when a short calibration corpus does not route enough tokens through every expert, so some experts see few or no activations.
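A toy example makes the coverage effect concrete. The sketch below counts how many tokens reach each expert under a made-up top-2 routing trace; the expert count, the trace, and the helper names are assumptions for illustration and do not reflect the actual LFM2.5 configuration or the imatrix tool's internals.

```python
from collections import Counter


def expert_token_counts(routed_experts, num_experts):
    """Count how many tokens were routed to each expert index."""
    counts = Counter(e for token in routed_experts for e in token)
    return [counts.get(i, 0) for i in range(num_experts)]


def under_covered(counts, min_tokens=1):
    """Return the expert indices that received fewer than min_tokens tokens."""
    return [i for i, c in enumerate(counts) if c < min_tokens]


# Made-up trace: each tuple lists the two experts chosen for one token.
routing = [(0, 1), (0, 2), (1, 2), (0, 1)]

counts = expert_token_counts(routing, num_experts=4)
print(counts)                 # [3, 3, 2, 0]
print(under_covered(counts))  # [3]
```

With only four tokens, expert 3 is never selected, so no activation statistics would be collected for it. A longer or more varied calibration corpus reduces the chance of this.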

Smoke Test

Before publishing, the quantized GGUF was loaded with llama.cpp in single-turn conversation mode and checked on arithmetic, JSON formatting, and a simple Python function prompt.
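Checks of this kind can be automated once the model's output has been captured as text. The sketch below shows one possible set of validators for the three prompt types; it is not the procedure used for this release, the function names are hypothetical, and how the output is obtained from llama.cpp is deliberately left out.

```python
import ast
import json
import re


def is_valid_json(text):
    """Return True when the text parses as JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True


def contains_answer(text, expected):
    """Return True when the expected integer appears as a whole number."""
    return str(expected) in re.findall(r"-?\d+", text)


def defines_function(text, name):
    """Return True when the text is valid Python that defines the function."""
    try:
        tree = ast.parse(text)
    except SyntaxError:
        return False
    return any(
        isinstance(node, ast.FunctionDef) and node.name == name
        for node in ast.walk(tree)
    )
```

These validators only confirm that the output is well formed and contains the expected value. They do not judge overall quality, so they suit a quick pre-publication sanity check and not a benchmark.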
