mlx-community/LFM2.5-8B-A1B-MLX-8bit

This model mlx-community/LFM2.5-8B-A1B-MLX-8bit was converted to MLX format from LiquidAI/LFM2.5-8B-A1B using mlx-lm version 0.31.1.

lfm2_moe (Lfm2MoeForCausalLM): hybrid 18× double-gated short-conv + 6× GQA attention, 32-expert top-4 MoE — 8.3B total / 1.5B active, 128k context.

Use with mlx

pip install mlx-lm
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/LFM2.5-8B-A1B-MLX-8bit")

prompt = "hello"
if tokenizer.chat_template is not None:
    messages = [{"role": "user", "content": prompt}]
    prompt = tokenizer.apply_chat_template(messages, add_generation_prompt=True)

response = generate(model, tokenizer, prompt=prompt, verbose=True)

License

Derivative of LiquidAI/LFM2.5-8B-A1B under the LFM Open License v1.0 (lfm1.0); original model © Liquid AI. lfm1.0 commercial-use terms apply.

Downloads last month
121
Safetensors
Model size
8B params
Tensor type
BF16
·
U32
·
F32
·
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for mlx-community/LFM2.5-8B-A1B-MLX-8bit

Quantized
(34)
this model