Unable to load with mlx-lm 0.31.3 — architecture not recognized

#1
by skaman5 - opened

Tested loading this model with mlx-lm and it fails due to the laguna architecture not being recognized.

Environment

  • mlx-lm: 0.31.3
  • MLX: 0.31.2
  • Hardware: MacBook Pro M1 Max (24-core GPU), 32 GB unified memory
  • macOS: 26.4.1

What I tried

mlx_lm.server --model mlx-community/Laguna-XS.2-4bit --port 8080

Fails during model load — the laguna architecture is not in mlx-lm's supported architecture list.

Cross-reference

The GGUF variant (Lucebox/Laguna-XS.2-GGUF) also fails on:

  • llama.cpp build 9330 (ggml-org/llama.cpp#23249)
  • LM Studio 0.4.14+4
  • Ollama 0.24.0

vLLM 0.21.0 has merged support (vllm-project/vllm#41129), confirming the architecture is implementable. vLLM Metal (Apple Silicon plugin, v0.2.0) does not yet include Laguna in its supported models list either.

Model info

Laguna-XS.2 by Poolside AI — 33B total / 3B active MoE, 256 experts + 1 shared, sliding window attention 512 tokens, 131K context, Apache 2.0.

Happy to re-test when mlx-lm adds support for this architecture.

Sign up or log in to comment