MLX
vision
kimi
exo

Kimi-K2.7-Code-vision

Vision-only weights (MoonViT tower + multimodal projector) extracted from moonshotai/Kimi-K2.7-Code for use with MLX-based inference stacks such as exo, in the same format as exolabs/Kimi-K2.6-vision.

Contents

  • kimi_k27_vision.safetensors โ€” all 335 vision_tower.* and mm_projector.* tensors from the official repo (shards 63โ€“64), original bfloat16, unmodified.
  • config.json โ€” vision config copied from the official config.json (verified byte-identical to Kimi-K2.6's vision config: 27-layer MoonViT, hidden 1152, patch 14, sd2_tpool merger, projector to 7168).
  • extract_vision_weights.py โ€” the script used to produce this repo, for reproducibility.

Usage with exo

Add a model card for moonshotai/Kimi-K2.7-Code with:

capabilities = ["text", "thinking", "thinking_toggle", "vision"]

[vision]
image_token_id = 163605
model_type = "kimi_vl"
weights_repo = "aidiffuser/Kimi-K2.7-Code-vision"
processor_repo = "moonshotai/Kimi-K2.7-Code"

Tested working: distributed (2ร— Mac Studio M3 Ultra, tensor parallelism) with the official INT4 text weights, image understanding confirmed.

License

Same Modified MIT license as the source model; these are a subset of the original weights, unmodified. All credit to Moonshot AI.

Downloads last month
-
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for aidiffuser/Kimi-K2.7-Code-vision

Finetuned
(3)
this model