North-Mini-Code-1.0 MLX BF16

BF16 MLX conversion of CohereLabs/North-Mini-Code-1.0.

  • Source revision: effaeda477c041c107d5a3d8c599cb5d6c5878ef
  • Architecture: Cohere2MoeForCausalLM / cohere2_moe
  • Parameters: 30.48B total, 3B active
  • Artifact size: 60.99 GB, 13 safetensor shards
  • Verification: conversion completed and all safetensor headers were readable
  • Local runtime on M2 Max 32 GB: failed with Metal out-of-memory

Requires pinned experimental MLX-LM cohere2_moe support until it lands in a release:

pip install "mlx-lm @ git+https://github.com/Terrencezzj/mlx-lm.git@f43507c5c30bdebdb92d308ac11aa8f96b418c2e"
Downloads last month
-
Safetensors
Model size
30B params
Tensor type
BF16
·
MLX
Hardware compatibility
Log In to add your hardware

Quantized

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for bsisduck/North-Mini-Code-1.0-MLX-BF16

Finetuned
(3)
this model