Instructions to use leonsarmiento/Ornith-1.0-35B-5bit-mlx with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- MLX
How to use leonsarmiento/Ornith-1.0-35B-5bit-mlx with MLX:
# Make sure mlx-vlm is installed # pip install --upgrade mlx-vlm from mlx_vlm import load, generate from mlx_vlm.prompt_utils import apply_chat_template from mlx_vlm.utils import load_config # Load the model model, processor = load("leonsarmiento/Ornith-1.0-35B-5bit-mlx") config = load_config("leonsarmiento/Ornith-1.0-35B-5bit-mlx") # Prepare input image = ["http://images.cocodataset.org/val2017/000000039769.jpg"] prompt = "Describe this image." # Apply chat template formatted_prompt = apply_chat_template( processor, config, prompt, num_images=1 ) # Generate output output = generate(model, processor, formatted_prompt, image) print(output) - Notebooks
- Google Colab
- Kaggle
- Local Apps Settings
- LM Studio
- Pi
How to use leonsarmiento/Ornith-1.0-35B-5bit-mlx with Pi:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "leonsarmiento/Ornith-1.0-35B-5bit-mlx"
Configure the model in Pi
# Install Pi: npm install -g @mariozechner/pi-coding-agent # Add to ~/.pi/agent/models.json: { "providers": { "mlx-lm": { "baseUrl": "http://localhost:8080/v1", "api": "openai-completions", "apiKey": "none", "models": [ { "id": "leonsarmiento/Ornith-1.0-35B-5bit-mlx" } ] } } }Run Pi
# Start Pi in your project directory: pi
- Hermes Agent new
How to use leonsarmiento/Ornith-1.0-35B-5bit-mlx with Hermes Agent:
Start the MLX server
# Install MLX LM: uv tool install mlx-lm # Start a local OpenAI-compatible server: mlx_lm.server --model "leonsarmiento/Ornith-1.0-35B-5bit-mlx"
Configure Hermes
# Install Hermes: curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash hermes setup # Point Hermes at the local server: hermes config set model.provider custom hermes config set model.base_url http://127.0.0.1:8080/v1 hermes config set model.default leonsarmiento/Ornith-1.0-35B-5bit-mlx
Run Hermes
hermes
Won't load on LM Studio
It's throwing an error during load: ValueError: Unsupported model type: qwen3_5_moe_vision
Full error dump:
Failed to load model.
Error when loading model: Exception: Encountered fatal exception in the backend generation thread: Traceback (most recent call last):
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_engine/model_kit/batched_vision/model_kit.py", line 318, in _generate_with_exception_handling
self._generate()
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_engine/model_kit/batched_vision/model_kit.py", line 539, in _generate
self._load_model()
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_engine/model_kit/batched_vision/model_kit.py", line 179, in _load_model
self.model, _ = mlx_vlm.utils.load_model(
^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_vlm/utils.py", line 251, in load_model
model = model_class.Model(model_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_vlm/models/qwen3_5_moe/qwen3_5_moe.py", line 16, in __init__
self.vision_tower = VisionModel(config.vision_config)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/gw/.lmstudio/extensions/backends/vendor/_amphibian/app-mlx-generate-mac14-arm64@29/lib/python3.11/site-packages/mlx_vlm/models/qwen3_vl/vision.py", line 201, in __init__
raise ValueError(f"Unsupported model type: {self.model_type}")
ValueError: Unsupported model type: qwen3_5_moe_vision
Sorry, I actually opened this on the wrong model -- should have been 4bit. Nevermind, I see there's not a separate 4bit to comment on.
mlx-vlm's VisionModel only accepts the following values for model_type (see mlx_vlm/models/qwen3_vl/vision.py):
if self.model_type not in ["qwen3_vl", "qwen3_5", "qwen3_5_moe"]:
raise ValueError(f"Unsupported model type: {self.model_type}")
qwen3_5_moe_vision is not in the allowed list, so the model fails to load.
Comparison with the official Qwen3.6 MLX model
The official lmstudio-community/Qwen3.6-35B-A3B-MLX-4bit uses qwen3_5_moe for the same field and loads without issues:
"vision_config": {
"model_type": "qwen3_5_moe",
...
}
Expected behavior
vision_config.model_type should be qwen3_5_moe (not qwen3_5_moe_vision), consistent with the official Qwen3.6 MLX conversion.
Suggested fix
In config.json, change:
"vision_config": {
- "model_type": "qwen3_5_moe_vision",
+ "model_type": "qwen3_5_moe",
...
}
I recommended to use oMLX for this, and pretty much any other mlx model, specially if you are using agentic tools.