Ornith-1.0-35B-8bit

This repository contains an MLX-LM conversion of deepreinforce-ai/Ornith-1.0-35B.

Conversion Details

  • Original model: deepreinforce-ai/Ornith-1.0-35B
  • Model size: 35B
  • MLX model type: qwen3_5_moe
  • Quantization: MLX-LM affine quantization
  • Bits: 8-bit
  • Group size: 64
  • Local MLX folder size: 34.32 GiB
  • Local safetensors weight size: 34.30 GiB

This conversion used a small MLX weight-name adaptation for the MoE expert tensors before quantization.

Usage

mlx_lm.generate --model DreamFoundries/Ornith-1.0-35B-8bit --prompt "Hello"

Benchmarks

No comparative benchmarks have been run for this conversion. The uploaded files were checked locally by loading the model with mlx_lm.generate and generating a short sample, but no quality, speed, memory, or benchmark comparisons against the original Hugging Face weights or other quantizations are provided.

License

The original model is released under the MIT license. See the original repository for the upstream model card, license, and usage notes: deepreinforce-ai/Ornith-1.0-35B.

Downloads last month
2
Safetensors
Model size
35B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

8-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DreamFoundries/Ornith-1.0-35B-8bit

Quantized
(81)
this model

Collection including DreamFoundries/Ornith-1.0-35B-8bit