Qwen3.6-rust
Collection
12 items • Updated
How to use jedisct1/Qwen3.6-35B-rust-v2-bf16.mlx with MLX:
# Download the model from the Hub pip install huggingface_hub[hf_xet] huggingface-cli download --local-dir Qwen3.6-35B-rust-v2-bf16.mlx jedisct1/Qwen3.6-35B-rust-v2-bf16.mlx
A Rust-focused Qwen3.6-35B-A3B model for Apple Silicon, packaged in MLX.
Use it as a coding assistant for Rust projects: generating focused patches, explaining diffs, tightening tests, reading command output, and making small repo-aware edits. It was tested with Swival on local tool-calling workflows.
This is the plain BF16 full-precision compatibility variant. It does not include native MTP tensors, so it is useful when you want full precision but your loader does not support MTP sidecars.
jedisct1/Qwen3.6-35B-rust-v2-4bit.mlx if you want the smallest plain package.jedisct1/Qwen3.6-35B-rust-v2-8bit.mlx if you want a quantized plain package with more precision than 4-bit.jedisct1/Qwen3.6-35B-rust-v2-MTP-bf16.mlx if your runtime supports native MTP and you want full precision with the MTP path.Requires mlx-lm:
pip install mlx-lm
from mlx_lm import load, generate
model, tokenizer = load("jedisct1/Qwen3.6-35B-rust-v2-bf16.mlx")
messages = [
{"role": "system", "content": "You are an expert Rust developer."},
{"role": "user", "content": "Generate a focused patch that replaces unwrap() calls in parse_config() with proper error propagation."},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
response = generate(model, tokenizer, prompt=prompt, max_tokens=500)
print(response)
Quantized
Base model
Qwen/Qwen3.6-35B-A3B