Qwen3-4B-Instruct-2507 (BaseRT)

BaseRT .base builds of Qwen/Qwen3-4B-Instruct-2507 for fast local inference on Apple Silicon.

File                              Quant
Qwen3-4B-Instruct-2507-Q4.base    4-bit (default)
Qwen3-4B-Instruct-2507-Q8.base    8-bit

basert serve basecompute/Qwen3-4B-Instruct-2507                         # Q4 (default)
basert serve basecompute/Qwen3-4B-Instruct-2507 --variant default-q8    # Q8

Documentation: https://docs.basecompute.co
