Qwen3-4B-Instruct-2507 (BaseRT)
BaseRT .base builds of
Qwen/Qwen3-4B-Instruct-2507 for fast local inference on Apple Silicon.
| File | Quant |
|---|---|
Qwen3-4B-Instruct-2507-Q4.base |
4-bit (default) |
Qwen3-4B-Instruct-2507-Q8.base |
8-bit |
basert serve basecompute/Qwen3-4B-Instruct-2507 # Q4 (default)
basert serve basecompute/Qwen3-4B-Instruct-2507 --variant default-q8 # Q8
Documentation: https://docs.basecompute.co
- Downloads last month
- -
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support
Model tree for basecompute/Qwen3-4B-Instruct-2507
Base model
Qwen/Qwen3-4B-Instruct-2507