Llama-3.2-3B-Instruct

BaseRT .base builds of meta-llama/Llama-3.2-3B-Instruct for fast local inference on Apple Silicon (Metal).

Files

| File | Precision | Size |
| --- | --- | --- |
| Llama-3.2-3B-Instruct-Q4.base | 4-bit | 1.7G |
| Llama-3.2-3B-Instruct-Q8.base | 8-bit | 3.1G |

Usage

curl -LsSf https://basecompute.co/install.sh | sh
basert pull basecompute/Llama-3.2-3B-Instruct
basert chat basecompute/Llama-3.2-3B-Instruct
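
Because these builds target Metal on Apple Silicon, it can help to check the machine before installing. The following is a minimal sketch, not part of BaseRT: `is_apple_silicon` is a hypothetical helper that relies only on the standard `uname` command, and it assumes that macOS on arm64 is the only supported platform. A shell running under Rosetta reports `x86_64`, so this check fails there.

```shell
#!/bin/sh
# Hypothetical helper (not part of BaseRT): succeeds only for macOS on arm64.
# Both arguments default to the current machine's values; passing them
# explicitly makes the check easy to try out with other values.
is_apple_silicon() {
    os="${1:-$(uname -s)}"
    arch="${2:-$(uname -m)}"
    [ "$os" = "Darwin" ] && [ "$arch" = "arm64" ]
}

if is_apple_silicon; then
    echo "Apple Silicon detected: the install and pull commands should apply."
else
    echo "Not Apple Silicon: these builds target Metal on Apple Silicon." >&2
fi
```

Running the check first avoids downloading a multi-gigabyte file onto a machine that cannot use it.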

Released under the llama3.2 license, inherited from the base model.
