Model Sources

https://huggingface.co/HuggingFaceTB/SmolLM-360M-Instruct

Uses

A very small model for running on edge devices, with strong time to first token (TTFT) and throughput.

Direct Use

Use llama.cpp to run inference with the model.
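
As a rough sketch of what that could look like, the Python snippet below builds a llama.cpp command line and, optionally, runs it. It assumes the `llama-cli` binary from llama.cpp is on your PATH (older builds name it `main`) and uses its `-m`, `-p`, and `-n` options. The GGUF filename, the prompt, and the helper names `build_llama_cli_command` and `run_inference` are placeholders for illustration and are not part of this repository.

```python
import shlex
import subprocess
from pathlib import Path


def build_llama_cli_command(model_path, prompt, max_tokens=128, binary="llama-cli"):
    """Build the argument list for a llama.cpp CLI invocation."""
    return [
        binary,
        "-m", str(model_path),  # path to the GGUF model file
        "-p", prompt,           # prompt text
        "-n", str(max_tokens),  # number of tokens to generate
    ]


def run_inference(model_path, prompt, max_tokens=128):
    """Run llama.cpp and return its standard output.

    Requires the llama.cpp binary to be installed; raises
    subprocess.CalledProcessError if the command exits with an error.
    """
    cmd = build_llama_cli_command(model_path, prompt, max_tokens)
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout


# Hypothetical local filename: replace it with the GGUF file you downloaded.
MODEL_PATH = Path("smollm-360m-instruct-f16.gguf")

# Print the command rather than executing it, so the sketch also works
# on machines where llama.cpp is not installed.
print(shlex.join(build_llama_cli_command(MODEL_PATH, "Explain what an edge device is.")))
```

Calling `run_inference(MODEL_PATH, "...")` executes the printed command. Depending on the llama.cpp version, the CLI may start an interactive chat session for instruct models, so check `llama-cli --help` for the options your build supports.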

GGUF

Model size: 362M params
Architecture: llama
Quantization: 16-bit