GGUF
English
Edit model card

TinyLlama 1.1B Chat v0.3 - GGUF

Support for calm

These models support the calm language model runner. The particular quants selected for this repo are in support of calm, which is a language model runner that automatically uses the right prompts, templates, context size, etc.

Downloads last month
53
GGUF
Model size
1.1B params
Architecture
llama

4-bit

6-bit

16-bit

Inference API (serverless) has been turned off for this model.

Quantized from

Datasets used to train iandennismiller/TinyLlama-1.1B-Chat-v0.3-GGUF