Edit model card

Felladrin/Smol-Llama-101M-Chat-v1-GGUF

Quantized GGUF model files for Smol-Llama-101M-Chat-v1 from Felladrin

Original Model Card:

A Llama Chat Model of 101M Parameters

Recommended Prompt Format

The recommended prompt format is as follows:

<|im_start|>system
{system_message}<|im_end|>
<|im_start|>user
{user_message}<|im_end|>
<|im_start|>assistant

Recommended Inference Parameters

To get the best results, add special tokens and prefer using contrastive search for inference:

add_special_tokens: true
penalty_alpha: 0.5
top_k: 5
Downloads last month
53
GGUF
Model size
101M params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API (serverless) has been turned off for this model.

Quantized from

Datasets used to train afrideva/Smol-Llama-101M-Chat-v1-GGUF