larenspear
/

TinyLlama-1.1B-Chat-v1.0-GGUF

Inference Endpoints

Model card Files Files and versions Community

Edit model card

About

GGUF Quantizations of https://huggingface.co/TinyLlama/TinyLlama-1.1B-Chat-v1.0

Provided Quantizations

Link	Type
GGUF	Q2_K
GGUF	Q3_K_S
GGUF	Q3_K_M
GGUF	Q3_K_L
GGUF	Q4_0
GGUF	Q4_K_S
GGUF	Q4_K_M
GGUF	Q5_0
GGUF	Q5_K_S
GGUF	Q5_K_M
GGUF	Q6_K
GGUF	Q8_0

In a circular citation, I borrowed the format of this file from https://huggingface.co/mradermacher/copy_of_wildjailbreak_13-GGUF.

Downloads last month: 99

GGUF

Model size

1.1B params

Architecture

llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API

Unable to determine this model’s pipeline type. Check the docs .

Model tree for larenspear/TinyLlama-1.1B-Chat-v1.0-GGUF

Base model

TinyLlama/TinyLlama-1.1B-Chat-v1.0

Quantized

(66)

this model