Edit model card

Original model: Poro-34B-chat

Description

GGUF-format model files quantized using llama.cpp

We have Q4_K_M and Q5_K_M quantized models available.

Downloads last month
199
GGUF
Model size
35.1B params
Architecture
bloom

4-bit

5-bit

Inference API
Unable to determine this model's library. Check the docs .

Collection including LumiOpen/Poro-34B-chat-GGUF