matrixportal/aya-23-8B-GGUF

This model was converted to GGUF format from CohereForAI/aya-23-8B using llama.cpp via ggml.ai's all-gguf-same-where space. Refer to the original model card for more details on the model.

✅ Quantized Models Download List

✨ Recommended for CPU: Q4_K_M | ⚡ Recommended for ARM CPU: Q4_0 | 🏆 Best Quality: Q8_0

| 🚀 Download | 🔢 Type | 📝 Notes |
|---|---|---|
| Download | Q2_K | Basic quantization |
| Download | Q3_K_S | Small size |
| Download | Q3_K_M | Balanced quality |
| Download | Q3_K_L | Better quality |
| Download | Q4_0 | Fast on ARM |
| Download | Q4_K_S | Fast, recommended |
| Download | Q4_K_M | Best balance |
| Download | Q5_0 | Good quality |
| Download | Q5_K_S | Balanced |
| Download | Q5_K_M | High quality |
| Download | Q6_K | 🏆 Very good quality |
| Download | Q8_0 | Fast, best quality |
| Download | F16 | Maximum accuracy |

💡 Tip: Use F16 for maximum precision when quality is critical
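A minimal sketch of running one of these quants locally with llama-cpp-python. The glob pattern, context size, and prompt below are illustrative assumptions; check the repo's file list for the exact .gguf filenames before downloading.

```python
# Minimal sketch: fetch a quantized file from the Hub and run it locally.
# Assumes `pip install llama-cpp-python huggingface_hub`.
from llama_cpp import Llama

# from_pretrained downloads the first .gguf in the repo matching the
# filename glob. The pattern here targets the CPU-recommended Q4_K_M
# quant from the table above (assumed naming; verify in the repo).
llm = Llama.from_pretrained(
    repo_id="matrixportal/aya-23-8B-GGUF",
    filename="*Q4_K_M.gguf",
    n_ctx=4096,  # context window; adjust to your hardware
)

out = llm.create_chat_completion(
    messages=[{"role": "user", "content": "Introduce yourself briefly."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```

Swap the filename glob for any other type in the table (e.g. `*Q8_0.gguf` for best quality) to trade file size against output quality.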

Format: GGUF | Model size: 8.03B params | Architecture: command-r


