Llama-Sahabat-AI-v2-70B-IT - Q2_K_S GGUF

Quantized GGUF version of Sahabat-AI/Llama-Sahabat-AI-v2-70B-IT.

About the base model

Sahabat-AI adalah model bahasa besar (LLM) yang dikembangkan secara kolaboratif oleh BRIN, GoTo, dan Bukalapak untuk mendukung ekosistem AI berbahasa Indonesia. Model ini dilatih menggunakan data bahasa Indonesia yang kaya dan beragam, sehingga mampu memahami konteks budaya, bahasa, dan kebutuhan spesifik pengguna Indonesia dengan lebih baik.

Sahabat AI is a Large Language Model (LLM) collaboratively developed by BRIN (National Research and Innovation Agency), GoTo, and Bukalapak to support the Indonesian-language AI ecosystem. Trained on rich and diverse Indonesian language data, it better understands the cultural context, language nuances, and specific needs of Indonesian users.

Quantization details

Property Value
Quantization Q2_K_S
Bits per weight 2.77 BPW
File size ~23 GB
Original size ~141 GB (bf16)
imatrix Yes (generated from Q3_K_S with groups_merged.txt calibration data)

Note: Q2_K_S is the most aggressive quantization โ€” expect noticeable quality degradation vs Q3_K_S or Q8_0. Use Q3_K_S for better quality or Q8_0 for near-lossless inference.

Other quantizations

Usage

Load with any llama.cpp-compatible runner (llama.cpp, ollama, LM Studio, etc.):

llama-cli -m llama-sahabat-70b-Q2_K_S.gguf -p "Halo, apa kabar?"
Downloads last month
7
GGUF
Model size
71B params
Architecture
llama
Hardware compatibility
Log In to add your hardware

2-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support

Model tree for blackshell69/Llama-Sahabat-AI-v2-70B-IT-Q2_K_S

Quantized
(4)
this model