QuantFactory/Replete-Coder-Instruct-8b-Merged-GGUF

This is a quantized version of Replete-AI/Replete-Coder-Instruct-8b-Merged, created using llama.cpp.

Model Description

This is a TIES merge between the following models:

The coding and overall performance of this model seems to be better than that of both base models used in the merge. Benchmarks will be published later.
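
For readers unfamiliar with TIES merging, the following is a minimal sketch of the general technique (trim, elect sign, disjoint mean) on flat lists of floats. It is not the procedure used to build this model: the card does not state the merge tool or its settings, and the names `ties_merge`, `density`, and `scale` are hypothetical, chosen for illustration only.

```python
def ties_merge(base, finetuned, density=0.5, scale=1.0):
    """Toy TIES merge over flat lists of floats (illustrative only).

    base      -- weights of the shared base model
    finetuned -- list of weight lists, one per fine-tuned model
    density   -- assumed fraction of each task vector to keep when trimming
    scale     -- assumed scaling factor applied to the merged task vector
    """
    n = len(base)

    # 1. Task vectors: how far each fine-tuned model moved from the base.
    task_vectors = [[w - b for w, b in zip(model, base)] for model in finetuned]

    # 2. Trim: keep only the largest-magnitude entries of each task vector.
    keep = max(1, int(round(density * n)))
    trimmed = []
    for tv in task_vectors:
        order = sorted(range(n), key=lambda i: abs(tv[i]), reverse=True)
        top = set(order[:keep])
        trimmed.append([v if i in top else 0.0 for i, v in enumerate(tv)])

    merged = []
    for i in range(n):
        values = [tv[i] for tv in trimmed]

        # 3. Elect sign: the direction with the larger total magnitude wins.
        total = sum(values)
        sign = 1.0 if total > 0 else -1.0 if total < 0 else 0.0

        # 4. Disjoint mean: average only the values that agree with that sign.
        agreeing = [v for v in values if v * sign > 0]
        delta = sum(agreeing) / len(agreeing) if agreeing else 0.0

        # 5. Add the scaled merged update back onto the base weight.
        merged.append(base[i] + scale * delta)
    return merged
```

In this sketch, parameters where the models disagree in direction are resolved in favour of the stronger direction rather than averaged toward zero, which is the main difference from a plain weight average.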

Format: GGUF
Model size: 8.03B params
Architecture: llama
Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit
