QuantFactory/Replete-Coder-Instruct-8b-Merged-GGUF

This is quantized version of Replete-AI/Replete-Coder-Instruct-8b-Merged created using llama.cpp

Model Description

This is a Ties merge between the following models:

The Coding, and Overall performance of this models seems to be better than both base models used in the merge. Benchmarks are coming in the future.

GGUF

Model size

8.03B params

Architecture

llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Examples

Unable to determine this model's library. Check the docs .