Edit model card

QuantFactory Banner

QuantFactory/Rombos-LLM-V2.6-Qwen-14b-GGUF

This is quantized version of rombodawg/Rombos-LLM-V2.6-Qwen-14b created using llama.cpp

Original Model Card

Rombos-LLM-V2.5-Qwen-14b

image/jpeg

Rombos-LLM-V2.6-Qwen-14b is the upgraded version of "rombodawg/Rombos-LLM-V2.5-Qwen-14b". The magic I performed to make this model better than it already was is only known to the Deepest state, dankest memers and God himself, so dont ask 😉. But it does perform a decent bit better than version 2.5 from my hand testing. Benchmarks will come later.

Check out the Continuous Finetuning method that I apply to all my models bellow:

Quants:

Benchmarks: (Coming soon)

Downloads last month
134
GGUF
Model size
14.8B params
Architecture
qwen2

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for QuantFactory/Rombos-LLM-V2.6-Qwen-14b-GGUF

Base model

Qwen/Qwen2.5-14B
Quantized
(75)
this model