QuantFactory/Rombos-LLM-V2.6-Qwen-14b-GGUF

This is quantized version of rombodawg/Rombos-LLM-V2.6-Qwen-14b created using llama.cpp

Original Model Card

Rombos-LLM-V2.5-Qwen-14b

Rombos-LLM-V2.6-Qwen-14b is the upgraded version of "rombodawg/Rombos-LLM-V2.5-Qwen-14b". The magic I performed to make this model better than it already was is only known to the Deepest state, dankest memers and God himself, so dont ask 😉. But it does perform a decent bit better than version 2.5 from my hand testing. Benchmarks will come later.

Check out the Continuous Finetuning method that I apply to all my models bellow:

https://docs.google.com/document/d/1OjbjU5AOz4Ftn9xHQrX3oFQGhQ6RDUuXQipnQ9gn6tU/edit?usp=sharing

Quants:

https://huggingface.co/rombodawg/Rombos-LLM-V2.6-Qwen-14b-Q8_0-GGUF
https://huggingface.co/rombodawg/Rombos-LLM-V2.6-Qwen-14b-Q5_K_M-GGUF

Benchmarks: (Coming soon)

QuantFactory
/

Rombos-LLM-V2.6-Qwen-14b-GGUF

QuantFactory/Rombos-LLM-V2.6-Qwen-14b-GGUF

Original Model Card

Rombos-LLM-V2.5-Qwen-14b

Model tree for QuantFactory/Rombos-LLM-V2.6-Qwen-14b-GGUF