---
license: apache-2.0
language:
  - en
tags:
  - nsfw
  - not-for-all-audiences
  - roleplay
---

# InfinityKumon-2x7B

GGUF - Imatrix quants of InfinityKumon-2x7B are available.

Another MoE merge, this time of Endevor/InfinityRP-v1-7B and grimjim/kukulemon-7B.

The reason? I like InfinityRP-v1-7B so much that I wondered if I could improve it even further by merging two great models into a MoE.
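If you want to build a similar merge yourself, 2x7B MoE merges like this one are commonly produced with mergekit's `mergekit-moe` script. The config below is only an illustrative sketch, not the actual recipe used for this model; the `gate_mode`, `dtype`, and `positive_prompts` values are my assumptions.

```yaml
# Hypothetical mergekit-moe config -- NOT the author's actual settings.
base_model: Endevor/InfinityRP-v1-7B
gate_mode: hidden        # route tokens using hidden-state representations
dtype: bfloat16
experts:
  - source_model: Endevor/InfinityRP-v1-7B
    positive_prompts:
      - "roleplay"
  - source_model: grimjim/kukulemon-7B
    positive_prompts:
      - "creative writing"
```

With mergekit installed, a config like this is run as `mergekit-moe config.yaml ./InfinityKumon-2x7B`.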

## Perplexity

Measured with llama.cpp's perplexity tool on a private roleplay dataset.

| Format | PPL |
|--------|-----|
| FP16   | 3.1748 +/- 0.11928 |
| Q8_0   | 3.1734 +/- 0.11935 |
| Q6_K   | 3.1752 +/- 0.11899 |
| Q5_K_M | 3.1731 +/- 0.11892 |
| IQ4_NL | 3.1752 +/- 0.11943 |
| IQ3_M  | 3.1773 +/- 0.11528 |
| Q2_K   | 3.2309 +/- 0.11996 |

Based on the PPL, I don't really recommend using Q2_K; the other quants are fine.
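For quick local testing, here is a minimal sketch of loading one of the recommended quants with llama-cpp-python. The GGUF filename is a hypothetical placeholder (use whichever quant you downloaded), and the prompt follows the Alpaca format described below.

```python
# pip install llama-cpp-python
from llama_cpp import Llama

llm = Llama(
    model_path="InfinityKumon-2x7B.Q5_K_M.gguf",  # placeholder filename
    n_ctx=4096,        # context window
    n_gpu_layers=-1,   # offload all layers to GPU if available
)

# Alpaca-style prompt (see "Prompt format" below)
prompt = (
    "### Instruction:\n"
    "Write a short greeting in character as a cheerful innkeeper.\n\n"
    "### Response:\n"
)

out = llm(prompt, max_tokens=128, stop=["### Instruction:"])
print(out["choices"][0]["text"])
```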

## Prompt format

Alpaca or ChatML.
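For reference, these are the standard templates for those two formats (generic templates, not something specific to this model):

Alpaca:

```
### Instruction:
{prompt}

### Response:
```

ChatML:

```
<|im_start|>system
{system prompt}<|im_end|>
<|im_start|>user
{prompt}<|im_end|>
<|im_start|>assistant
```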

Switch: FP16 - GGUF