---
pipeline_tag: text-generation
---

List of small RP models (12 gb vram range). No benchmarks, only vibes. Updated nov 2024.

## base models

- Llama3 (8B)
- Mistral Nemo (12B)
- **Qwen 2.5 (14B)** - a contender
- **Mistral Small (22B)** - a winner?

## instruct and roleplay finetunes

- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
- [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
- [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) - **base is good**
- [nothingiisreal/MN-12B-Celeste-V1.9](https://huggingface.co/nothingiisreal/MN-12B-Celeste-V1.9)
- [anthracite-org/magnum-12b-v2](https://huggingface.co/anthracite-org/magnum-12b-v2)
- [NeverSleep/Lumimaid-v0.2-8B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-8B)
- [nbeerbower/mistral-nemo-gutenberg-12B-v2](https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v2)

## 🧟‍♂️ merges

- [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B) - **personal favorite**
- [Sao10K/L3-8B-Lunaris-v1](https://huggingface.co/Sao10K/L3-8B-Lunaris-v1)
- [aetherwiing/MN-12B-Starcannon-v3](https://huggingface.co/aetherwiing/MN-12B-Starcannon-v3)

### misc. links

- [llama.cpp](https://github.com/ggerganov/llama.cpp) and [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - **preferred LLM software**
- [/r/localllama](https://www.reddit.com/r/LocalLLaMA/)
- [/lmg/](https://boards.4chan.org/search#/lmg/g)
- [LMSys Chatbot Arena Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)
- [Uncensored General Intelligence Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard)
- [/r/SillyTavernAI](https://www.reddit.com/r/SillyTavernAI/)
- NothingiisReal discord
- NeverSleep discord
- SillyTavern discord
- BeaverAI discord