---
pipeline_tag: text-generation
---

List of small RP models (12 GB VRAM range). No benchmarks, only vibes. Updated Nov 2024.

## base models

- Llama3 (8B)
- Mistral Nemo (12B)
- **Qwen 2.5 (14B)** - a contender
- **Mistral Small (22B)** - a winner?

## instruct and roleplay finetunes

- [Sao10K/L3-8B-Stheno-v3.2](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2)
- [Sao10K/Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
- [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) - **base is good**
- [nothingiisreal/MN-12B-Celeste-V1.9](https://huggingface.co/nothingiisreal/MN-12B-Celeste-V1.9)
- [anthracite-org/magnum-12b-v2](https://huggingface.co/anthracite-org/magnum-12b-v2)
- [NeverSleep/Lumimaid-v0.2-8B](https://huggingface.co/NeverSleep/Lumimaid-v0.2-8B)
- [nbeerbower/mistral-nemo-gutenberg-12B-v2](https://huggingface.co/nbeerbower/mistral-nemo-gutenberg-12B-v2)

## 🧟‍♂️ merges

- [MarinaraSpaghetti/NemoMix-Unleashed-12B](https://huggingface.co/MarinaraSpaghetti/NemoMix-Unleashed-12B) - **personal favorite**
- [Sao10K/L3-8B-Lunaris-v1](https://huggingface.co/Sao10K/L3-8B-Lunaris-v1)
- [aetherwiing/MN-12B-Starcannon-v3](https://huggingface.co/aetherwiing/MN-12B-Starcannon-v3)

### misc. links

- [llama.cpp](https://github.com/ggerganov/llama.cpp) and [llama-cpp-python](https://github.com/abetlen/llama-cpp-python) - **preferred LLM software**
- [/r/localllama](https://www.reddit.com/r/LocalLLaMA/)
- [/lmg/](https://boards.4chan.org/search#/lmg/g)
- [LMSys Chatbot Arena Leaderboard](https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard)
- [Uncensored General Intelligence Leaderboard](https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard)
- [/r/SillyTavernAI](https://www.reddit.com/r/SillyTavernAI/)
- NothingiisReal discord
- NeverSleep discord
- SillyTavern discord
- BeaverAI discord
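
For the "12 GB VRAM range" this list targets, here is a rough back-of-the-envelope sketch of which base model sizes might fit at a given quantization level. It only counts the weights (parameters × bits per weight ÷ 8). The 2 GB of reserved headroom, the bits-per-weight values, and the function names are all assumptions made for illustration, not measurements. Real usage depends on the quant format, context length, and backend.

```python
# Rough VRAM check for the base model sizes in this list.
# Every number here is an illustrative assumption, not a measurement.

VRAM_BUDGET_GB = 12.0  # the "12 GB VRAM range" this list targets
RESERVED_GB = 2.0      # assumed headroom for KV cache, activations, etc.

# Nominal parameter counts in billions, taken from the names in the list.
MODELS = {
    "Llama3 (8B)": 8,
    "Mistral Nemo (12B)": 12,
    "Qwen 2.5 (14B)": 14,
    "Mistral Small (22B)": 22,
}


def weight_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone: params * bits / 8 bytes."""
    # Billions of parameters * bytes per parameter ~= gigabytes.
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         budget_gb: float = VRAM_BUDGET_GB,
         reserved_gb: float = RESERVED_GB) -> bool:
    """True if the weights fit in the budget minus the reserved headroom."""
    return weight_size_gb(params_billion, bits_per_weight) <= budget_gb - reserved_gb


if __name__ == "__main__":
    # Hypothetical average bits per weight for a few quantization levels.
    for bpw in (8.0, 5.0, 4.0, 3.0):
        print(f"--- ~{bpw} bits per weight ---")
        for name, size in MODELS.items():
            gb = weight_size_gb(size, bpw)
            verdict = "fits" if fits(size, bpw) else "too big"
            print(f"{name:22s} ~{gb:5.2f} GB weights -> {verdict}")
```

Under these assumptions the 8B and 12B models leave plenty of room at around 4 to 5 bits per weight, while the 22B model only squeezes in at a lower-bit quant. Treat the output as a first guess before downloading, not as a guarantee.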