Edit model card

QuantFactory/Gemmasutra-Mini-2B-v1-GGUF

This is quantized version of TheDrummer/Gemmasutra-Mini-2B-v1 created using llama.cpp

Original Model Card

Join our Discord! https://discord.gg/Nbv9pQ88Xb

Works on Kobold 1.72 and Layla (iOS / Android)


image/png

BeaverAI team proudly presents

Gemmasutra Mini 2B v1 🧘

Gemma's been training for ALL of you.

image/webp

A tiny RP model packing an unbelievable PUNCH. Finetuned by yours truly.

Description

Gone are the days when models below 7B were too small to give you a satisfying RP experience.

Gemmasutra Mini 2B v1 marks the beginning of a new chapter in our local LLM community.

Be it in your browser, your crappy laptop, your mid-tier phone running Layla, or your Raspberry Pi...

This lil 2B will give you a playthrough worth having.

A model for thee, a model for all!

(And yes, it is uncensored and unaligned. Enjoy!)

Links

Usage

  • For the best experience, use the Gemma Instruct template and modify it to support the system role (e.g., <start_of_turn>system)
  • Chat Completion works well too
  • Don't use it for Math

Examples

A big thanks to @kurgan1138 for most of the logs!

SFW

image/png

image/png

image/png

image/png

4K context

image/png

image/png

NSFW NSFW NSFW NSFW NSFW

NSFW NSFW NSFW NSFW NSFW

NSFW NSFW NSFW NSFW NSFW

image/png

image/png

image/png

Forgiveness Meter

image/png

Group Character

image/png

image/png

image/png

image/png

image/png

Downloads last month
74
GGUF

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model's library. Check the docs .