GGUF
Not-For-All-Audiences
nsfw
Inference Endpoints
conversational

Lumimaid 0.2

8b - [12b] - 70b - 123b

This model is based on: Mistral-Nemo-Instruct-2407

Wandb: https://wandb.ai/undis95/Lumi-Mistral-Nemo?nw=nwuserundis95

NOTE: As explained in the Mistral-Nemo-Instruct-2407 repo, a low temperature is recommended. Please experiment!

Lumimaid 0.1 -> 0.2 is a HUGE step up dataset-wise.

As some people have told us our models are sloppy, Ikari decided to say fuck it and literally nuke all the chats with the most slop.

Our dataset has stayed the same since day one: we added data over time, cleaned it, and repeated. After not releasing a model for a while because we were never satisfied, we think it's time to come back!

Prompt template: Mistral

<s>[INST] {input} [/INST] {output}</s>
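
To make the template above concrete, here is a minimal sketch of how a prompt string could be assembled from it. The function name `build_prompt` and the way earlier turns are chained are assumptions for illustration, not something this card specifies. Only the single-turn layout comes from the template line itself.

```python
# Minimal sketch of the Mistral prompt template shown above.
# build_prompt is a hypothetical helper, not part of any library.

BOS = "<s>"   # beginning-of-sequence marker from the template
EOS = "</s>"  # end-of-sequence marker closing each finished reply


def build_prompt(history, user_input):
    """Return a prompt string ready for completion.

    history: list of (user_message, model_reply) pairs already exchanged.
    user_input: the new user message the model should answer.
    Assumption: earlier turns are simply repeated in the same
    "[INST] ... [/INST] reply</s>" shape before the new instruction.
    """
    parts = [BOS]
    for past_input, past_output in history:
        parts.append(f"[INST] {past_input} [/INST] {past_output}{EOS}")
    # The final instruction is left open so the model writes {output}.
    parts.append(f"[INST] {user_input} [/INST]")
    return "".join(parts)


if __name__ == "__main__":
    print(build_prompt([("Hi", "Hello!")], "Tell me a story."))
```

Some inference frontends insert the leading `<s>` on their own, so you may need to drop `BOS` from the string to avoid a doubled token. Check how your runtime tokenizes the prompt.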

Credits:

  • Undi
  • IkariDev

Training data we used to make our dataset:

Sadly, we couldn't find the sources of the following. DM us if you recognize your set!

  • Opus_Instruct-v2-6.5K-Filtered-v2-sharegpt
  • claude_sharegpt_trimmed
  • CapybaraPure_Decontaminated-ShareGPT_reduced

Datasets credits:

  • Epiculous
  • ChaoticNeutrals
  • Gryphe
  • meseca
  • PJMixers
  • NobodyExistsOnTheInternet
  • cgato
  • kalomaze
  • Doctor-Shotgun
  • Norquinal
  • nothingiisreal

Others

Undi: If you want to support us, you can here.

IkariDev: Visit my retro/neocities style website please kek

GGUF

Model size: 12.2B params
Architecture: llama
Quantizations: 4-bit, 5-bit, 6-bit, 8-bit
