Not-For-All-Audiences
nsfw
Edit model card

Exl2 version of Undi95/FlatDolphinMaid-8x7B

branch

3.5bh8 : 3.5bpw h8

Using ThePile 0007.parquet as dataset

Quantization settings : python convert.py -i models/Undi95_FlatDolphinMaid-8x7B -o FlatDolphinMaid-8x7B-temp -cf FlatDolphinMaid-8x7B-3.5bpw-h8-exl2 -c 0007.parquet -l 8192 -b 3.5 -hb 8 -m FlatDolphinMaid-8x7B-measurement.json -ml 8192

below this line is original readme

First experimental merge of Noromaid 8x7b (Instruct) and dolphin 8x7b. The idea behind this is to add a little more IQ to the model, because Noromaid was only trained on RP/ERP data. Dolphin 2.7 is the only real Mixtral finetune I consider "usable", and so the merging quest begin again kek.

Merged Dolphin 2.7 with Mixtral Base (Dolphin was at 1.0 weight) to get rid of ChatLM, and then I merged Noromaid 8x7b with the output, SLERP method.

This model feel better on the IQ chart and have the ~same average ERP score on ayumi bench' than Noromaid 8x7b, but it's softer and more prude too, it also have the typical Mixtral repeat issue at some point. Choose your poison.

image/png

Description

This repo contains fp16 files of FlatDolphinMaid-8x7B.

Models used

Custom format:

### Instruction:
{system prompt}

### Input:
{input}

### Response:
{reply}

If you want to support me, you can here.

Downloads last month
0
Unable to determine this model's library. Check the docs .