QuantFactory/EVA-Yi-1.5-9B-32K-V1-GGUF

This is quantized version of EVA-UNIT-01/EVA-Yi-1.5-9B-32K-V1 created using llama.cpp

Original Model Card

EVA Yi 1.5 9B v1

A RP/storywriting focused model, full-parameter finetune of Yi-1.5-9B-32K on mixture of synthetic and natural data.
A continuation of nothingiisreal's Celeste 1.x series, made to improve stability and versatility, without losing unique, diverse writing style of Celeste.

Quants: (GGUF is not recommended, lcpp breaks tokenizer fix)

We recommend using original BFloat16 weights, quantization seems to affect Yi significantly more than other model architectures.

Prompt format is ChatML.

Recommended sampler values:

Temperature: 1
Min-P: 0.05

Recommended SillyTavern presets (via CalamitousFelicitousness):

Training data:

Celeste 70B 0.1 data mixture minus Opus Instruct subset. See that model's card for details.
Kalomaze's Opus_Instruct_25k dataset, filtered for refusals.

Hardware used:

4x3090Ti for 5 days.

Model was trained by Kearm and Auri.

Special thanks:

to Lemmy, Gryphe, Kalomaze and Nopm for the data
to ALK, Fizz and CalamitousFelicitousness for Yi tokenizer fix
and to InfermaticAI's community for their continued support for our endeavors

QuantFactory
/

EVA-Yi-1.5-9B-32K-V1-GGUF