maldv/SHRDFU-7b-beta

Text Generation

text-generation-inference

SHRDFU-7b β

Developed by: maldv
License: cc-by-nc-4.0
Finetuned from model: cgato/Thespis-CurtainCall-7b-v0.3
Methodology: Conditioning the model by targeting its attention layers with PEFT, followed by a small amount of full-layer tuning; extending intelligence and problem solving with crabcanon
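
The card does not say which PEFT method, rank, or module names were used, so the following is only a minimal sketch of what "targeting attention layers" could look like. It assumes LoRA and the `q_proj`/`k_proj`/`v_proj`/`o_proj` naming used by Llama/Mistral-style decoders in transformers; the helper names and all hyperparameter values are hypothetical.

```python
# Hypothetical sketch: the card says only "targeting attention layers with peft".
# LoRA, the rank/alpha/dropout values, and the module names are assumptions,
# not the author's actual settings.

# Attention projection names used by Llama/Mistral-style decoders in transformers.
ATTENTION_PROJECTIONS = ("q_proj", "k_proj", "v_proj", "o_proj")


def select_attention_modules(module_names):
    """Return the module names whose last path component is an attention projection."""
    return [n for n in module_names if n.rsplit(".", 1)[-1] in ATTENTION_PROJECTIONS]


def attention_lora_kwargs(rank=16, alpha=32, dropout=0.05):
    """Build keyword arguments that could be passed to peft.LoraConfig(**kwargs)."""
    return {
        "r": rank,
        "lora_alpha": alpha,
        "lora_dropout": dropout,
        "target_modules": list(ATTENTION_PROJECTIONS),  # MLP layers are left untouched
        "bias": "none",
        "task_type": "CAUSAL_LM",
    }


if __name__ == "__main__":
    names = [
        "model.layers.0.self_attn.q_proj",
        "model.layers.0.self_attn.o_proj",
        "model.layers.0.mlp.gate_proj",
    ]
    print(select_attention_modules(names))
    print(attention_lora_kwargs()["target_modules"])
```

With peft installed, the dictionary could be used as `LoraConfig(**attention_lora_kwargs())` and the result passed to `get_peft_model(model, config)`, so that only the attention projections receive adapters.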

As I work on understanding how to layer information into the model, I'm having a tough time doing that with this one without making it crazy. Maybe crazy was the way to go?

Downloads last month: 6

Format: Safetensors
Model size: 7.24B params
Tensor type: BF16
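
Since the weights are published in BF16, a loading sketch along the following lines may be useful. It assumes the standard transformers `AutoTokenizer`/`AutoModelForCausalLM` workflow applies to this repository; the helper names `load_model` and `generate` are hypothetical, and the card specifies no prompt format, so the prompt is passed through as plain text.

```python
# Hypothetical sketch: loads the checkpoint in its published BF16 precision.
# Assumes torch and transformers are installed; they are imported lazily so
# that merely defining these helpers does not require them.

MODEL_ID = "maldv/SHRDFU-7b-beta"


def load_model(model_id=MODEL_ID):
    """Download (if needed) and load the tokenizer and BF16 model weights."""
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)
    return tokenizer, model


def generate(tokenizer, model, prompt, max_new_tokens=64):
    """Generate a continuation of `prompt` and return it as decoded text."""
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `tokenizer, model = load_model()` followed by `generate(tokenizer, model, "Once upon a time")` would run a single generation; at 7.24B parameters in BF16 the weights alone need roughly 14.5 GB of memory.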
