Edit model card

SHRDFU-7b β

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: cgato/Thespis-CurtainCall-7b-v0.3
  • Methodology: Targeting attention layers with peft to condition; then small full layer tuning; extending intelligence and problem solving w/ crabcanon

As I work on understanding how to layer information in to the model, I'm having a tough time with this one without making it crazy. Maybe crazy was the way to go?

Downloads last month
1,744
Safetensors
Model size
7.24B params
Tensor type
BF16
·

Finetuned from

Dataset used to train maldv/SHRDFU-7b-beta