
SHRDFU-7b Γ

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: ammarali32/multi_verse_model
  • Methodology: Targeting the attention layers with PEFT to condition the model, then a small full-layer tuning pass, extending intelligence and problem solving with crabcanon
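
The first step of the methodology (targeting attention layers) can be sketched roughly as below. This is only an illustration, not the actual training setup: it assumes the base model uses Mistral/Llama-style module names (`q_proj`, `k_proj`, `v_proj`, `o_proj` under `self_attn`), and `example_module_names` and `select_attention_modules` are hypothetical helpers made up for this sketch.

```python
# Sketch: pick out attention projection modules by name.
# ASSUMPTION: Mistral/Llama-style naming; not confirmed for this model.
ATTENTION_SUFFIXES = ("q_proj", "k_proj", "v_proj", "o_proj")
MLP_SUFFIXES = ("gate_proj", "up_proj", "down_proj")


def example_module_names(num_layers=2):
    """Build a hypothetical list of module names for a small decoder stack."""
    names = []
    for i in range(num_layers):
        prefix = f"model.layers.{i}"
        for proj in ATTENTION_SUFFIXES:
            names.append(f"{prefix}.self_attn.{proj}")
        for proj in MLP_SUFFIXES:
            names.append(f"{prefix}.mlp.{proj}")
    return names


def select_attention_modules(module_names, suffixes=ATTENTION_SUFFIXES):
    """Keep only the modules whose last name component is an attention projection."""
    return [name for name in module_names if name.split(".")[-1] in suffixes]


if __name__ == "__main__":
    for name in select_attention_modules(example_module_names()):
        print(name)
```

With a real model, the names would come from `model.named_modules()`, and the suffix tuple could be passed as `target_modules` to `peft.LoraConfig`. The rank, alpha, and other hyperparameters used for this model are not stated here.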

As I work on understanding how to layer information into the model, this dataset has some good parts and some bad. I think I will run one or two more experiments and then move on.

  • Format: Safetensors
  • Model size: 7.24B params
  • Tensor type: BF16


Dataset used to train maldv/SHRDFU-7b-gamma