SHRDFU-7b-beta / README.md
maldv's picture
Update README.md
069b892 verified
metadata
language:
  - en
license: cc-by-nc-4.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - GEB
base_model: cgato/Thespis-CurtainCall-7b-v0.3
datasets:
  - maldv/crabcanon

SHRDFU-7b β

  • Developed by: maldv
  • License: cc-by-nc-4.0
  • Finetuned from model: cgato/Thespis-CurtainCall-7b-v0.3
  • Methodology: Targeting attention layers with peft to condition; then small full layer tuning; extending intelligence and problem solving w/ crabcanon

As I work on understanding how to layer information in to the model, I'm having a tough time with this one without making it crazy. Maybe crazy was the way to go?