mistral-box / README.md
gate369's picture
Update README.md
2d09e39 verified
|
raw
history blame
No virus
1.73 kB
metadata
language:
  - en
license: apache-2.0
tags:
  - text-generation-inference
  - transformers
  - unsloth
  - mistral
  - trl
base_model: unsloth/mistral-7b-bnb-4bit

""" an attempt at making a working "program" inside an llm thru fine tuning. kinda works kinda doesn't commands are: start/start box/box up/down/left/right heres an example convo: user: "start box" ai: " 12345 _____ 1|X....| 2|.....| 3|.....| 4|.....| 5|.....| _____ start" user: "right" ai: "right 12345 _____ 1|.X...| 2|.....| 3|.....| 4|.....| 5|.....| _____ " user: "down" ai: "down 12345 _____ 1|.....| 2|.X...| 3|.....| 4|.....| 5|.....| _____ " user: "right" ai: "right 12345 _____ 1|.....| 2|..X..| 3|.....| 4|.....| 5|.....| _____ " user: "up" ai: "up 12345 _____ 1|..X..| 2|.....| 3|.....| 4|.....| 5|.....| _____ " user: "left" ai: "left error: unable to move left from this position 12345 _____ 1|..X..| 2|.....| 3|.....| 4|.....| 5|.....| _____ "

after this it stopped working but you get the gist. imagine playing doom thrun an llm lol


Uploaded model

  • Developed by: liminerity
  • License: apache-2.0
  • Finetuned from model : unsloth/mistral-7b-bnb-4bit

This mistral model was trained 2x faster with Unsloth and Huggingface's TRL library.