mistral-box / README.md
---
language:
- en
license: apache-2.0
tags:
- text-generation-inference
- transformers
- unsloth
- mistral
- trl
base_model: unsloth/mistral-7b-bnb-4bit
---
"""
- an attempt at making a working "program" inside an LLM through fine-tuning. kinda works, kinda doesn't
- commands are:
- start/start box/box
- up/down/left/right
- here's an example convo:
- user: "start box"
- ai: "```
- 12345
- _____
- 1|X....|
- 2|.....|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```
- start"
- user: "right"
- ai: "right
- ```
- 12345
- _____
- 1|.X...|
- 2|.....|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```"
- user: "down"
- ai: "down
- ```
- 12345
- _____
- 1|.....|
- 2|.X...|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```"
- user: "right"
- ai: "right
- ```
- 12345
- _____
- 1|.....|
- 2|..X..|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```"
- user: "up"
- ai: "up
- ```
- 12345
- _____
- 1|..X..|
- 2|.....|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```"
- user: "left"
- ai: "left
- error: unable to move left from this position
- ```
- 12345
- _____
- 1|..X..|
- 2|.....|
- 3|.....|
- 4|.....|
- 5|.....|
- _____
- ```"
-
- after this it stopped working (X was in column 3, so that "left" should have been allowed), but you get the gist. imagine playing doom through an llm lol
-
"""
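
For comparison, here is a minimal sketch of the "program" the model is trying to imitate, written as plain Python. It is not the training code or anything shipped with this model: the names `render` and `step`, the reply wording for unknown commands, and the exact grid whitespace are all assumptions based only on the example convo above.

```python
# Hypothetical reference simulator for the box "program" (not part of the model).
SIZE = 5  # the example convo shows a 5x5 grid

# (row delta, column delta) for each move command listed above
MOVES = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}
START_COMMANDS = {"start", "start box", "box"}


def render(row, col):
    """Draw the grid with X at (row, col), both 0-based."""
    header = "".join(str(i + 1) for i in range(SIZE))
    lines = [header, "_" * SIZE]
    for r in range(SIZE):
        cells = "".join("X" if (r, c) == (row, col) else "." for c in range(SIZE))
        lines.append(f"{r + 1}|{cells}|")
    lines.append("_" * SIZE)
    return "\n".join(lines)


def step(pos, command):
    """Apply one command and return (new_pos, reply_text)."""
    command = command.strip().lower()
    if command in START_COMMANDS:
        pos = (0, 0)  # X starts in the top-left corner, as in the example
        return pos, f"```\n{render(*pos)}\n```\nstart"
    if command not in MOVES:
        # Assumption: the source does not show a reply for unknown commands.
        return pos, f"error: unknown command {command!r}"
    dr, dc = MOVES[command]
    r, c = pos[0] + dr, pos[1] + dc
    if 0 <= r < SIZE and 0 <= c < SIZE:
        pos = (r, c)
        return pos, f"{command}\n```\n{render(*pos)}\n```"
    # Out of bounds: keep the position and report an error in the style shown above.
    error = f"error: unable to move {command} from this position"
    return pos, f"{command}\n{error}\n```\n{render(*pos)}\n```"


if __name__ == "__main__":
    pos = (0, 0)
    for cmd in ["start box", "right", "down", "right", "up", "left"]:
        pos, reply = step(pos, cmd)
        print(f'user: "{cmd}"\nai: "{reply}"\n')
```

Replaying the same six commands through this sketch, the final "left" moves X to column 2 instead of returning an error, which is a quick way to spot where the fine-tuned model drifts from the intended rules.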
# Uploaded model
- **Developed by:** liminerity
- **License:** apache-2.0
- **Finetuned from model :** unsloth/mistral-7b-bnb-4bit
This Mistral model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Hugging Face's TRL library.
[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)