mpasila committed on
Commit 6214aeb
1 Parent(s): cdc0e8c

Update README.md

Files changed (1): README.md +12 -1
README.md CHANGED
@@ -8,8 +8,19 @@ tags:
 - unsloth
 - llama
 - trl
+- not-for-all-audiences
 base_model: unsloth/llama-3-8b-bnb-4bit
+datasets:
+- grimulkan/LimaRP-augmented
+- mpasila/LimaRP-augmented-8k-context
 ---
+This was made using the Llama 3 Instruct prompt formatting so that it should be easier to merge with other models using that format.
+
+LoRA trained in 4-bit with 8k context using [meta-llama/Meta-Llama-3-8B](https://huggingface.co/meta-llama/Meta-Llama-3-8B/) as the base model for 1 epoch.
+
+Dataset used is [a modified](https://huggingface.co/datasets/mpasila/LimaRP-augmented-8k-context) version of [grimulkan/LimaRP-augmented](https://huggingface.co/datasets/grimulkan/LimaRP-augmented).
+
+### Prompt format: Llama 3 Instruct
 
 # Uploaded model
 
@@ -19,4 +30,4 @@ base_model: unsloth/llama-3-8b-bnb-4bit
 
 This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
 
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
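
The updated card names "Llama 3 Instruct" as the prompt format but does not show the template itself. As a minimal sketch, this assumes Meta's published Llama 3 Instruct special tokens (`<|begin_of_text|>`, `<|start_header_id|>`, `<|eot_id|>`); the helper name and its single-turn shape are illustrative, not taken from this repo's tokenizer config.

```python
def format_llama3_instruct(system: str, user: str) -> str:
    """Build a single-turn prompt in the (assumed) Llama 3 Instruct template.

    The special tokens below follow Meta's published format; an actual
    tokenizer's chat template should be preferred when available.
    """
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        # The prompt ends with an open assistant header so the model
        # generates the assistant turn next.
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )


prompt = format_llama3_instruct(
    "You are a roleplay assistant.",
    "Hello!",
)
print(prompt)
```

In practice, `tokenizer.apply_chat_template(...)` on a Llama 3 Instruct tokenizer produces this layout automatically; the sketch only makes the structure visible.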