mpasila
/

Llama-3-Instruct-LiPPA-LoRA-8B

text-generation-inference

Not-For-All-Audiences

Model card Files Files and versions Community

mpasila commited on May 18

Commit

89b79d6

•

1 Parent(s): f9c7b10

Update README.md

Files changed (1) hide show

README.md +16 -3

README.md CHANGED Viewed

@@ -1,7 +1,7 @@
 ---
 language:
 - en
-license: apache-2.0
 tags:
 - text-generation-inference
 - transformers
@@ -9,14 +9,27 @@ tags:
 - llama
 - trl
 base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
 ---
 # Uploaded  model
 - **Developed by:** mpasila
-- **License:** apache-2.0
 - **Finetuned from model :** unsloth/llama-3-8b-Instruct-bnb-4bit
 This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)

 ---
 language:
 - en
+license: llama3
 tags:
 - text-generation-inference
 - transformers
 - llama
 - trl
 base_model: unsloth/llama-3-8b-Instruct-bnb-4bit
+datasets:
+- mpasila/LimaRP-PIPPA-Mix-8K-Context
+- grimulkan/LimaRP-augmented
+- KaraKaraWitch/PIPPA-ShareGPT-formatted
 ---
+LoRA trained in 4-bit with 8k context using [meta-llama/Meta-Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct/) as the base model for 1 epoch.
+Dataset used is [mpasila/LimaRP-PIPPA-Mix-8K-Context](https://huggingface.co/datasets/mpasila/LimaRP-PIPPA-Mix-8K-Context) which was made using [grimulkan/LimaRP-augmented](https://huggingface.co/datasets/grimulkan/LimaRP-augmented) and [KaraKaraWitch/PIPPA-ShareGPT-formatted](https://huggingface.co/datasets/KaraKaraWitch/PIPPA-ShareGPT-formatted).
+This has been trained on the instruct model and not the base model. The model trained with the base model using the same dataset is here: [mpasila/Llama-3-LiPPA-LoRA-8B](https://huggingface.co/mpasila/Llama-3-LiPPA-LoRA-8B)
+### Prompt format: Llama 3 Instruct
+Unsloth changed assistant to gpt and user to human.
 # Uploaded  model
 - **Developed by:** mpasila
+- **License:** Llama 3 Community License
 - **Finetuned from model :** unsloth/llama-3-8b-Instruct-bnb-4bit
 This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
+[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)