monsterapi
/

mistral_7b_norobots

Model card Files Files and versions Community

souvik0306 commited on Nov 22, 2023

Commit

624be22

•

1 Parent(s): a9ac195

Update README.md

Files changed (1) hide show

README.md +41 -1

README.md CHANGED Viewed

@@ -8,4 +8,44 @@ datasets:
 - HuggingFaceH4/no_robots
 base_model: mistralai/Mistral-7B-v0.1
 license: apache-2.0
----

 - HuggingFaceH4/no_robots
 base_model: mistralai/Mistral-7B-v0.1
 license: apache-2.0
+---
+### Finetuning Overview:
+**Model Used:** mistralai/Mistral-7B-v0.1
+**Dataset:** HuggingFaceH4/no_robots
+#### Dataset Insights:
+[No Robots](https://huggingface.co/datasets/HuggingFaceH4/no_robots) is a high-quality dataset of 10,000 instructions and demonstrations created by skilled human annotators. This data can be used for supervised fine-tuning (SFT) to make language models follow instructions better.
+#### Finetuning Details:
+With the utilization of [MonsterAPI](https://monsterapi.ai)'s [LLM finetuner](https://docs.monsterapi.ai/fine-tune-a-large-language-model-llm), this finetuning:
+- Was achieved with great cost-effectiveness.
+- Completed in a total duration of 36mins 27secs for 1 epoch using an A6000 48GB GPU.
+- Costed `$1.212` for the entire epoch.
+#### Hyperparameters & Additional Details:
+- **Epochs:** 2
+- **Cost Per Epoch:** $1.212
+- **Total Finetuning Cost:** $1.212
+- **Model Path:** mistralai/Mistral-7B-v0.1
+- **Learning Rate:** 0.0002
+- **Data Split:** 100% train
+- **Gradient Accumulation Steps:** 4
+- **lora r:** 32
+- **lora alpha:** 64
+#### Prompt Structure
+```
+<|system|> </s> <|user|> [USER PROMPT] </s> <|assistant|> [ASSISTANT ANSWER] </s>
+```
+#### Train loss :
+![eval loss](https://cdn-uploads.huggingface.co/production/uploads/63ba46aa0a9866b28cb19a14/WDbw92-Vmuc7QttRHvJU6.png)
+license: apache-2.0