Update README.md
README.md
This model is llama-3-8b-instruct from Meta (uploaded by unsloth) trained on the Replete-AI/OpenCodeInterpreterData dataset.
The Qalore method combines QLoRA training with techniques from GaLore for additional reductions in VRAM, allowing llama-3-8b to be loaded in 14.5 GB of VRAM. As a result, this training run was completed on a single RTX A4000 16GB in 130 hours for less than $20.
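To see why 14.5 GB is plausible, here is a rough back-of-envelope sketch. The parameter count, bit width, and cost figures below are illustrative assumptions (the only numbers taken from this card are the 14.5 GB, 130 hours, and $20 totals), not measurements from this repo's training run:

```python
# Back-of-envelope VRAM estimate for QLoRA-style 4-bit loading of an ~8B model.
# Assumption: 8e9 parameters at 4 bits each; real totals also include
# activations, LoRA adapters, optimizer state (shrunk via GaLore-style
# low-rank gradient projection), and CUDA overhead.

def quantized_weight_gb(n_params: float, bits: int = 4) -> float:
    """VRAM taken by the quantized base weights alone (ignores all overhead)."""
    return n_params * bits / 8 / 1024**3

base = quantized_weight_gb(8e9)
print(f"4-bit base weights: {base:.1f} GB")        # ~3.7 GB of the 14.5 GB total
print(f"implied cost/hour:  ${20 / 130:.3f}")      # <$20 over 130 hours
```

The gap between the ~3.7 GB of quantized weights and the reported 14.5 GB peak is what the QLoRA adapters, GaLore-reduced optimizer state, and activations consume.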
Dataset used for training this model:
- https://huggingface.co/datasets/Replete-AI/OpenCodeInterpreterData
Qalore notebook for training:
- https://colab.research.google.com/drive/1bX4BsjLcdNJnoAf7lGXmWOgaY8yekg8p?usp=sharing