Update README.md
README.md CHANGED
@@ -81,9 +81,13 @@ your desired response length:
 
 ## Training procedure
 [Axolotl](https://github.com/OpenAccess-AI-Collective/axolotl) was used for training
-on a single NVidia RTX3090 GPU. The model has been trained as a 4-bit LoRA adapter,
-
-
+on a single NVidia RTX3090 GPU. The model has been trained as a 4-bit LoRA adapter, and
+it's so large because a LoRA rank of 256 was also used. The reasoning was that this
+might have helped the model internalize any newly acquired information, making the
+training process closer to a full finetune.
+
+It's suggested to merge the adapter to the base Llama2-7B model (or other Llama2-based
+models).
 
 ### Training hyperparameters
 For the first pass these settings were used:
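The changed paragraph above only states that the adapter is 4-bit with a LoRA rank of 256; alpha, dropout, and target modules are not shown in this hunk. As a rough illustration (not the actual Axolotl config used for training), a comparable setup expressed directly with transformers and PEFT might look like this, with the unstated values left as clearly marked placeholders:

```python
# Illustration only: a 4-bit, rank-256 LoRA configuration comparable to the
# one described in the README (training itself was done through Axolotl).
import torch
from transformers import BitsAndBytesConfig
from peft import LoraConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,                     # 4-bit base weights, as stated above
    bnb_4bit_quant_type="nf4",             # assumed quantization type
    bnb_4bit_compute_dtype=torch.float16,  # assumed compute dtype
)

lora_config = LoraConfig(
    r=256,                                 # the large rank mentioned in the diff
    lora_alpha=128,                        # placeholder: not stated in this hunk
    lora_dropout=0.05,                     # placeholder: not stated in this hunk
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],  # placeholder set
    task_type="CAUSAL_LM",
)
```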
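Since the README suggests merging the adapter into the Llama2-7B base (or another Llama2-based model), a minimal sketch of that step with the PEFT API could look like the following; the adapter path is a placeholder for wherever this adapter is stored:

```python
# Minimal sketch: merge the LoRA adapter into the Llama2-7B base weights.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",       # or another Llama2-based model
    torch_dtype=torch.float16,
)
model = PeftModel.from_pretrained(base, "path/to/this-adapter")  # placeholder path

merged = model.merge_and_unload()     # folds the LoRA weights into the base model
merged.save_pretrained("llama2-7b-merged")

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-hf")
tokenizer.save_pretrained("llama2-7b-merged")
```

After merging, the result loads like a plain Llama2-7B checkpoint, with no adapter indirection at inference time.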