jordiclive/gpt4all-alpaca-oa-codealpaca-lora-7b

jordiclive committed on Apr 4, 2023

Commit d9791fd • 1 Parent(s): d82458f

Create README.md

Files changed (1)

README.md +16 -0

README.md ADDED

	@@ -0,0 +1,16 @@

+---
+license: mit
+---
+This repo contains a low-rank adapter (LoRA) for LLaMA-7b, fine-tuned on `Nebulous/gpt4all_pruned`, `sahil2801/CodeAlpaca-20k`, `yahma/alpaca-cleaned`, and several datasets that are part of the OpenAssistant project.
+This version of the weights was trained with the following hyperparameters:
+- Epochs: 2
+- Batch size: 128
+- Max length: 2048
+- Learning rate: 4e-6
+- LoRA _r_: 16
+- LoRA target modules: q_proj, k_proj, v_proj, o_proj
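
To give a sense of what these settings imply, here is a minimal sketch that collects the listed hyperparameters and estimates how many adapter weights a rank-16 LoRA on the four attention projections adds. The class `LoraRun` and the function `lora_param_count` are hypothetical names used only for illustration. The hidden size (4096) and layer count (32) are assumptions taken from the published LLaMA-7B architecture, not from this README. The README does not state LoRA alpha or dropout, so they are left out.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraRun:
    """Hyperparameters as listed in the README (hypothetical container)."""
    epochs: int = 2
    batch_size: int = 128
    max_length: int = 2048
    learning_rate: float = 4e-6
    lora_r: int = 16
    target_modules: tuple = ("q_proj", "k_proj", "v_proj", "o_proj")


# Assumptions, not stated in the README: LLaMA-7B uses a hidden size of 4096
# and 32 transformer layers, and each targeted projection is a square
# hidden_size x hidden_size matrix.
HIDDEN_SIZE = 4096
NUM_LAYERS = 32


def lora_param_count(run: LoraRun,
                     hidden_size: int = HIDDEN_SIZE,
                     num_layers: int = NUM_LAYERS) -> int:
    """Count LoRA weights: each adapted matrix gets A (r x d_in) and B (d_out x r)."""
    per_module = run.lora_r * (hidden_size + hidden_size)
    return per_module * len(run.target_modules) * num_layers


run = LoraRun()
print(f"LoRA adapter parameters: {lora_param_count(run):,}")
```

Under those assumptions the adapter holds roughly 16.8 million weights, a small fraction of the roughly 7 billion in the base model, which is why the repo can ship the adapter on its own.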