lsmille/lora_evo_ta_all_layers_13

Generated from Trainer


lsmille committed on May 29

Commit b29d446 (1 parent: 9ea77f8)

Update README.md

Files changed (1)

README.md +19 -1


@@ -20,7 +20,25 @@ It achieves the following results on the evaluation set:
 ## Model description
-More information needed
 ## Intended uses & limitations

 ## Model description
+Trained on 1K dataset instead of 400
+lora_alpha = 256
+lora_dropout = 0.05
+lora_r = 128
+epochs = 3
+learning rate = 3e-4
+warmup_steps=100
+gradient_accumulation_steps = 1
+train_batch = 2
+eval_batch = 2
 ## Intended uses & limitations
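
The hyperparameters added in this commit imply a few derived quantities that are easy to misjudge by eye. The snippet below is a minimal sketch that works them out, not the author's training script. It assumes that "1K dataset" means 1000 training examples, that training ran on a single device, and that `train_batch` / `eval_batch` correspond to per-device batch sizes; none of these are stated in the commit. The dictionary keys are chosen to match keyword names used by `peft.LoraConfig` and `transformers.TrainingArguments`, but the code itself uses only the standard library.

```python
import math

# Values copied from the "Model description" section added in this commit.
lora = {"r": 128, "lora_alpha": 256, "lora_dropout": 0.05}
training = {
    "num_train_epochs": 3,
    "learning_rate": 3e-4,
    "warmup_steps": 100,
    "gradient_accumulation_steps": 1,
    "per_device_train_batch_size": 2,  # assumed mapping for train_batch
    "per_device_eval_batch_size": 2,   # assumed mapping for eval_batch
}

# Assumptions, not stated in the commit.
NUM_EXAMPLES = 1000  # "1K dataset" read as 1000 training examples
NUM_DEVICES = 1      # single-device training


def lora_scaling(cfg):
    """Standard LoRA scaling factor applied to the low-rank update: alpha / r."""
    return cfg["lora_alpha"] / cfg["r"]


def schedule_summary(train_cfg, num_examples, num_devices):
    """Derive effective batch size, optimizer steps, and warmup share."""
    effective_batch = (
        train_cfg["per_device_train_batch_size"]
        * train_cfg["gradient_accumulation_steps"]
        * num_devices
    )
    steps_per_epoch = math.ceil(num_examples / effective_batch)
    total_steps = steps_per_epoch * train_cfg["num_train_epochs"]
    return {
        "effective_batch": effective_batch,
        "steps_per_epoch": steps_per_epoch,
        "total_steps": total_steps,
        "warmup_fraction": train_cfg["warmup_steps"] / total_steps,
    }


summary = schedule_summary(training, NUM_EXAMPLES, NUM_DEVICES)
print("LoRA scaling (alpha / r):", lora_scaling(lora))
print(summary)
```

Under these assumptions the LoRA update is scaled by 2.0, and the 100 warmup steps cover roughly the first 7% of 1500 optimizer steps. If the dataset size or device count differs, change `NUM_EXAMPLES` and `NUM_DEVICES` accordingly.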