serpdotai
/

sparsetral-16x7B-v2-SPIN_iter1

Text Generation

Model card Files Files and versions Community

francislabounty commited on Feb 19, 2024

Commit

ac86b9b

·

verified ·

1 Parent(s): efa62c0

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -23,7 +23,7 @@ Kuru~ Kuru~
 - Effective batch size: 64
 - Learning Rate: 5e-7 with linear decay (0.1 warmup ratio)
 - Epochs: 2
-- 100k samples (50K new SPIN + 50K from iter_0)
 - QLoRA:
   - 256 r and 256 alpha
   - ```python

 - Effective batch size: 64
 - Learning Rate: 5e-7 with linear decay (0.1 warmup ratio)
 - Epochs: 2
+- 100k samples (50k new SPIN + 50k from iter_0)
 - QLoRA:
   - 256 r and 256 alpha
   - ```python