francislabounty committed on
Commit
ac86b9b
1 Parent(s): efa62c0

Update README.md

  1. README.md +1 -1
README.md CHANGED
@@ -23,7 +23,7 @@ Kuru~ Kuru~
  - Effective batch size: 64
  - Learning Rate: 5e-7 with linear decay (0.1 warmup ratio)
  - Epochs: 2
- - 100k samples (50K new SPIN + 50K from iter_0)
+ - 100k samples (50k new SPIN + 50k from iter_0)
  - QLoRA:
  - 256 r and 256 alpha
  - ```python