malmarjeh commited on
Commit
6f8477c
1 Parent(s): 73b4395

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -4
README.md CHANGED
@@ -50,10 +50,10 @@ The model was fine-tuned using a private Bitext dataset designed for question an
50
 
51
  - **Optimizer**: AdamW with betas=(0.9, 0.999) and epsilon=1e-08
52
  - **Learning Rate**: 0.0002 with a cosine learning rate scheduler
53
- - **Epochs**: 1
54
- - **Batch Size**: 8
55
- - **Gradient Accumulation Steps**: 4
56
- - **Maximum Sequence Length**: 1024 tokens
57
 
58
  ### Environment
59
 
 
50
 
51
  - **Optimizer**: AdamW with betas=(0.9, 0.999) and epsilon=1e-08
52
  - **Learning Rate**: 0.0002 with a cosine learning rate scheduler
53
+ - **Epochs**: 4
54
+ - **Batch Size**: 10
55
+ - **Gradient Accumulation Steps**: 8
56
+ - **Maximum Sequence Length**: 8192 tokens
57
 
58
  ### Environment
59