5roop commited on
Commit
e65b698
1 Parent(s): 3135966

Added hyperparams to readme

Browse files
Files changed (1) hide show
  1. README.md +15 -0
README.md CHANGED
@@ -61,3 +61,18 @@ transcription = processor.batch_decode(predicted_ids)
61
 
62
  # transcription: ['veliki broj poslovnih subjekata posluje sa minusom velik dio']
63
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
61
 
62
  # transcription: ['veliki broj poslovnih subjekata posluje sa minusom velik dio']
63
  ```
64
+
65
+ ## Training hyperparameters
66
+
67
+ In fine-tuning, the following arguments were used:
68
+
69
+ |arg | value|
70
+ |---|---|
71
+ |`group_by_length` |True |
72
+ | `per_device_train_batch_size`|16 |
73
+ |`gradient_accumulation_steps` |4 |
74
+ |`num_train_epochs` |8 |
75
+ |`gradient_checkpointing` |True |
76
+ |`fp16` |True |
77
+ |`learning_rate` | 3e-4|
78
+ |`warmup_steps` | 500|