farsipal committed on
Commit
115b272
1 Parent(s): eea944c

Update README.md

  1. README.md +46 -1
README.md CHANGED
@@ -53,7 +53,52 @@ The test set was similarly used for validation.
 
  ## Training procedure
 
- The script used to perform the training is included in the files of this space:
 
  ### Training hyperparameters
 
  ## Training procedure
 
+ Training was performed with the script `run_speech_recognition_seq2seq_streaming.py`, which is included in the files of this space, using the following arguments:
+
+ ```
+ --model_name_or_path "openai/whisper-small"
+ --model_revision "main"
+ --do_train True
+ --do_eval True
+ --use_auth_token False
+ --freeze_encoder True
+ --model_index_name "Whisper Small - Greek (el)"
+ --dataset_name "mozilla-foundation/common_voice_11_0"
+ --dataset_config_name "el"
+ --audio_column_name "audio"
+ --text_column_name "sentence"
+ --max_duration_in_seconds 30
+ --train_split_name "train+validation"
+ --eval_split_name "test"
+ --do_lower_case False
+ --do_remove_punctuation False
+ --do_normalize_eval True
+ --language "greek"
+ --task "translate"
+ --shuffle_buffer_size 500
+ --output_dir "./data/finetuningRuns/whisper-sm-el-frzEnc-xlate"
+ --per_device_train_batch_size 16
+ --gradient_accumulation_steps 4
+ --learning_rate 1e-5
+ --warmup_steps 500
+ --max_steps 5000
+ --gradient_checkpointing True
+ --fp16 True
+ --evaluation_strategy "steps"
+ --per_device_eval_batch_size 8
+ --predict_with_generate True
+ --generation_max_length 225
+ --save_steps 1000
+ --eval_steps 1000
+ --logging_steps 25
+ --report_to "tensorboard"
+ --load_best_model_at_end True
+ --metric_for_best_model "wer"
+ --greater_is_better False
+ --push_to_hub False
+ --overwrite_output_dir True
+
+ ```
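
As a rough cross-check of the arguments listed above, the following sketch derives a few quantities they imply: the effective batch size per optimizer step, the number of training examples consumed over the whole run, the number of evaluation passes, and the share of steps spent in warmup. The numeric values are copied from the argument list. The variable names are illustrative, and `num_devices = 1` is an assumption, since the README does not state how many GPUs were used.

```python
# Values copied from the training arguments listed above.
per_device_train_batch_size = 16
gradient_accumulation_steps = 4
max_steps = 5000
warmup_steps = 500
eval_steps = 1000

# Assumption: a single training device (not stated in the README).
num_devices = 1

# Examples contributing to each optimizer update.
effective_batch_size = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)

# Examples drawn from the streamed training split over the full run.
examples_seen = effective_batch_size * max_steps

# Number of evaluation passes triggered by --eval_steps.
num_evaluations = max_steps // eval_steps

# Fraction of optimizer steps spent in learning-rate warmup.
warmup_fraction = warmup_steps / max_steps

print(f"effective batch size: {effective_batch_size}")
print(f"examples seen:        {examples_seen}")
print(f"evaluation passes:    {num_evaluations}")
print(f"warmup fraction:      {warmup_fraction:.0%}")
```

Under the single-device assumption this gives an effective batch size of 64 and five evaluation passes. With more devices, the effective batch size and the number of examples seen scale proportionally.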
 
  ### Training hyperparameters