marianbasti
commited on
Commit
•
86b132d
1
Parent(s):
0d8f5da
Update README.md
Browse files
README.md
CHANGED
@@ -132,7 +132,7 @@ assistant_model = AutoModelForCausalLM.from_pretrained(
|
|
132 |
assistant_model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
133 |
)
|
134 |
assistant_model.to(device)
|
135 |
-
model_id = "openai/whisper-large-
|
136 |
model = AutoModelForSpeechSeq2Seq.from_pretrained(
|
137 |
model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
138 |
)
|
@@ -155,7 +155,7 @@ print(result["text"])
|
|
155 |
```
|
156 |
## Training
|
157 |
|
158 |
-
The model was trained for 40,000 optimisation steps (or
|
159 |
```
|
160 |
--teacher_model_name_or_path "openai/whisper-large-v3"
|
161 |
--train_dataset_name "mozilla-foundation/common_voice_16_1"
|
|
|
132 |
assistant_model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
133 |
)
|
134 |
assistant_model.to(device)
|
135 |
+
model_id = "openai/whisper-large-v3"
|
136 |
model = AutoModelForSpeechSeq2Seq.from_pretrained(
|
137 |
model_id, torch_dtype=torch_dtype, low_cpu_mem_usage=True, use_safetensors=True
|
138 |
)
|
|
|
155 |
```
|
156 |
## Training
|
157 |
|
158 |
+
The model was trained for 40,000 optimisation steps (or 0.98 epochs), on a single RTX3090 for ~30 hours, using the following training parameters:
|
159 |
```
|
160 |
--teacher_model_name_or_path "openai/whisper-large-v3"
|
161 |
--train_dataset_name "mozilla-foundation/common_voice_16_1"
|