rumeyskeskn
commited on
Commit
•
6ffd2c3
1
Parent(s):
b5d2137
Update README.md
Browse files
README.md
CHANGED
@@ -64,6 +64,30 @@ The following hyperparameters were used during training:
|
|
64 |
- num_epochs: 2
|
65 |
- mixed_precision_training: Native AMP
|
66 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
67 |
### Training results
|
68 |
|
69 |
| Training Loss | Epoch | Step | Validation Loss | Wer |
|
|
|
64 |
- num_epochs: 2
|
65 |
- mixed_precision_training: Native AMP
|
66 |
|
67 |
+
## Model Inference
|
68 |
+
```python
|
69 |
+
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor
|
70 |
+
|
71 |
+
model = Wav2Vec2ForCTC.from_pretrained("rumeyskeskn/wav2vec2-large-xls-r-300m-tr-cv16.1").to("cpu")
|
72 |
+
processor = Wav2Vec2Processor.from_pretrained("rumeyskeskn/wav2vec2-large-xls-r-300m-tr-cv16.1")
|
73 |
+
audio_path = "audio.wav"
|
74 |
+
|
75 |
+
audio_array, sampling_rate = librosa.load(audio_path, sr=16000)
|
76 |
+
|
77 |
+
input_values = processor(audio_array, sampling_rate=sampling_rate).input_values[0]
|
78 |
+
|
79 |
+
input_dict = processor(input_values, return_tensors="pt", padding=True)
|
80 |
+
|
81 |
+
|
82 |
+
logits = model(input_dict.input_values).logits
|
83 |
+
|
84 |
+
pred_ids = torch.argmax(logits, dim=-1)
|
85 |
+
prediction = processor.decode(pred_ids[0])
|
86 |
+
|
87 |
+
print("Prediction:")
|
88 |
+
print(prediction)
|
89 |
+
```
|
90 |
+
|
91 |
### Training results
|
92 |
|
93 |
| Training Loss | Epoch | Step | Validation Loss | Wer |
|