File size: 1,429 Bytes
0dd785e
 
 
 
 
 
9b353fb
 
 
 
 
 
 
 
 
 
 
 
 
 
9b6d81a
 
4338d77
9b353fb
 
 
 
 
ff7b4f6
9b353fb
 
4338d77
9b353fb
 
 
2839f3d
9b353fb
 
 
 
 
 
 
ff7b4f6
9b353fb
 
2839f3d
9b353fb
ff7b4f6
9b353fb
379ef44
 
183dd11
9b353fb
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
---
language:
- ta
metrics:
- wer
pipeline_tag: automatic-speech-recognition
library_name: adapter-transformers
---
# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->

This model is a fine-tuned version of XLS-R on Tamil speech data from Tamil Vulnerable Speech Recognition.
## Model Details

### Model Description

<!-- Provide a longer summary of what this model is. -->


- **Model type:** [Transfoemr based model]
- **Language(s) :** [Tamil]
- **Finetuned from model :** [XLS-R]


## Uses

<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
Thw model is used to perform speech-to-text in Tamil.


### Downstream Use

<!-- This section is for the model use when fine-tuned for a task, or when plugged into a larger ecosystem/app -->

Tamil Speech Recogniton


## Training Details

### Training Data

<!-- This should link to a Dataset Card, perhaps with a short stub of information on what the training data is all about as well as documentation related to data pre-processing or additional filtering. -->
Tamil vulnerable speech dataset.


#### Preprocessing

All the .wav files are resampled to 16000 Hz and Log-Mel Spectrogram is extracted

## Traning

The training code is accessible through [here](https://colab.research.google.com/drive/1YKCibhy4L_Udqai7pqOrzkiTXC_l4wCY?usp=sharing)