File size: 1,156 Bytes
067ad5b
 
d4f4590
 
 
 
8657a3e
47222d2
6155c4f
8657a3e
ab49101
43b8501
10e968a
 
 
 
ab49101
 
 
 
 
 
 
 
 
 
 
38bb1e5
 
b369553
982e531
 
b369553
 
38bb1e5
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
---
license: cc-by-nc-4.0
language:
- bn
library_name: nemo
pipeline_tag: automatic-speech-recognition
---
## Hishab BN FastConformer
__Hishab BN FastConformer__ is a [fastconformer](https://docs.nvidia.com/deeplearning/nemo/user-guide/docs/en/stable/asr/models.html#fast-conformer) based model trained on ~18K Hours [MegaBNSpeech]() corpus.

## Using method
This model can be used for transcribing Bangla audio and also can be used as pre-trained model to fine-tuning on custom datasets using [NeMo](https://github.com/NVIDIA/NeMo) framework.

### Installation
To install [NeMo](https://github.com/NVIDIA/NeMo) check NeMo documentation.

### Inferencing
```py
import nemo.collections.asr as nemo_asr
asr_model = nemo_asr.models.ASRModel.from_pretrained("hishab/hishab_bn_fastconformer")

transcriptions = asr_model.transcribe(["file.wav"])
```
## Training Datasets

## Training Details

## Evaluation


![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/WvMlp95z2-GXT6AYfwW8Y.png)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64df9253cccd823564c3303b/O2RA9TAedIv1OTqgdIap5.png)

## Citation