parthiv11 commited on
Commit
ad564c4
1 Parent(s): 1175546

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +81 -0
README.md ADDED
@@ -0,0 +1,81 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - hi
4
+ metrics:
5
+ - wer
6
+ pipeline_tag: automatic-speech-recognition
7
+ tags:
8
+ - music
9
+ license: mit
10
+ ---
11
+
12
+ # IndicWhisper With JAX (more faster)
13
+
14
+ IndicWhisper is a state-of-the-art speech recognition model fine-tuned on Indian languages. This repository contains the code for training and evaluating the model, as well as pre-trained checkpoints for immediate use.
15
+
16
+ ## Overview
17
+
18
+ IndicWhisper achieves impressive Word Error Rates (WERs) on various benchmarks for Indian languages. It outperforms other publicly available models, making it a valuable asset for speech recognition tasks in Indian languages.
19
+
20
+ ### Performance on Vistaar Benchmark (Hindi Subset)
21
+
22
+ | Model | Kathbath | Kathbath-Hard | FLEURS | CommonVoice | IndicTTS | MUCS | Gramvaani | Average |
23
+ |---------------|----------|---------------|----------|-------------|----------|--------------|-----------|-----------|
24
+ | Google STT | 14.3 | 16.7 | 19.4 | 20.8 | 18.3 | 17.8 | 59.9 | 23.9 |
25
+ | IndicWav2vec | 12.2 | 16.2 | 18.3 | 20.2 | 15 | 22.9 | 42.1 | 21 |
26
+ | Azure STT | 13.6 | 15.1 | 24.3 | 14.6 | 15.2 | 15.1 | 42.3 | 20 |
27
+ | Nvidia-medium | 14 | 15.6 | 19.4 | 20.4 | 12.3 | 12.4 | 41.3 | 19.4 |
28
+ | Nvidia-large | 12.7 | 14.2 | 15.7 | 21.2 | 12.2 | **11.8** | 42.6 | 18.6 |
29
+ | IndicWhisper | **10.3** | **12.0** | **11.4** | **15.0** | **7.6** | 12 | **26.8** | **13.6** |
30
+
31
+
32
+
33
+ ## Usage
34
+
35
+
36
+
37
+ ## New Feature: JAX Mode
38
+
39
+ We have recently added support for JAX mode, which significantly enhances performance on both TPUs and GPUs. This feature is particularly useful for high-performance computing environments and is optimized for speed and efficiency.
40
+
41
+ This repository provides an optimized JAX model for the Indic Whisper Model, built upon the foundation of the 🤗 Indic Whisper implementation by AI4 Bharat. The JAX implementation significantly enhances performance, running over 70x compared to the original Indic Whisper PyTorch code. This makes it the fastest Whisper implementation available.
42
+
43
+ ```python
44
+ from whisper_jax import FlaxWhisperForConditionalGeneration, FlaxWhisperPipline
45
+ import jax.numpy as jnp
46
+
47
+ pipeline = FlaxWhisperPipline('parthiv11/indic_whisper_hi_multi_gpu', dtype=jnp.bfloat16)
48
+ transcript= pipeline('sample.mp3')
49
+
50
+ ```
51
+
52
+ ### Acknowledgements
53
+
54
+ We would like to express our gratitude to the following organizations for their support:
55
+
56
+ - EkStep Foundation for their generous grant, which facilitated the establishment of the Centre for AI4Bharat at IIT Madras.
57
+ - The Ministry of Electronics and Information Technology (NLTM) for its grant to support the creation of datasets and models for Indian languages under the Bhashini project.
58
+ - The Centre for Development of Advanced Computing, India (C-DAC), for providing access to the Param Siddhi supercomputer for training our models.
59
+ - Microsoft for its grant to create datasets, tools, and resources for Indian languages.
60
+ - For JAX guide on [github](https://github.com/sanchit-gandhi/whisper-jax)
61
+
62
+
63
+ ### License
64
+
65
+ IndicWhisper and the associated Vistaar benchmark are MIT-licensed. This license applies to all the fine-tuned language models included in this repository.
66
+
67
+ ### Contributors
68
+
69
+ - Kaushal Bhogale (AI4Bharat)
70
+ - Sai Narayan Sundaresan (IITKGP, AI4Bharat)
71
+ - Abhigyan Raman (AI4Bharat)
72
+ - Tahir Javed (IITM, AI4Bharat)
73
+ - Mitesh Khapra (IITM, AI4Bharat, RBCDSAI)
74
+ - Pratyush Kumar (Microsoft, AI4Bharat)
75
+
76
+
77
+ ## Contributing
78
+
79
+ We welcome contributions from the community to further improve IndicWhisper. If you have any ideas, bug fixes, or enhancements, please feel free to submit a pull request.
80
+
81
+ Thank you for your interest in IndicWhisper! We hope it proves to be a valuable tool for your speech recognition needs in Indian languages.