Commit 9c923d3 by chaanks
1 Parent(s): ad47820

Create README.md

Files changed (1)
  1. README.md +71 -0
README.md ADDED
@@ -0,0 +1,71 @@
+ ---
+ language:
+ - en
+ thumbnail: null
+ pipeline_tag: automatic-speech-recognition
+ tags:
+ - whisper
+ - pytorch
+ - speechbrain
+ - Transformer
+ - hf-asr-leaderboard
+ license: apache-2.0
+ ---
+
+ <iframe src="https://ghbtns.com/github-btn.html?user=speechbrain&repo=speechbrain&type=star&count=true&size=large&v=2" frameborder="0" scrolling="0" width="170" height="30" title="GitHub"></iframe>
+ <br/><br/>
+
+ # Whisper large v2 SpeechBrain
+
+ This repository provides the tools needed to perform automatic speech
+ recognition with an end-to-end Whisper model within SpeechBrain.
+ Please note that this is not an official SpeechBrain repository.
+
+ ## Install SpeechBrain
+
+ First, install transformers and SpeechBrain with the following command:
+
+ ```
+ pip install speechbrain transformers==4.28.0
+ ```
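
As a quick sanity check after installation, the following sketch reports which versions of the two packages are installed and whether transformers matches the pin above. It relies only on the standard library; the helper names `installed_version` and `check_pin` are hypothetical and not part of SpeechBrain or transformers.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is missing."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def check_pin(package: str, expected: str) -> bool:
    """True only if `package` is installed at exactly the `expected` version."""
    return installed_version(package) == expected


if __name__ == "__main__":
    print("speechbrain:", installed_version("speechbrain"))
    print("transformers:", installed_version("transformers"))
    print("transformers pin matches:", check_pin("transformers", "4.28.0"))
```

If the last line prints `False`, re-running the pip command above in the active environment is the likely fix.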
+
+ We encourage you to read the SpeechBrain tutorials and learn more about
+ [SpeechBrain](https://speechbrain.github.io).
+
+ ### Transcribing your own audio files
+
+ ```python
+ from speechbrain.pretrained import WhisperASR
+
+ # Download the model from the Hub and cache it locally in savedir.
+ asr_model = WhisperASR.from_hparams(source="chaanks/asr-whisper-large-v2-sb", savedir="pretrained_models/asr-whisper-large-v2-sb")
+ # Transcribe the example file and print the result.
+ print(asr_model.transcribe_file("chaanks/asr-whisper-large-v2-sb/example.wav"))
+ ```
+
+ ### Inference on GPU
+ To perform inference on the GPU, add `run_opts={"device":"cuda"}` when calling the `from_hparams` method.
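
A minimal sketch of how the `run_opts` argument might be assembled and passed to `from_hparams`. The helper names `build_run_opts` and `load_model` are hypothetical, introduced here for illustration only. The `"cuda:<index>"` form is a standard PyTorch device string and is an assumption beyond what the README states.

```python
from typing import Dict, Optional


def build_run_opts(use_gpu: bool, gpu_index: Optional[int] = None) -> Dict[str, str]:
    """Build the run_opts dictionary that is passed to from_hparams."""
    if not use_gpu:
        return {"device": "cpu"}
    # "cuda" selects the default GPU; "cuda:<index>" selects a specific one.
    device = "cuda" if gpu_index is None else f"cuda:{gpu_index}"
    return {"device": device}


def load_model(use_gpu: bool):
    """Load the model from this repository on the requested device."""
    # Imported lazily so build_run_opts can be used without SpeechBrain installed.
    from speechbrain.pretrained import WhisperASR

    return WhisperASR.from_hparams(
        source="chaanks/asr-whisper-large-v2-sb",
        savedir="pretrained_models/asr-whisper-large-v2-sb",
        run_opts=build_run_opts(use_gpu),
    )
```

One way to choose the flag at runtime is `load_model(use_gpu=torch.cuda.is_available())`, which falls back to the CPU when no GPU is visible to PyTorch.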
+
+ ### Limitations
+ The SpeechBrain team does not provide any warranty on the performance achieved by this model when used on other datasets.
+
+ #### Referencing SpeechBrain
+
+ ```
+ @misc{SB2021,
+   author = {Ravanelli, Mirco and Parcollet, Titouan and Rouhe, Aku and Plantinga, Peter and Rastorgueva, Elena and Lugosch, Loren and Dawalatabad, Nauman and Ju-Chieh, Chou and Heba, Abdel and Grondin, Francois and Aris, William and Liao, Chien-Feng and Cornell, Samuele and Yeh, Sung-Lin and Na, Hwidong and Gao, Yan and Fu, Szu-Wei and Subakan, Cem and De Mori, Renato and Bengio, Yoshua},
+   title = {SpeechBrain},
+   year = {2021},
+   publisher = {GitHub},
+   journal = {GitHub repository},
+   howpublished = {\url{https://github.com/speechbrain/speechbrain}},
+ }
+ ```
+
+ #### About SpeechBrain
+ SpeechBrain is an open-source, all-in-one speech toolkit. It is designed to be simple, extremely flexible, and user-friendly, and it achieves competitive or state-of-the-art performance in various domains.
+
+ Website: https://speechbrain.github.io/
+
+ GitHub: https://github.com/speechbrain/speechbrain