Commit 336d94a (verified) by Xenova (HF Staff) · 1 parent: aa03b38

Create README.md

Files changed (1)
  1. README.md +168 -0
README.md ADDED
@@ -0,0 +1,168 @@
+ ---
+ language:
+ - en
+ datasets:
+ - librispeech_asr
+ - fisher_corpus
+ - Switchboard-1
+ - WSJ-0
+ - WSJ-1
+ - National-Singapore-Corpus-Part-1
+ - National-Singapore-Corpus-Part-6
+ - vctk
+ - voxpopuli
+ - europarl
+ - multilingual_librispeech
+ - mozilla-foundation/common_voice_8_0
+ - MLCommons/peoples_speech
+ thumbnail: null
+ tags:
+ - transformers
+ - automatic-speech-recognition
+ - speech
+ - audio
+ - FastConformer
+ - Conformer
+ - pytorch
+ - NeMo
+ - hf-asr-leaderboard
+ - ctc
+ license: cc-by-4.0
+ widget:
+ - example_title: Librispeech sample 1
+   src: https://cdn-media.huggingface.co/speech_samples/sample1.flac
+ - example_title: Librispeech sample 2
+   src: https://cdn-media.huggingface.co/speech_samples/sample2.flac
+ model-index:
+ - name: parakeet-ctc-0.6b
+   results:
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: AMI (Meetings test)
+       type: edinburghcstr/ami
+       config: ihm
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 16.3
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Earnings-22
+       type: revdotcom/earnings22
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 14.14
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: GigaSpeech
+       type: speechcolab/gigaspeech
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 10.35
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: LibriSpeech (clean)
+       type: librispeech_asr
+       config: clean
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 1.87
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: LibriSpeech (other)
+       type: librispeech_asr
+       config: other
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 3.76
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: SPGI Speech
+       type: kensho/spgispeech
+       config: test
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 4.11
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: tedlium-v3
+       type: LIUM/tedlium
+       config: release1
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 3.78
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Vox Populi
+       type: facebook/voxpopuli
+       config: en
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 7
+   - task:
+       name: Automatic Speech Recognition
+       type: automatic-speech-recognition
+     dataset:
+       name: Mozilla Common Voice 9.0
+       type: mozilla-foundation/common_voice_9_0
+       config: en
+       split: test
+       args:
+         language: en
+     metrics:
+     - name: Test WER
+       type: wer
+       value: 10.57
+ metrics:
+ - wer
+ pipeline_tag: automatic-speech-recognition
+ base_model:
+ - nvidia/parakeet-ctc-0.6b
+ ---
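
The `model-index` block reports one test WER per benchmark. As a rough way to compare them, the sketch below copies those values into a plain Python dictionary and computes an unweighted (macro) mean plus a ranking. The dictionary layout and the function names `macro_average_wer` and `rank_by_wer` are assumptions made for illustration; they are not part of the model card or of any library. The mean is not weighted by utterance or word count, so it is only a coarse summary.

```python
from statistics import mean

# Test WER (%) per dataset, copied from the model-index block of the README.
# The dictionary layout and names here are illustrative assumptions.
REPORTED_WER = {
    "AMI (Meetings test)": 16.3,
    "Earnings-22": 14.14,
    "GigaSpeech": 10.35,
    "LibriSpeech (clean)": 1.87,
    "LibriSpeech (other)": 3.76,
    "SPGI Speech": 4.11,
    "tedlium-v3": 3.78,
    "Vox Populi": 7.0,
    "Mozilla Common Voice 9.0": 10.57,
}


def macro_average_wer(results):
    """Return the unweighted mean WER across all datasets."""
    return mean(results.values())


def rank_by_wer(results):
    """Return (dataset, wer) pairs sorted from lowest to highest WER."""
    return sorted(results.items(), key=lambda item: item[1])


if __name__ == "__main__":
    for name, wer in rank_by_wer(REPORTED_WER):
        print(f"{name:28s} {wer:6.2f}")
    print(f"{'Macro average':28s} {macro_average_wer(REPORTED_WER):6.2f}")
```

Sorting by WER makes the spread between read speech (LibriSpeech) and conversational or meeting audio (AMI, Earnings-22) easy to see at a glance.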