Librarian Bot: Add base_model information to model
This pull request aims to enrich the metadata of your model by adding [`facebook/wav2vec2-large-xlsr-53`](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) as a `base_model` field, situated in the `YAML` block of your model's `README.md`.
How did we find this information? We performed a regular expression match on your `README.md` file to determine the connection.
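The bot's actual pattern is not stated in this PR. Purely as an illustration, a regular-expression match of this kind could look like the sketch below. The pattern, the function name `find_base_model`, and the sample sentence are all assumptions made for the example, not the bot's real implementation.

```python
import re
from typing import Optional

# Hypothetical pattern (an assumption, not Librarian Bot's real one):
# the phrase "fine-tuned" followed, on the same line, by a Hub model URL.
BASE_MODEL_PATTERN = re.compile(
    r"fine-?tuned[^\n]*?https://huggingface\.co/([\w.-]+/[\w.-]+)",
    re.IGNORECASE,
)


def find_base_model(readme_text: str) -> Optional[str]:
    """Return the first '<owner>/<model>' ID that matches the pattern, or None."""
    match = BASE_MODEL_PATTERN.search(readme_text)
    return match.group(1) if match else None


# Made-up README sentence, used only to exercise the function.
sample = (
    "Fine-tuned [facebook/wav2vec2-large-xlsr-53]"
    "(https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on English."
)
print(find_base_model(sample))
```

A pattern this loose would also pick up unrelated Hub links that happen to follow the phrase, so a match is best treated as a suggestion to review rather than a confirmed relationship.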
**Why add this?** Enhancing your model's metadata in this way:
- **Boosts Discoverability** - It becomes straightforward to trace the relationships between various models on the Hugging Face Hub.
- **Highlights Impact** - It showcases the contributions and influences different models have within the community.
For a hands-on example of how such metadata can play a pivotal role in mapping model connections, take a look at [librarian-bots/base_model_explorer](https://huggingface.co/spaces/librarian-bots/base_model_explorer).
This PR comes courtesy of [Librarian Bot](https://huggingface.co/librarian-bot). If you have any feedback, queries, or need assistance, please don't hesitate to reach out to [@davanstrien](https://huggingface.co/davanstrien).
If you want to automatically add `base_model` metadata to more of your models, you can use the [Librarian Bot](https://huggingface.co/librarian-bot) [Metadata Request Service](https://huggingface.co/spaces/librarian-bots/metadata_request_service)!
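For readers who would rather edit a card by hand, the change amounts to one extra top-level key in the YAML front matter of `README.md`. The following is a minimal standard-library sketch of that edit. The helper name `add_base_model` and the sample README are hypothetical, and the sketch appends the key at the end of the front matter rather than placing it where this PR does.

```python
def add_base_model(readme_text: str, base_model: str) -> str:
    """Add a top-level `base_model` key to the YAML front matter if it is missing."""
    lines = readme_text.split("\n")
    if not lines or lines[0].strip() != "---":
        raise ValueError("README has no YAML front matter")

    # Find the line that closes the front matter (the second '---').
    end = next(
        (i for i in range(1, len(lines)) if lines[i].strip() == "---"),
        None,
    )
    if end is None:
        raise ValueError("YAML front matter is not terminated")

    # Leave the README untouched if the key is already present.
    if any(line.startswith("base_model:") for line in lines[1:end]):
        return readme_text

    # Insert the new key just before the closing '---'.
    lines.insert(end, f"base_model: {base_model}")
    return "\n".join(lines)


# Made-up README used only for illustration.
readme = "---\nlanguage: en\ntags:\n- audio\n---\n\n# Title\n"
updated = add_base_model(readme, "facebook/wav2vec2-large-xlsr-53")
print(updated)
```

Calling the helper a second time on its own output returns the text unchanged, so it is safe to run over a batch of cards that may already carry the field.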
@@ -1,11 +1,6 @@
 ---
 language: en
-
-- common_voice
-- mozilla-foundation/common_voice_6_0
-metrics:
-- wer
-- cer
+license: apache-2.0
 tags:
 - audio
 - automatic-speech-recognition
@@ -15,50 +10,56 @@ tags:
 - robust-speech-event
 - speech
 - xlsr-fine-tuning-week
-
+datasets:
+- common_voice
+- mozilla-foundation/common_voice_6_0
+metrics:
+- wer
+- cer
+base_model: facebook/wav2vec2-large-xlsr-53
 model-index:
 - name: XLSR Wav2Vec2 English by Jonatas Grosman
   results:
   - task:
-      name: Automatic Speech Recognition
       type: automatic-speech-recognition
+      name: Automatic Speech Recognition
     dataset:
       name: Common Voice en
       type: common_voice
       args: en
     metrics:
-    -
-      type: wer
+    - type: wer
       value: 19.06
-
-
+      name: Test WER
+    - type: cer
       value: 7.69
-
-
+      name: Test CER
+    - type: wer
       value: 14.81
-
-
+      name: Test WER (+LM)
+    - type: cer
       value: 6.84
+      name: Test CER (+LM)
   - task:
-      name: Automatic Speech Recognition
       type: automatic-speech-recognition
+      name: Automatic Speech Recognition
     dataset:
       name: Robust Speech Event - Dev Data
       type: speech-recognition-community-v2/dev_data
       args: en
     metrics:
-    -
-      type: wer
+    - type: wer
       value: 27.72
-
-
+      name: Dev WER
+    - type: cer
       value: 11.65
-
-
+      name: Dev CER
+    - type: wer
       value: 20.85
-
-
+      name: Dev WER (+LM)
+    - type: cer
       value: 11.01
+      name: Dev CER (+LM)
 ---
 
 # Fine-tuned XLSR-53 large model for speech recognition in English