mzboito committed on
Commit
5fd5c42
1 Parent(s): e8d2abd

Update README.md

  1. README.md +41 -1
README.md CHANGED
@@ -1,3 +1,43 @@
  ---
- license: apache-2.0
  ---
  ---
+ language: "fr"
+ thumbnail:
+ tags:
+ - wav2vec2
+ license: "apache-2.0"
  ---
+
+ # LeBenchmark: wav2vec2 base model trained on 1K hours of French *female-only* speech
+
+
+ LeBenchmark provides an ensemble of pretrained wav2vec2 models on different French datasets containing spontaneous, read, and broadcast speech.
+
+ For more information about our gender study for SSL models, please refer to our paper: [A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems]()
+
+
+ ## Model and data descriptions
+
+ We release four gender-specific models trained on 1K hours of speech.
+
+ - [wav2vec2-FR-1K-Male-large](https://huggingface.co/LeBenchmark/wav2vec-FR-1K-Male-large/)
+ - [wav2vec2-FR-1K-Male-base](https://huggingface.co/LeBenchmark/wav2vec-FR-1K-Male-base/)
+ - [wav2vec2-FR-1K-Female-large](https://huggingface.co/LeBenchmark/wav2vec-FR-1K-Female-large/)
+ - [wav2vec2-FR-1K-Female-base](https://huggingface.co/LeBenchmark/wav2vec-FR-1K-Female-base/)
+
+ ## Intended uses & limitations
+
+ Pretrained wav2vec2 models are distributed under the Apache-2.0 license. Hence, they can be reused extensively without strict limitations. However, benchmarks and data may be linked to corpora that are not completely open-sourced.
+
+ ## Referencing our gender-specific models
+
+ <soon to be added>
+
+ ## Referencing LeBenchmark
+
+ ```
+ @article{Evain2021LeBenchmarkAR,
+ title={LeBenchmark: A Reproducible Framework for Assessing Self-Supervised Representation Learning from Speech},
+ author={Sol{\`e}ne Evain and Ha Nguyen and Hang Le and Marcely Zanon Boito and Salima Mdhaffar and Sina Alisamir and Ziyi Tong and N. Tomashenko and Marco Dinarelli and Titouan Parcollet and A. Allauzen and Y. Est{\`e}ve and B. Lecouteux and F. Portet and S. Rossato and F. Ringeval and D. Schwab and L. Besacier},
+ journal={ArXiv},
+ year={2021},
+ volume={abs/2104.11462}
+ }