nguyenvulebinh committed on
Commit
0a5c13e
1 Parent(s): a8a24c7

Update README.md

Files changed (1)
  1. README.md +36 -5
README.md CHANGED
@@ -7,16 +7,47 @@ tags:
  license: cc-by-nc-4.0
  ---

- # Vietnamese Wav2Vec2-Base model

- Our self-supervised model is pre-trained on a massive audio set of 13k hours of Vietnamese youtube audio.

- # Usage

  Since our model has the same architecture as the English wav2vec2 version, you can use [this notebook](https://colab.research.google.com/drive/1FjTsqbYKphl9kL-eILgUc-bl4zVThL8F?usp=sharing) for more information on how to fine-tune the model.

- # Contact

  nguyenvulebinh@gmail.com / binh@vietai.org

- [![Follow](https://img.shields.io/twitter/follow/nguyenvulebinh?style=social)](https://twitter.com/intent/follow?screen_name=nguyenvulebinh)

  license: cc-by-nc-4.0
  ---

+ # Vietnamese Self-Supervised Learning Wav2Vec2 model

+ ## Model

+ We use the wav2vec2 architecture for self-supervised learning.
+
+ <img src="https://raw.githubusercontent.com/patrickvonplaten/scientific_images/master/wav2vec2.png" width=50% height=50%>
+
+ ## Data
+
+ Our self-supervised model is pre-trained on 13k hours of Vietnamese YouTube audio, which includes:
+ - Clean audio
+ - Noisy audio
+ - Conversational speech
+ - Multiple genders and dialects
+
+ ## Download
+
+ We have uploaded our pre-trained models to Hugging Face:
+ - [Base version](https://huggingface.co/nguyenvulebinh/wav2vec2-base-vi) ~ 95M params
+ - [Large version](https://huggingface.co/nguyenvulebinh/wav2vec2-large-vi) ~ 317M params
+
+ ## Usage
+
+ ```python
+ from transformers import Wav2Vec2ForPreTraining, Wav2Vec2Processor
+
+ model_name = 'nguyenvulebinh/wav2vec2-base-vi'
+ # model_name = 'nguyenvulebinh/wav2vec2-large-vi'
+
+ model = Wav2Vec2ForPreTraining.from_pretrained(model_name)
+ processor = Wav2Vec2Processor.from_pretrained(model_name)
+
+ ```

  Since our model has the same architecture as the English wav2vec2 version, you can use [this notebook](https://colab.research.google.com/drive/1FjTsqbYKphl9kL-eILgUc-bl4zVThL8F?usp=sharing) for more information on how to fine-tune the model.

+ ## Contact

  nguyenvulebinh@gmail.com / binh@vietai.org

+ [![Follow](https://img.shields.io/twitter/follow/nguyenvulebinh?style=social)](https://twitter.com/intent/follow?screen_name=nguyenvulebinh)
+
+
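
Reviewer note on the Model section of the updated README: since the model is said to share the standard wav2vec2 architecture, it may help to know how many feature frames the convolutional encoder produces for a given clip. The sketch below is not part of the commit. It assumes the stock wav2vec2 feature-encoder layout (seven unpadded 1-D convolutions with the kernel sizes and strides listed) and 16 kHz input audio; neither is stated in the README, so check the checkpoint's `config.json` before relying on it. The names `CONV_LAYERS` and `num_output_frames` are hypothetical helpers.

```python
# Assumed stock wav2vec2 feature encoder: (kernel, stride) for each conv layer.
# This is an assumption about the checkpoints, not something the README states.
CONV_LAYERS = [(10, 5), (3, 2), (3, 2), (3, 2), (3, 2), (2, 2), (2, 2)]


def num_output_frames(num_samples: int) -> int:
    """Return the number of feature frames emitted for a raw waveform."""
    length = num_samples
    for kernel, stride in CONV_LAYERS:
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return length


if __name__ == "__main__":
    one_second = 16_000  # assumed 16 kHz sampling rate
    print(num_output_frames(one_second))  # 49
```

Under these assumptions, one second of audio maps to 49 frames, roughly one frame every 20 ms, which is useful when aligning model outputs with transcripts during fine-tuning.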