Files changed (1) hide show
  1. README.md +7 -1
README.md CHANGED
@@ -44,4 +44,10 @@ To generate the transcription for a song, we can use the Transformers [`pipeline
44
  ```
45
 
46
  ## Training Data
47
- The training dataset consists of 8,000 Vietnamese songs across various V-Pop styles. Due to privacy concerns, the data is not publicly available. Each song includes lyrics along with corresponding timestamps, enabling precise mapping of audio segments to their respective lyrics based on the provided timestamp information.
 
 
 
 
 
 
 
44
  ```
45
 
46
  ## Training Data
47
+ The training dataset consists of 7,000 Vietnamese songs, in total of roughly 550 hours of audio, across various Vietnamese music genres, dialects and accents. Due to IP concerns, the data is not publicly available. Each song includes lyrics along with corresponding line-level timestamps, enabling precise mapping of audio segments to their respective lyrics based on the provided timestamp information.
48
+
49
+ Technical report coming soon.
50
+ This project was made through equal contributions from:
51
+ - [Kevin Soh](https://github.com/kelvinbksoh)
52
+ - [Bernard Cheng Zheng Zhuan](https://github.com/bernardcheng)
53
+ - [Nguyen Quoc Anh](https://github.com/BatmanofZuhandArrgh)