Vietnamese OCR dataset, including word-level and line-level image data.
Data Studio
community
AI & ML interests
Dataset for Machine Learning.
Organization Card
Contact me if needed: minhquannguyen1800@gmail.com (Minh Quan)
Data Information:
OCR
Vietnamese Document with Red Seal: 223,830 samples
Vietnamese Document with Black Seal: 71,970 samples
Vietnamese Document: 1,305,220 samples
Vietnamese Document with underline text: 365,919 samples
Vietnamese Document with 5 colors Highlight: 135,295 samples
Vietnamese Document with Yellow Highlight: 174,282 samples
High quality Vietnamese Document: 22,524 samples
Text-to-Speech
Vietnamese male and female voices: more than 1,000 hours, 1,000,000 audio clips
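As a quick sanity check on the figures above, the following Python sketch totals the OCR sample counts and derives a lower bound on the average Text-to-Speech clip length. It assumes the seven OCR categories do not overlap, which the list does not state, and the names `ocr_samples`, `tts_hours_min`, and `tts_clips` are illustrative only.

```python
# Sample counts copied from the Data Information list.
# Assumption: the categories are disjoint, so the counts can simply be summed.
ocr_samples = {
    "red_seal": 223_830,
    "black_seal": 71_970,
    "plain_document": 1_305_220,
    "underlined_text": 365_919,
    "five_color_highlight": 135_295,
    "yellow_highlight": 174_282,
    "high_quality": 22_524,
}
total_ocr = sum(ocr_samples.values())

# The card states "more than 1,000 hours", so this is only a lower bound.
tts_hours_min = 1000
tts_clips = 1_000_000
avg_clip_seconds_min = tts_hours_min * 3600 / tts_clips

print(f"OCR samples in total (if disjoint): {total_ocr:,}")
print(f"Average clip length: more than {avg_clip_seconds_min:.1f} s")
```

If the categories do overlap, the total is an upper bound on the number of distinct samples rather than an exact count.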
Datasets (61)
DataStudio/Vietnamese_Text2Speech_AB0 • 888k
DataStudio/Vietnamese_Text2Speech_AB4 • 20.9k
DataStudio/Vietnamese_Text2Speech_AB3 • 77.7k
DataStudio/Vietnamese_Text2Speech_AB2 • 327k
DataStudio/Vietnamese_Text2Speech_AB1 • 389k
DataStudio/S2W_format • 16M • 1
DataStudio/Vietnamese_ASR_TestingData • 200 • 34
DataStudio/Viet-wikipedia • 1.29M • 27
DataStudio/T2S_dataset_v2 • 215k
DataStudio/Vietnamese_ASR_TestingData_Old • 100 • 29
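One way to inspect a listed dataset is to stream a few records with the Hugging Face `datasets` library. This is a minimal sketch, not a documented workflow for these repositories: the split name `train` is an assumption, the column names are not known from this page (so nothing here relies on them), and `peek` is a hypothetical helper. Some repositories may also require authentication.

```python
from itertools import islice

# Repository IDs copied from the dataset list (AB0 through AB4).
TTS_REPOS = [f"DataStudio/Vietnamese_Text2Speech_AB{i}" for i in range(5)]


def peek(repo_id: str, n: int = 3, split: str = "train") -> list[dict]:
    """Stream the first n records without downloading the whole dataset.

    Assumption: the repository exposes a split called `split`.
    """
    # Imported lazily so the module loads even without the library installed.
    from datasets import load_dataset  # pip install datasets

    stream = load_dataset(repo_id, split=split, streaming=True)
    return list(islice(stream, n))
```

Calling `peek(TTS_REPOS[0])` would return up to three records as dictionaries; printing `sorted(record.keys())` for one of them is a simple way to discover the actual column names before writing any processing code.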