Vietnamese OCR dataset, including word-level and line-level image data.
Data Studio
AI & ML interests
Datasets for machine learning.
Organization Card
Please feel free to contact me if needed: nguyenminh.quan0663@gmail.com (Minh Quan).
Data Information:
OCR
Vietnamese documents with 14 different types of noise: more than 2 million samples (line level).
Vietnamese documents with noise: more than 100 thousand samples (word level).
Text-to-Speech
More than 3,000 hours of Vietnamese male and female voices.
Collections: 2
Datasets: 65
DataStudio/Vietnamese_Text2Speech_AB9 • 329k • 32
DataStudio/Vietnamese_Text2Speech_AB7 • 86.8k • 31
DataStudio/Vietnamese_Text2Speech_AB6 • 25.1k • 32
DataStudio/Vietnamese_Text2Speech_AB5 • 90.4k • 33
DataStudio/Vietnamese_Text2Speech_AB0 • 888k • 33
DataStudio/Vietnamese_Text2Speech_AB4 • 20.9k • 34
DataStudio/Vietnamese_Text2Speech_AB3 • 77.7k • 36 • 2
DataStudio/Vietnamese_Text2Speech_AB2 • 327k • 33
DataStudio/Vietnamese_Text2Speech_AB1 • 389k • 33
DataStudio/S2W_format • 16M • 59
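
The text-to-speech repositories listed above could be inspected with the Hugging Face `datasets` library, roughly as sketched below. This is a minimal sketch, not a documented loading procedure for these datasets: the `train` split name is an assumption, the helper `peek` is hypothetical, and the column names are not stated on this page, so the code only returns whatever fields the first few records contain. Only repository names that appear in the listing are included.

```python
from itertools import islice
from typing import Any, Dict, List

# Repository IDs copied from the listing above (no AB8 entry appears there).
TTS_REPOS: List[str] = [
    f"DataStudio/Vietnamese_Text2Speech_AB{i}" for i in (0, 1, 2, 3, 4, 5, 6, 7, 9)
]


def peek(repo_id: str, n: int = 3, split: str = "train") -> List[Dict[str, Any]]:
    """Return the first n records of a dataset without downloading all of it.

    Hypothetical helper. The split name "train" is an assumption; check the
    dataset viewer for the real split and column names.
    """
    # Imported lazily so the module can be imported without `datasets` installed.
    from datasets import load_dataset

    # streaming=True yields records on demand instead of fetching every file.
    stream = load_dataset(repo_id, split=split, streaming=True)
    return list(islice(stream, n))


if __name__ == "__main__":
    # Requires network access and `pip install datasets`.
    for record in peek(TTS_REPOS[0], n=1):
        print(sorted(record.keys()))
```

Streaming is used here because the listed repositories are large; it lets a single record be inspected before committing to a full download.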