Vietnamese OCR dataset, including word level and line level image data.
Data Studio
community
AI & ML interests
Dataset for Machine Learning.
Organization Card
About org cards
Contact me if needed: minhquannguyen1800@gmail.com (Minh Quan)
Data Information:
OCR
Vietnamese Document with Red Seal: 223,830 samples
Vietnamese Document with Black Seal: 71,970 samples
Vietnamese Document: 1,305,220 samples
Vietnamese Document with underline text: 365,919 samples
Vietnamese Document with 5 colors Highlight: 135,295 samples
Vietnamese Document with Yellow Highlight: 174,282 samples
High quality Vietnamese Document: 22,524 samples
Text-to-Speech
100 hours Vietnamese Male & Female Voice - 100000 audios
datasets
57
DataStudio/Vietnamese_Text2Speech_AB1
Viewer
•
Updated
DataStudio/S2W_format
Viewer
•
Updated
DataStudio/Vietnamese_ASR_TestingData
Viewer
•
Updated
•
49
DataStudio/Viet-wikipedia
Viewer
•
Updated
•
2
DataStudio/T2S_dataset_v2
Viewer
•
Updated
DataStudio/Vietnamese_ASR_TestingData_Old
Viewer
•
Updated
•
7
DataStudio/OCRWordLevelClear_07
Viewer
•
Updated
DataStudio/OCRWordLevelClear_06
Viewer
•
Updated
DataStudio/Vietnamese_Audio_v1.0
Viewer
•
Updated
DataStudio/OCRWordLevelClear_05
Viewer
•
Updated