---
license: mit
language:
- zh
pipeline_tag: image-to-text
---

# Target: Convert Scanned Images of IPA Symbols to Pinyin

The target inputs are scanned images of IPA phonetic symbols for Chengdunese (成都话) from The Great Dictionary of Modern Chinese Dialects (現代漢語方言大詞典).

# Training and Test Set

* 2,553 images of IPA phonetic symbols, generated from Pinyin pronunciations found in the Sichuanese Dialect Dictionary (四川方言词典 教你一口地道的四川话) and the word list of the Shupin (蜀拼) input method.
* 80/20 train/test split

# Results

* Trained for 180 steps with a batch size of 32
* Final Character Error Rate of 0.795% on the test set
* TODO: label part of the scanned images to see whether the model generalizes to the target task
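
For readers who want to reproduce the numbers, here is a minimal sketch of an 80/20 split and a character error rate (CER) calculation. This card does not say how the split was drawn or which CER definition was used, so the sketch assumes a seeded random shuffle and the common corpus-level definition: total Levenshtein edit distance divided by total reference characters. The function names `split_train_test`, `edit_distance`, and `character_error_rate` are illustrative, not part of this model's code.

```python
import random


def split_train_test(items, train_fraction=0.8, seed=0):
    """Shuffle a copy of `items` and cut it into (train, test) lists.

    Assumption: a seeded random shuffle; the actual split may differ.
    """
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


def edit_distance(ref, hyp):
    """Levenshtein distance between two strings (insert/delete/substitute)."""
    prev = list(range(len(hyp) + 1))  # distances from the empty prefix of ref
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete r
                            curr[j - 1] + 1,      # insert h
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(references, hypotheses):
    """Corpus-level CER: total edits / total reference characters."""
    if len(references) != len(hypotheses):
        raise ValueError("references and hypotheses must have equal length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(edit_distance(r, h)
                      for r, h in zip(references, hypotheses))
    return total_edits / total_chars


if __name__ == "__main__":
    # Placeholder indices standing in for the 2,553 images.
    train, test = split_train_test(range(2553))
    print(len(train), len(test))

    # Made-up strings, not real model output.
    print(character_error_rate(["abcd"], ["abxd"]))
```

With 2,553 items and `train_fraction=0.8`, the cut falls at 2,042 training items and 511 test items. Multiply the returned CER by 100 to express it as a percentage, as in the results above.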