light-doc-ocr / config.py
fix: trained model to 0.49 cer and fixed streamlit app issues
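DATASET_NAME = 'Teklia/IAM-line'  # Hugging Face dataset ID (IAM text-line images)
MODEL_NAME = 'microsoft/trocr-small-printed'  # TrOCR checkpoint ID on the Hugging Face Hub
TRAIN_SPLIT = 'train[:200]'  # split slicing syntax: first 200 examples of the train split
TEST_SPLIT_RATIO = 0.2  # fraction of the loaded examples held out for testing
BATCH_SIZE = 8  # examples per batch
EPOCHS = 8  # number of passes over the training data
OUTPUT_DIR = './trocr-ocr'  # directory for training outputs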
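
To get a feel for what these values imply, the arithmetic can be sketched as below. This is only an illustration, not code from the repository: `slice_size` and `training_budget` are hypothetical helpers, and the sketch assumes that `TEST_SPLIT_RATIO` is applied to the 200-example slice, that one optimizer step is taken per batch (no gradient accumulation), and that training runs on a single device.

```python
import math

# Values copied from config.py.
TRAIN_SPLIT = 'train[:200]'
TEST_SPLIT_RATIO = 0.2
BATCH_SIZE = 8
EPOCHS = 8


def slice_size(split: str) -> int:
    """Hypothetical helper: read N out of a 'name[:N]' split string."""
    start = split.index('[:') + 2
    end = split.index(']')
    return int(split[start:end])


def training_budget(split: str, test_ratio: float, batch_size: int, epochs: int) -> dict:
    """Estimate example counts and step counts implied by the config.

    Assumption: the held-out set is test_ratio of the slice, rounded to the
    nearest integer. A real splitting function may round differently.
    """
    total = slice_size(split)
    n_test = round(total * test_ratio)
    n_train = total - n_test
    # The last batch of an epoch may be smaller, hence the ceiling.
    steps_per_epoch = math.ceil(n_train / batch_size)
    return {
        'total': total,
        'n_train': n_train,
        'n_test': n_test,
        'steps_per_epoch': steps_per_epoch,
        'total_steps': steps_per_epoch * epochs,
    }


budget = training_budget(TRAIN_SPLIT, TEST_SPLIT_RATIO, BATCH_SIZE, EPOCHS)
print(budget)
```

Under those assumptions the config yields 160 training examples, 40 held-out examples, 20 steps per epoch and 160 steps in total, which is a small run suited to quick experiments.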