tomofi's picture
Add application file
2366e36
|
raw
history blame
1.88 kB

SegOCR

Abstract

Just a simple Seg-based baseline for text recognition tasks.

Dataset

Train Dataset

trainset instance_num repeat_num source
SynthText 7266686 1 synth

Test Dataset

testset instance_num type
IIIT5K 3000 regular
SVT 647 regular
IC13 1015 regular
CT80 288 irregular

Results and Models

Backbone Neck Head Regular Text Irregular Text download
IIIT5K SVT IC13 CT80
R31-1/16 FPNOCR 1x 90.9 81.8 90.7 80.9 model | log

:::{note}

  • R31-1/16 means the size (both height and width ) of feature from backbone is 1/16 of input image.
  • 1x means the size (both height and width) of feature from head is the same with input image. :::

Citation

@unpublished{key,
  title={SegOCR Simple Baseline.},
  author={},
  note={Unpublished Manuscript},
  year={2021}
}