Update README.md
README.md
@@ -15,7 +15,12 @@ base_model_relation: finetune
## Introduction
-We are excited to announce the release of `
+We are excited to announce the release of `TokenOCR`, the first token-level visual foundation model (VFM) specifically tailored for text-image-related tasks,
+designed to support a variety of traditional downstream applications. To facilitate the pretraining of TokenOCR,
+we also devise a high-quality data production pipeline that constructs the first token-level image-text dataset,
+**TokenIT**, comprising 20 million images and 1.8 billion token-mask pairs.
+Furthermore, leveraging this foundation with exceptional image-as-text capability,
+we seamlessly replace previous VFMs with TokenOCR to construct a document-level MLLM, **TokenVL**, for VQA-based document understanding tasks.