Add reference to donut
Browse files
README.md
CHANGED
@@ -9,7 +9,10 @@ pipeline_tag: image-to-text
|
|
9 |
# Optical Music Recognition Transformer
|
10 |
|
11 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
12 |
-
Image-To-Text model for optical music recognition.
|
|
|
|
|
|
|
13 |
|
14 |
## Demo
|
15 |
|
|
|
9 |
# Optical Music Recognition Transformer
|
10 |
|
11 |
<!-- Provide a quick summary of what the model is/does. [Optional] -->
|
12 |
+
Image-To-Text model for optical music recognition.
|
13 |
+
The model is trained to predict simple notes in the lilypond format from a given image.
|
14 |
+
Training data consists of artificial, handwritten and white board images.
|
15 |
+
The model itself is based on [Donut](https://huggingface.co/docs/transformers/model_doc/donut).
|
16 |
|
17 |
## Demo
|
18 |
|