Flova/omr_transformer

vision-encoder-decoder

image-text-to-text

Flova committed on Mar 16, 2023

Commit 47bbe1e (1 parent: 4e07994)

Add reference to donut

Files changed (1)

README.md +4 -1

@@ -9,7 +9,10 @@ pipeline_tag: image-to-text
 # Optical Music Recognition Transformer
 <!-- Provide a quick summary of what the model is/does. [Optional] -->
-Image-To-Text model for optical music recognition. The model is trained to predict simple notes in the lilypond format from a given image. Training data consists of artificial, handwritten and white board images.
+Image-To-Text model for optical music recognition.
+The model is trained to predict simple notes in the lilypond format from a given image.
+Training data consists of artificial, handwritten and white board images.
+The model itself is based on [Donut](https://huggingface.co/docs/transformers/model_doc/donut).
 ## Demo