DunnBC22
/

trocr-large-printed-cmc7_tesseract_MICR_ocr

vision-encoder-decoder

image-text-to-text

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

DunnBC22 commited on Jul 23, 2023

Commit

fbaf5c3

·

1 Parent(s): 9b8d0ba

Update README.md

Files changed (1) hide show

README.md +17 -9

README.md CHANGED Viewed

@@ -5,26 +5,34 @@ tags:
 model-index:
 - name: trocr-large-printed-cmc7_tesseract_MICR_ocr
   results: []
 ---
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
 # trocr-large-printed-cmc7_tesseract_MICR_ocr
-This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed) on an unknown dataset.
 ## Model description
-More information needed
 ## Intended uses & limitations
-More information needed
 ## Training and evaluation data
-More information needed
 ## Training procedure
@@ -41,11 +49,11 @@ The following hyperparameters were used during training:
 ### Training results
-The Character Error Rate (CER) for this model is 0.004970720413999727
 ### Framework versions
 - Transformers 4.31.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
-- Tokenizers 0.13.3

 model-index:
 - name: trocr-large-printed-cmc7_tesseract_MICR_ocr
   results: []
+license: bsd-3-clause
+language:
+- en
+metrics:
+- cer
+pipeline_tag: image-to-text
 ---
 # trocr-large-printed-cmc7_tesseract_MICR_ocr
+This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed).
 ## Model description
+For more information on how it was created, check out the following link: https://github.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/blob/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(CMC7%20Dataset)/TrOCR_cmc7_tesseractMICR.ipynb
 ## Intended uses & limitations
+This model is intended to demonstrate my ability to solve a complex problem using technology.
 ## Training and evaluation data
+Dataset Source: https://github.com/DoubangoTelecom/tesseractMICR/tree/master/datasets/cmc7
+**Histogram of Label Character Lengths**
+![Histogram of Label Character Lengths](https://raw.githubusercontent.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(CMC7%20Dataset)/Images/Histogram%20of%20Label%20Character%20Length.png)
 ## Training procedure
 ### Training results
+The Character Error Rate (CER) for this model is 0.004970720413999727.
 ### Framework versions
 - Transformers 4.31.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.13.1
+- Tokenizers 0.13.3