DunnBC22 committed on
Commit
fbaf5c3
1 Parent(s): 9b8d0ba

Update README.md

Files changed (1)
  1. README.md +17 -9
README.md CHANGED
@@ -5,26 +5,34 @@ tags:
  model-index:
  - name: trocr-large-printed-cmc7_tesseract_MICR_ocr
  results: []
+ license: bsd-3-clause
+ language:
+ - en
+ metrics:
+ - cer
+ pipeline_tag: image-to-text
  ---

- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
- should probably proofread and complete it, then remove this comment. -->
-
  # trocr-large-printed-cmc7_tesseract_MICR_ocr

- This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed) on an unknown dataset.
+ This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed).

  ## Model description

- More information needed
+ For more information on how it was created, check out the following link: https://github.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/blob/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(CMC7%20Dataset)/TrOCR_cmc7_tesseractMICR.ipynb

  ## Intended uses & limitations

- More information needed
+ This model is intended to demonstrate my ability to solve a complex problem using technology.

  ## Training and evaluation data

- More information needed
+ Dataset Source: https://github.com/DoubangoTelecom/tesseractMICR/tree/master/datasets/cmc7
+
+ **Histogram of Label Character Lengths**
+
+ ![Histogram of Label Character Lengths](https://raw.githubusercontent.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(CMC7%20Dataset)/Images/Histogram%20of%20Label%20Character%20Length.png)
+

  ## Training procedure

@@ -41,11 +49,11 @@ The following hyperparameters were used during training:

  ### Training results

- The Character Error Rate (CER) for this model is 0.004970720413999727
+ The Character Error Rate (CER) for this model is 0.004970720413999727.

  ### Framework versions

  - Transformers 4.31.0
  - Pytorch 2.0.1+cu118
  - Datasets 2.13.1
- - Tokenizers 0.13.3
+ - Tokenizers 0.13.3
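
For context on the CER figure reported in the README above: Character Error Rate is commonly computed as the total character-level edit distance (substitutions, deletions, insertions) between predictions and reference labels, divided by the total number of reference characters. The following is a minimal pure-Python sketch of that definition. The helper names `levenshtein` and `character_error_rate` and the sample strings are hypothetical, and it is an assumption that the linked notebook aggregates errors at corpus level in this way; the notebook itself may rely on a library metric instead.

```python
from typing import Sequence


def levenshtein(ref: str, hyp: str) -> int:
    """Character-level edit distance between a reference and a hypothesis."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]  # distance from ref[:i] to the empty hypothesis prefix
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference character
                curr[j - 1] + 1,     # insertion of a hypothesis character
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def character_error_rate(references: Sequence[str], predictions: Sequence[str]) -> float:
    """Corpus-level CER: total edits divided by total reference characters."""
    if len(references) != len(predictions):
        raise ValueError("references and predictions must have the same length")
    total_chars = sum(len(r) for r in references)
    if total_chars == 0:
        raise ValueError("references contain no characters")
    total_edits = sum(levenshtein(r, p) for r, p in zip(references, predictions))
    return total_edits / total_chars


if __name__ == "__main__":
    # Made-up digit strings standing in for MICR labels (not taken from the dataset).
    refs = ["12345", "67890"]
    preds = ["12345", "67891"]
    print(character_error_rate(refs, preds))  # 0.1 (1 edit over 10 reference characters)
```

Under this definition, the reported value of roughly 0.005 would correspond to about one character error per 200 reference characters.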