DunnBC22 commited on
Commit
980f798
1 Parent(s): 9e58f8a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -10
README.md CHANGED
@@ -1,32 +1,41 @@
1
  ---
2
  tags:
3
  - generated_from_trainer
 
4
  model-index:
5
  - name: trocr-large-printed-e13b_tesseract_MICR_ocr
6
  results: []
 
 
 
 
 
7
  ---
8
 
9
- <!-- This model card has been generated automatically according to the information the Trainer had access to. You
10
- should probably proofread and complete it, then remove this comment. -->
11
-
12
  # trocr-large-printed-e13b_tesseract_MICR_ocr
13
 
14
- This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed) on an unknown dataset.
 
15
  It achieves the following results on the evaluation set:
16
  - Loss: 0.2432
17
- - Cer: 0.0036
18
 
19
  ## Model description
20
 
21
- More information needed
22
 
23
  ## Intended uses & limitations
24
 
25
- More information needed
26
 
27
  ## Training and evaluation data
28
 
29
- More information needed
 
 
 
 
 
30
 
31
  ## Training procedure
32
 
@@ -43,7 +52,7 @@ The following hyperparameters were used during training:
43
 
44
  ### Training results
45
 
46
- | Training Loss | Epoch | Step | Validation Loss | Cer |
47
  |:-------------:|:-----:|:----:|:---------------:|:------:|
48
  | 0.486 | 1.0 | 841 | 0.5168 | 0.0428 |
49
  | 0.2187 | 2.0 | 1682 | 0.2432 | 0.0036 |
@@ -54,4 +63,4 @@ The following hyperparameters were used during training:
54
  - Transformers 4.28.1
55
  - Pytorch 2.0.1
56
  - Datasets 2.13.1
57
- - Tokenizers 0.13.3
 
1
  ---
2
  tags:
3
  - generated_from_trainer
4
+ - TrOCR
5
  model-index:
6
  - name: trocr-large-printed-e13b_tesseract_MICR_ocr
7
  results: []
8
+ license: bsd-3-clause
9
+ language:
10
+ - en
11
+ metrics:
12
+ - cer
13
  ---
14
 
 
 
 
15
  # trocr-large-printed-e13b_tesseract_MICR_ocr
16
 
17
+ This model is a fine-tuned version of [microsoft/trocr-large-printed](https://huggingface.co/microsoft/trocr-large-printed).
18
+
19
  It achieves the following results on the evaluation set:
20
  - Loss: 0.2432
21
+ - CER: 0.0036
22
 
23
  ## Model description
24
 
25
+ For more information on how it was created, check out the following link: https://github.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/blob/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(E15B%20Dataset)/TrOCR-e13b%20-%20tesseractMICR.ipynb
26
 
27
  ## Intended uses & limitations
28
 
29
+ This model is intended to demonstrate my ability to solve a complex problem using technology.
30
 
31
  ## Training and evaluation data
32
 
33
+ Dataset Source: https://github.com/DoubangoTelecom/tesseractMICR/tree/master/datasets/e13b
34
+
35
+ __Histogram of Label Character Lengths__
36
+
37
+ ![Histogram of Label Character Lengths](https://raw.githubusercontent.com/DunnBC22/Vision_Audio_and_Multimodal_Projects/main/Optical%20Character%20Recognition%20(OCR)/Tesseract%20MICR%20(E15B%20Dataset)/Images/Histogram%20of%20Label%20Character%20Length.png)
38
+
39
 
40
  ## Training procedure
41
 
 
52
 
53
  ### Training results
54
 
55
+ | Training Loss | Epoch | Step | Validation Loss | CER |
56
  |:-------------:|:-----:|:----:|:---------------:|:------:|
57
  | 0.486 | 1.0 | 841 | 0.5168 | 0.0428 |
58
  | 0.2187 | 2.0 | 1682 | 0.2432 | 0.0036 |
 
63
  - Transformers 4.28.1
64
  - Pytorch 2.0.1
65
  - Datasets 2.13.1
66
+ - Tokenizers 0.13.3