hsarfraz commited on
Commit
c269efd
·
verified ·
1 Parent(s): 1df1310

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +54 -1
README.md CHANGED
@@ -14,6 +14,9 @@ The base model is ['naver-clova-ix/donut-base'][base], the model is finetuned fo
14
 
15
  For inference use image size width: 1536 px and height: 1536 px
16
 
 
 
 
17
  ```python
18
  from transformers import DonutProcessor, VisionEncoderDecoderModel
19
  from PIL import Image
@@ -70,5 +73,55 @@ print('----------------------------------')
70
  print(output_json)
71
 
72
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
73
 
74
- [base]: https://huggingface.co/naver-clova-ix/donut-base
 
 
14
 
15
  For inference use image size width: 1536 px and height: 1536 px
16
 
17
+ # How to Get Started with the Model
18
+ Use the code below to get started with the model.
19
+
20
  ```python
21
  from transformers import DonutProcessor, VisionEncoderDecoderModel
22
  from PIL import Image
 
73
  print(output_json)
74
 
75
  ```
76
+ # FAKE Synthetic Form 1040 (2023) for illustration purposes only
77
+
78
+ ![FAKE 1040 form for illustration purposes][fake_doc]
79
+
80
+ # Example of json output (based on FAKE 1040 form)
81
+
82
+ ```json
83
+ {
84
+ "lbl_0_03": "Michael Evans",
85
+ "lbl_0_04": "Caldwell",
86
+ "lbl_0_05": "741-52-5353",
87
+ "lbl_0_06": "None",
88
+ "lbl_0_07": "None",
89
+ "lbl_0_08": "None",
90
+ "lbl_0_09": "289 Blackwell Land Suite 380 New Tiffany, NH 07548",
91
+ "lbl_0_11": "East Amandaport",
92
+ "lbl_0_12": "VI",
93
+ "lbl_0_13": "47832",
94
+ "lbl_0_14": "None",
95
+ "lbl_0_15": "None",
96
+ "lbl_0_16": "25677",
97
+ "lbl_0_55": "385321.36",
98
+ "lbl_0_56": "None",
99
+ "lbl_0_57": "None",
100
+ "lbl_0_58": "None",
101
+ "lbl_0_59": "None",
102
+ "lbl_0_60": "None",
103
+ "lbl_0_61": "None",
104
+ "lbl_0_62": "None",
105
+ "lbl_0_63": "None",
106
+ "lbl_0_67": "None",
107
+ "lbl_0_68": "481161.23",
108
+ "lbl_0_69": "None",
109
+ "lbl_0_70": "None",
110
+ "lbl_0_71": "None",
111
+ "lbl_0_72": "749100.68",
112
+ "lbl_0_73": "418381-6",
113
+ "lbl_0_74": "None",
114
+ "lbl_0_77": "755042.64",
115
+ "lbl_0_78": "None",
116
+ "lbl_0_79": "560928.32",
117
+ "lbl_0_80": "493913.73",
118
+ "lbl_0_81": "None",
119
+ "lbl_0_82": "738597.72",
120
+ "lbl_0_83": "34990.46"
121
+ }
122
+ ```
123
+
124
+
125
 
126
+ [base]: https://huggingface.co/naver-clova-ix/donut-base
127
+ [fake_doc]: https://huggingface.co/hsarfraz/irs-tax-form-1040-2023-doc-parser/blob/main/fake_synthetic_form_1040_example.png