Update README.md
Browse files
README.md
CHANGED
@@ -157,3 +157,26 @@ tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
|
|
157 |
Dataset use for finetuning:
|
158 |
|
159 |
mychen76/invoices-and-receipts_ocr_v1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
157 |
Dataset use for finetuning:
|
158 |
|
159 |
mychen76/invoices-and-receipts_ocr_v1
|
160 |
+
|
161 |
+
|
162 |
+
### Usage Notebooks
|
163 |
+
*English Receipts
|
164 |
+
model_id="mychen76/mistral7b_ocr_to_json_v1"
|
165 |
+
https://github.com/minyang-chen/LLM_convert_receipt_image-to-json_or_xml/blob/main/Convert_Receipt_Image-to-Json_using_OCR_to_JSON_v1-English.ipynb
|
166 |
+
|
167 |
+
model_id: mychen76/mistral_ocr2json_v3_chatml
|
168 |
+
https://github.com/minyang-chen/LLM_convert_receipt_image-to-json_or_xml/blob/main/Convert_Receipt_Image-to-Json_using_OCR_to_JSON_v2_ChatML.ipynb
|
169 |
+
|
170 |
+
|
171 |
+
*German Receipts
|
172 |
+
model_id="mychen76/mistral7b_ocr_to_json_v1"
|
173 |
+
|
174 |
+
Test01
|
175 |
+
https://github.com/minyang-chen/LLM_convert_receipt_image-to-json_or_xml/blob/main/Convert_Receipt_Image-to-Json_using_OCR_to_JSON_v1-German-Test1-passed.ipynb
|
176 |
+
|
177 |
+
Test02
|
178 |
+
https://github.com/minyang-chen/LLM_convert_receipt_image-to-json_or_xml/blob/main/Convert_Receipt_Image-to-Json_using_OCR_to_JSON_v1-German-Test2-failed.ipynb
|
179 |
+
|
180 |
+
Test03
|
181 |
+
https://github.com/minyang-chen/LLM_convert_receipt_image-to-json_or_xml/blob/main/Convert_Receipt_Image-to-Json_using_OCR_to_JSON_v1-German-Test3-okay.ipynb
|
182 |
+
|