Spaces:
Sleeping
Sleeping
rmayormartins
commited on
Commit
β’
9aebca9
1
Parent(s):
4f90a60
Subindo arquivos4
Browse files- README.md +39 -3
- app.py +1 -1
- requirements.txt +2 -1
README.md
CHANGED
@@ -1,6 +1,6 @@
|
|
1 |
---
|
2 |
-
title:
|
3 |
-
emoji:
|
4 |
colorFrom: indigo
|
5 |
colorTo: green
|
6 |
sdk: gradio
|
@@ -10,4 +10,40 @@ pinned: false
|
|
10 |
license: ecl-2.0
|
11 |
---
|
12 |
|
13 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
+
title: OCR Img2txt
|
3 |
+
emoji: π
|
4 |
colorFrom: indigo
|
5 |
colorTo: green
|
6 |
sdk: gradio
|
|
|
10 |
license: ecl-2.0
|
11 |
---
|
12 |
|
13 |
+
# OCR Img2txt π
|
14 |
+
|
15 |
+
Try my OCR-Img2txt tool! This tool leverages both Tesseract and EasyOCR engines to extract text from images, offering users the possibility to choose between two leading OCR technologies.
|
16 |
+
|
17 |
+
## Features
|
18 |
+
|
19 |
+
- **OCR Engines**: Choose between Tesseract OCR and EasyOCR for text extraction.
|
20 |
+
- **Multilingual Support**: EasyOCR engine supports multiple languages including English, Portuguese (Brazil), and Spanish.
|
21 |
+
- **User-Friendly Interface**: Gradio interface for easy operation.
|
22 |
+
- **Versatile Use Cases**: Ideal for digitizing documents, extracting text from photos, and converting scanned images into editable text formats.
|
23 |
+
|
24 |
+
## How to Use
|
25 |
+
|
26 |
+
- **Start the App**: Run the `app.py` script to launch the OCR-Img2txt interface.
|
27 |
+
- **Select OCR Engine**: Use the dropdown to choose between Tesseract OCR or EasyOCR.
|
28 |
+
- **Upload Image**: Drag and drop an image or select a file (.jpg, .jpeg, .png) from your device.
|
29 |
+
- **View Results**: Get the extracted text instantly displayed on the screen.
|
30 |
+
|
31 |
+
## Developer
|
32 |
+
|
33 |
+
Developed by Ramon Mayor Martins (2023)
|
34 |
+
|
35 |
+
- Email: [rmayormartins@gmail.com](mailto:rmayormartins@gmail.com)
|
36 |
+
- Homepage: [https://rmayormartins.github.io/](https://rmayormartins.github.io/)
|
37 |
+
- Twitter: [@rmayormartins](https://twitter.com/rmayormartins)
|
38 |
+
- GitHub: [https://github.com/rmayormartins](https://github.com/rmayormartins)
|
39 |
+
|
40 |
+
## Feedback and Contributions
|
41 |
+
|
42 |
+
Your feedback and contributions are welcome! Feel free to open an issue or submit a pull request on GitHub.
|
43 |
+
|
44 |
+
## License
|
45 |
+
|
46 |
+
This project is released under the ECL-2.0 license.
|
47 |
+
|
48 |
+
---
|
49 |
+
*For more information on Hugging Face Spaces configuration, visit [Hugging Face Spaces Config Reference](https://huggingface.co/docs/hub/spaces-config-reference).*
|
app.py
CHANGED
@@ -29,7 +29,7 @@ iface = gr.Interface(
|
|
29 |
inputs=[gr.Dropdown(["Tesseract", "EasyOCR"], label="Select the OCR Engine"), "image"],
|
30 |
outputs="text",
|
31 |
title="OCR Img2txt",
|
32 |
-
description="This application uses Optical Character Recognition (OCR) technology to extract text from images. Choose between Tesseract OCR and EasyOCR engine to process images containing printed text, converting it into editable and searchable text."
|
33 |
examples=[["confusionmatrix.jpg"]]
|
34 |
)
|
35 |
|
|
|
29 |
inputs=[gr.Dropdown(["Tesseract", "EasyOCR"], label="Select the OCR Engine"), "image"],
|
30 |
outputs="text",
|
31 |
title="OCR Img2txt",
|
32 |
+
description="This application uses Optical Character Recognition (OCR) technology to extract text from images. Choose between Tesseract OCR and EasyOCR engine to process images containing printed text, converting it into editable and searchable text.",
|
33 |
examples=[["confusionmatrix.jpg"]]
|
34 |
)
|
35 |
|
requirements.txt
CHANGED
@@ -1,3 +1,4 @@
|
|
|
|
1 |
pytesseract
|
2 |
Pillow
|
3 |
-
|
|
|
1 |
+
gradio
|
2 |
pytesseract
|
3 |
Pillow
|
4 |
+
easyocr
|