|
--- |
|
license: apache-2.0 |
|
--- |
|
|
|
# Suryolo : Layout Model For Arabic Documents |
|
|
|
Suryolo is combination of Surya layout Model form SuryaOCR(based on Segformer) and YoloV10 objection detection. |
|
|
|
## Setup Instructions |
|
|
|
### Clone the Surya OCR GitHub Repository |
|
|
|
```bash |
|
git clone https://github.com/VikParuchuri/surya.git |
|
cd surya |
|
``` |
|
|
|
### Switch to v0.4.14 |
|
|
|
```bash |
|
git checkout f7c6c04 |
|
``` |
|
|
|
### Install Dependencies |
|
|
|
The author has not provided requirements.txt file, but `environment.yml` from our conda environment has been uploaded, This file can be used to recreate environment for Suryolo model. |
|
|
|
|
|
### Suryolo Pipeline |
|
|
|
Download `surya_yolo_pipeline_copy.cpython-310-x86_64-linux-gnu.so` , `yolov10x_best.pt` and `surya folder` from the Repository. |
|
Place `surya_yolo_pipeline_copy.cpython-310-x86_64-linux-gnu.so`, `yolov10x_best.pt` and `surya folder` in same directory (They are dependent on each other). |
|
|
|
```python |
|
from surya_yolo_pipeline_copy import suryolo # This import will originate from surya_yolo_pipeline_copy.cpython-310-x86_64-linux-gnu.so , which is present in the repo. Also this works with Linux based OS only. |
|
from surya.postprocessing.heatmap import draw_bboxes_on_image |
|
|
|
image_path = "sample.jpg" |
|
image = Image.open(image_path) |
|
bboxes = suryolo(image_path) |
|
plotted_image = draw_bboxes_on_image(bboxes,image) |
|
``` |
|
#### Refer to `benchmark.ipynb` for comparison between Traditional Surya Layout Model and Suryolo Layout Model. |