UniParser
/

MolDet

AI4Industry commited on Apr 15

Commit

c1b9a7b

verified ·

1 Parent(s): 706490c

Update README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -8,15 +8,17 @@ From paper: "*MolParser: End-to-end Visual Recognition of Molecule Structures in
 We provide several [ultralytics YOLO11]((https://github.com/ultralytics/ultralytics)) weights for molecule detection with different size & input resolution.
-## general molecule detection
 `moldet_yolo11[size]_640_general.pt`
 * 640x640 input resolution
 * support handwritten molecules
 * multiscale input (inputs can be single/multiple molecular cutouts, reaction or table cutouts, or single-page PDF images)
-<span style='color:gray'>For single-molecule input (used as a classification model), appropriate padding can be added to enhance the performance.</span>
 Result in private testing:
 | size | map50 | map50-95 |
@@ -33,13 +35,18 @@ model = YOLO("moldet_yolo11l_640_general.pt")
 model.predict("path/to/image.png", save=True, imgsz=640, conf=0.5)
 ```
-## PDF molecule detection
 `moldet_yolo11[size]_960_doc.pt`
 * 960x960 input resolution
 * single page PDF image input
 Result in private testing:
 | size | map50 | map50-95 |
 | ---- | ----- | -------- |
@@ -52,4 +59,5 @@ usage:
 ```python
 from ultralytics import YOLO
 model = YOLO("moldet_yolo11l_960_doc.pt")
-model.predict("path/to/pdf_page_image.png", save=True, imgsz=960, conf=0.5)

 We provide several [ultralytics YOLO11]((https://github.com/ultralytics/ultralytics)) weights for molecule detection with different size & input resolution.
+## general molecule structure detection models
 `moldet_yolo11[size]_640_general.pt`
+YOLO11 weights trained on 35k human annotated image crops and 100k generated images
 * 640x640 input resolution
 * support handwritten molecules
 * multiscale input (inputs can be single/multiple molecular cutouts, reaction or table cutouts, or single-page PDF images)
+<span style='color:gray'>Warning: For single-molecule input (used as a classification model), appropriate padding can be added to enhance the performance.</span>
 Result in private testing:
 | size | map50 | map50-95 |
 model.predict("path/to/image.png", save=True, imgsz=640, conf=0.5)
 ```
+## PDF molecule structure detection models
 `moldet_yolo11[size]_960_doc.pt`
+YOLO11 weights trained on 26k human annotated PDF pages (patents, papers, and books)
 * 960x960 input resolution
 * single page PDF image input
+<span style='color:gray'>Warning: It is recommended to use MuPDF to render PDF pages at more than 144dpi.</span>
 Result in private testing:
 | size | map50 | map50-95 |
 | ---- | ----- | -------- |
 ```python
 from ultralytics import YOLO
 model = YOLO("moldet_yolo11l_960_doc.pt")
+model.predict("path/to/pdf_page_image.png", save=True, imgsz=960, conf=0.5)
+```