Theo Viel commited on
Commit
c4d1dfe
·
1 Parent(s): 8a72e26
Files changed (2) hide show
  1. README.md +3 -0
  2. viz.png +3 -0
README.md CHANGED
@@ -18,6 +18,9 @@ tags:
18
 
19
  ## Model Overview
20
 
 
 
 
21
  ### Description
22
 
23
  The **NeMo Retriever Page Elements v3** model is a specialized object detection model designed to identify and extract key elements from charts and graphs. While the underlying technology builds upon work from [Megvii Technology](https://github.com/Megvii-BaseDetection/YOLOX), we developed our own base model through complete retraining rather than using pre-trained weights. YOLOX is an anchor-free version of YOLO (You Only Look Once), this model combines a simpler architecture with enhanced performance. The model is trained to detect **tables**, **charts**, **infographics**, **titles**, **header/footers** and **texts** in documents.
 
18
 
19
  ## Model Overview
20
 
21
+ ![viz.png](viz.png)
22
+ *Preview of the model output on the example image.*
23
+
24
  ### Description
25
 
26
  The **NeMo Retriever Page Elements v3** model is a specialized object detection model designed to identify and extract key elements from charts and graphs. While the underlying technology builds upon work from [Megvii Technology](https://github.com/Megvii-BaseDetection/YOLOX), we developed our own base model through complete retraining rather than using pre-trained weights. YOLOX is an anchor-free version of YOLO (You Only Look Once), this model combines a simpler architecture with enhanced performance. The model is trained to detect **tables**, **charts**, **infographics**, **titles**, **header/footers** and **texts** in documents.
viz.png ADDED

Git LFS Details

  • SHA256: 37f80953d6aef00ae589e02b8b994bfb3e5741476c1753cd221e7f8e5332c7db
  • Pointer size: 131 Bytes
  • Size of remote file: 277 kB