Add ParseBench evaluation results

by boyang-runllama - opened 1 day ago

base: refs/heads/main

←

from: refs/pr/4

Discussion Files changed

+60

-0

boyang-runllama

1 day ago

This PR ensures your model shows up at https://huggingface.co/datasets/llamaindex/ParseBench.

This is based on the new evaluation results feature: https://huggingface.co/docs/hub/eval-results.

Note: this includes per-dimension performance across all 5 ParseBench dimensions (text_content, text_formatting, layout, chart, table) along with the overall mean score.

Add ParseBench evaluation results3e0df125

rhirae

about 21 hours ago

Worth a caveat next to these numbers. On clean, structured pages it scores well, which is most of what ParseBench measures. On faint or low quality scans it hallucinates when it can't read the text it invents plausible content instead of leaving it blank.
In a side by side which I performed, it was the LEAST faithful of several OCR models -

olmOCR-2 > Qianfan-OCR > PaddleOCR-VL > HunyuanOCR > Unlimited-OCR

shihad22

about 16 hours ago

can you share your training data

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment