Document datasets with .pdf files that are usable with pixparse libraries and tools.
Pixel Parsing
Enterprise
community
AI & ML interests
Document and User Interface Parsing, Understanding, Q&A.
Organization Card
About org cards
Multi-modal document, image, and text datasets and models for document understanding, OCR, VQA tasks.
GitHub repos:
- Data Loading:
chug
- https://github.com/huggingface/chug - Modelling:
pixparse
- coming soon
models
None public yet