Datasets scrapped from PDFs on the web. Contains pairs of question-answers on a given page of a given PDF document.
ColDoc
university
AI & ML interests
None defined yet.
Collections
7
datasets
33
coldoc/baseline_cap_infovqa_test_subsampled_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_tatdqa_test_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_syntheticDocQA_artificial_intelligence_test_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_syntheticDocQA_government_reports_test_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_syntheticDocQA_energy_test_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_shiftproject_test_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_docvqa_test_subsampled_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_arxivqa_test_subsampled_ocr_chunk
Viewer
•
Updated
coldoc/baseline_cap_tabfquad_test_subsampled_ocr_chunk
Viewer
•
Updated
coldoc/shiftproject_test_tesseract
Viewer
•
Updated
•
1