Document Models (Pretrained) Various pretrained models for analyzing documents. These need to be fine-tuned for a task naver-clova-ix/donut-base Image-to-Text • Updated Aug 13, 2022 • 27.3k • 149 google/pix2struct-base Image-to-Text • Updated Dec 24, 2023 • 4.99k • 60 google/pix2struct-large Image-to-Text • Updated Sep 6, 2023 • 4.39k • 26 microsoft/layoutlmv3-base Updated 16 days ago • 9.22M • 267
Document Models (Fine-tuned) naver-clova-ix/donut-base-finetuned-cord-v2 Image-to-Text • Updated Aug 13, 2022 • 22.3k • 65 google/pix2struct-docvqa-base Visual Question Answering • Updated Dec 24, 2023 • 140k • 32 google/pix2struct-docvqa-large Visual Question Answering • Updated May 19, 2023 • 2.01k • 29 google/pix2struct-screen2words-base Visual Question Answering • Updated May 19, 2023 • 191 • 20
google/pix2struct-screen2words-base Visual Question Answering • Updated May 19, 2023 • 191 • 20