olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 10 days ago • 89
Platypus: A Generalized Specialist Model for Reading Text in Various Forms Paper • 2408.14805 • Published Aug 27, 2024 • 15
OnnxTR Collection https://github.com/felixdittrich92/OnnxTR • 21 items • Updated Aug 16, 2024 • 6
Running on CPU Upgrade 5k 5k MTEB Leaderboard 🥇 Select benchmarks and languages for text embeddings evaluation
aarhus-city-archives/historical-danish-handwriting Viewer • Updated 15 days ago • 11.3k • 1k • 1