Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1
Prasanna Iyer
prasiyer
Follow
AMajesticRasun17381's profile picture
LeroyDyer's profile picture
2 followers
ยท
2 following
prasiyer
AI & ML interests
None yet
Recent Activity
liked
a model
14 days ago
agentica-org/DeepCoder-14B-Preview
reacted
to
fdaudens
's
post
with ๐ฅ
about 2 months ago
Is this the best tool to extract clean info from PDFs, handwriting and complex documents yet? Open source olmOCR just dropped and the results are impressive. Tested the free demo with various documents, including a handwritten Claes Oldenburg letter. The speed is impressive: 3000 tokens/second on your own GPU - that's 1/32 the cost of GPT-4o ($190/million pages). Game-changer for content extraction and digital archives. To achieve this, Ai2 trained a 7B vision language model on 260K pages from 100K PDFs using "document anchoring" - combining PDF metadata with page images. Best part: it actually understands document structure (columns, tables, equations) instead of just jumbling everything together like most OCR tools. Their human eval results back this up. ๐ Try the demo: https://olmocr.allenai.org Going right into the AI toolkit: https://huggingface.co/spaces/JournalistsonHF/ai-toolkit
reacted
to
nroggendorff
's
post
with ๐
3 months ago
maybe a page where you can find open orgs to get started in collaboration with hf. i see so many people that dont have a direction. i dont have ulterior motives, so dont ask
View all activity
Organizations
None yet
prasiyer
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
a model
14 days ago
agentica-org/DeepCoder-14B-Preview
Text Generation
โข
Updated
13 days ago
โข
35.4k
โข
597