Model prithivMLmods/Qwen2-VL-OCR-2B-Instruct is currently loading
how to use with api interference
AMIA THIERRY STEPHANE
r4gamia
·
AI & ML interests
None yet
Recent Activity
replied to
prithivMLmods's
post
6 days ago
Qwen2VL Models: Vision and Language Processing 🍉
📍FT; [ Latex OCR, Math Parsing, Text Analogy OCRTest ]
❄️Demo : https://huggingface.co/spaces/prithivMLmods/Qwen2-VL-2B . The demo includes the Qwen2VL 2B Base Model.
🎯The space handles documenting content from the input image along with standardized plain text. It includes adjustment tools with over 30 font styles, file formatting support for PDF and DOCX, textual alignments, font size adjustments, and line spacing modifications.
📄PDFs are rendered using the ReportLab software library toolkit.
🧵Models :
+ https://huggingface.co/prithivMLmods/Qwen2-VL-OCR-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct
🚀Sample Document :
+ https://drive.google.com/file/d/1Hfqqzq4Xc-3eTjbz-jcQY84V5E1YM71E/view?usp=sharing
📦Collection :
+ https://huggingface.co/collections/prithivMLmods/vision-language-models-67639f790e806e1f9799979f
.
.
.
@prithivMLmods 🤗
Organizations
None yet
r4gamia's activity
replied to
prithivMLmods's
post
6 days ago