AMIA THIERRY STEPHANE

r4gamia

AI & ML interests

None yet

Recent Activity

liked a Space 28 days ago

prithivMLmods/Multimodal-OCR

replied to prithivMLmods's post about 2 months ago

Qwen2VL Models: Vision and Language Processing 🍉

📍FT; [ Latex OCR, Math Parsing, Text Analogy OCRTest ]

Colab Demo: https://huggingface.co/prithivMLmods/Qwen2-VL-OCR-2B-Instruct/blob/main/Demo/ocrtest_qwen.ipynb

❄️Demo: https://huggingface.co/spaces/prithivMLmods/Qwen2-VL-2B . The demo includes the Qwen2VL 2B Base Model.

🎯The space handles documenting content from the input image along with standardized plain text. It includes adjustment tools with over 30 font styles, file formatting support for PDF and DOCX, textual alignments, font size adjustments, and line spacing modifications.

📄PDFs are rendered using the ReportLab software library toolkit.

🧵Models:
+ https://huggingface.co/prithivMLmods/Qwen2-VL-OCR-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Ocrtest-2B-Instruct
+ https://huggingface.co/prithivMLmods/Qwen2-VL-Math-Prase-2B-Instruct

🚀Sample Document:
+ https://drive.google.com/file/d/1Hfqqzq4Xc-3eTjbz-jcQY84V5E1YM71E/view?usp=sharing

📦Collection:
+ https://huggingface.co/collections/prithivMLmods/vision-language-models-67639f790e806e1f9799979f

@prithivMLmods 🤗

Organizations

None yet

r4gamia's activity

replied to prithivMLmods's post about 2 months ago

"Model prithivMLmods/Qwen2-VL-OCR-2B-Instruct is currently loading"

How do I use this model with the Inference API?