11 38 159

dinhanhx

dinhanhx

AI & ML interests

Vision Language

Recent Activity

liked a model 18 days ago

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

liked a model 25 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

liked a Space 26 days ago

artificialguybr/Surya-OCR

View all activity

Organizations

dinhanhx's activity

liked a model 18 days ago

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int8

Text Generation • Updated Oct 9, 2024 • 51.5k • 20

liked a model 25 days ago

mistralai/Mistral-Small-3.1-24B-Instruct-2503

Image-Text-to-Text • Updated 5 days ago • 137k • 1.11k

liked 2 Spaces 26 days ago

Surya OCR

👀

Analyze documents to extract and structure text

149

DocLayout YOLO

🚀

Demo for DocLayout-YOLO

New activity in jinaai/xlm-roberta-flash-implementation-onnx 26 days ago

Errors on rerun your code

#1 opened 3 months ago by

nosaty

New activity in jinaai/jina-embeddings-v3 26 days ago

The process to convert this model to onnx format

#119 opened 26 days ago by

dinhanhx

replied to merve's post 27 days ago

Time to burn META

upvoted a paper about 2 months ago

π_0: A Vision-Language-Action Flow Model for General Robot Control

Paper • 2410.24164 • Published Oct 31, 2024 • 7

liked a Space about 2 months ago

VLM R1 Referral Expression

💬

Highlight described objects in images

upvoted an article about 2 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 308

upvoted 2 articles 2 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 170

Article

Visual Document Retrieval Goes Multilingual

Jan 10

• 70

liked a Space 2 months ago

Timm + Transformers

😻

Any timm model with the transformers integration

upvoted an article 2 months ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 839

replied to merve's post 3 months ago

that's a little bit extreme and skeptic. Let's have faith

liked a model 3 months ago

erax-ai/EraX-VL-2B-V1.5

Visual Question Answering • Updated Jan 15 • 1.14k • 6

liked a model 4 months ago