Phan Hoang's picture

Phan Hoang

phanhoang

·

AI & ML interests

None yet

Recent Activity

liked a Space about 1 month ago

omlab/VLM-R1-Referral-Expression

liked a model 3 months ago

ByteDance/Sa2VA-1B

liked a model 3 months ago

OpenGVLab/InternVL2_5-4B-MPO

View all activity

Organizations

None yet

phanhoang's activity

upvoted a paper 4 months ago

Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion

Paper • 2412.04424 • Published Dec 5, 2024 • 62

upvoted a collection 6 months ago

DocLayout-YOLO

Dataset and model for DocLayout-YOLO • 10 items • Updated Jan 14 • 16

upvoted a paper 6 months ago

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 36

upvoted 4 collections 6 months ago

📑Trending Papers - September 9⃣️

10 items • Updated 7 days ago • 9

Emu3

Emu3: Next-Token Prediction is All You Need • 7 items • Updated Feb 13 • 70

Molmo

Artifacts for open multimodal language models. • 5 items • Updated 22 days ago • 300

Qwen2.5

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 46 items • Updated Feb 26 • 580

upvoted 2 papers 7 months ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 76

General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model

Paper • 2409.01704 • Published Sep 3, 2024 • 84

upvoted an article 7 months ago

Article

Making LLMs lighter with AutoGPTQ and transformers

Aug 23, 2023

• 46

upvoted 2 collections 7 months ago

Awesome Document AI

A collection of open-source document AI 📄 📝 📈 • 27 items • Updated Mar 11, 2024 • 80

Qwen2-VL

Vision-language model series based on Qwen2 • 16 items • Updated Dec 6, 2024 • 209

upvoted a paper 7 months ago

Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Models

Paper • 2408.02442 • Published Aug 5, 2024 • 21

upvoted 2 collections 7 months ago

Function Calling Dataset

7 items • Updated Dec 5, 2023 • 6

Papers I want to read

Papers in my to-read list • 259 items • Updated Jan 10 • 30

upvoted 2 articles 8 months ago

Article

Tool Use, Unified

Aug 12, 2024

• 97

Article

Introducing TextImage Augmentation for Document Images

Aug 6, 2024

• 32