Jiahao Meng's picture

1 7

Jiahao Meng

marinero4972

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

upvoted a paper 1 day ago

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

new activity 2 days ago

General-Level/General-Bench-Openset:Update README.md

View all activity

Organizations

marinero4972's activity

upvoted 2 papers 1 day ago

The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer

Paper • 2504.10462 • Published 3 days ago • 12

Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding

Paper • 2504.10465 • Published 3 days ago • 24

upvoted a paper 3 months ago

Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7 • 46

upvoted a paper 6 months ago

Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis

Paper • 2410.08261 • Published Oct 10, 2024 • 52

upvoted 3 papers 10 months ago

MotionBooth: Motion-Aware Customized Text-to-Video Generation

Paper • 2406.17758 • Published Jun 25, 2024 • 19

MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning

Paper • 2406.17770 • Published Jun 25, 2024 • 19

OMG-LLaVA: Bridging Image-level, Object-level, Pixel-level Reasoning and Understanding

Paper • 2406.19389 • Published Jun 27, 2024 • 55