Chenghao Zhang's picture

18 4

Chenghao Zhang

Snow-Nation

·

AI & ML interests

CG, LLM

Recent Activity

upvoted a paper 4 days ago

Unified Reward Model for Multimodal Understanding and Generation

upvoted a paper 5 days ago

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

upvoted a paper 5 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

View all activity

Organizations

None yet

Snow-Nation's activity

upvoted a paper 4 days ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 17 days ago • 107

upvoted 2 papers 5 days ago

Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey

Paper • 2503.12605 • Published 7 days ago • 27

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 5 days ago • 94

upvoted a paper about 1 month ago

mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data

Paper • 2502.08468 • Published Feb 12 • 13

upvoted 9 papers 3 months ago

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 101

Do NOT Think That Much for 2+3=? On the Overthinking of o1-Like LLMs

Paper • 2412.21187 • Published Dec 30, 2024 • 40

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 356

INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning

Paper • 2401.06532 • Published Jan 12, 2024 • 12

Progressive Multimodal Reasoning via Active Retrieval

Paper • 2412.14835 • Published Dec 19, 2024 • 73

Understand What LLM Needs: Dual Preference Alignment for Retrieval-Augmented Generation

Paper • 2406.18676 • Published Jun 26, 2024 • 6

Smaller Language Models Are Better Instruction Evolvers

Paper • 2412.11231 • Published Dec 15, 2024 • 27

IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations

Paper • 2412.12083 • Published Dec 16, 2024 • 12

RetroLLM: Empowering Large Language Models to Retrieve Fine-grained Evidence within Generation

Paper • 2412.11919 • Published Dec 16, 2024 • 34

upvoted a paper 4 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 120

upvoted 2 papers 5 months ago

HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG Systems

Paper • 2411.02959 • Published Nov 5, 2024 • 68

CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation

Paper • 2410.23090 • Published Oct 30, 2024 • 55

upvoted a collection 5 months ago

InternVL2.0

Expanding Performance Boundaries of Open-Source MLLM • 15 items • Updated Jan 10 • 91

upvoted a paper 5 months ago

Toward General Instruction-Following Alignment for Retrieval-Augmented Generation

Paper • 2410.09584 • Published Oct 12, 2024 • 48