2 29 21

Wujian Peng

wjpoom

https://scholar.google.com/citations?user=GTuWk9YAAAAJ&hl=zh-CN

wjpoom

AI & ML interests

None yet

Recent Activity

authored a paper 11 days ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

upvoted a paper 13 days ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

upvoted a paper 24 days ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

View all activity

Organizations

wjpoom's activity

authored a paper 11 days ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published 14 days ago • 29

upvoted a paper 13 days ago

CoMP: Continual Multimodal Pre-training for Vision Foundation Models

Paper • 2503.18931 • Published 14 days ago • 29

upvoted a paper 24 days ago

World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning

Paper • 2503.10480 • Published 25 days ago • 49

upvoted a paper 28 days ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published about 1 month ago • 114

updated 2 datasets about 1 month ago

Inst-IT/Inst-It-Bench

Viewer • Updated Mar 3 • 4.07k • 126 • 1

Inst-IT/Inst-It-Dataset

Viewer • Updated Mar 1 • 72.5k • 381 • 7

updated a Space about 1 month ago

README

🐨

Boosting Multimodal Understanding at Instance-Level

published a Space about 1 month ago

README

🐨

Boosting Multimodal Understanding at Instance-Level

updated a collection about 1 month ago

Inst-IT Models

Collection

A series of LMMs finetuned with the Inst-IT Dataset, skilled in fine-grained image/video understanding at the instance-level. • 2 items • Updated 21 days ago

updated a model about 1 month ago

Inst-IT/LLaVA-Next-Inst-It-Qwen2-7B

Video-Text-to-Text • Updated Feb 21 • 26 • 3

liked a dataset about 2 months ago

Inst-IT/Inst-It-Bench

Viewer • Updated Mar 3 • 4.07k • 126 • 1

updated a model about 2 months ago

Inst-IT/LLaVA-Next-Inst-It-Vicuna-7B

Video-Text-to-Text • Updated Feb 20 • 20 • 2

updated 3 datasets 3 months ago