Kiren Wang

KirenWH

AI & ML interests

MLLM

Recent Activity

liked a dataset 8 days ago

lmms-lab/LLaVA-OneVision-Data

liked a dataset 12 days ago

yuvalkirstain/pickapic_v1

liked a dataset about 1 month ago

ds4sd/DocLayNet-v1.2

View all activity

Organizations

KirenWH's activity

liked a dataset 8 days ago

lmms-lab/LLaVA-OneVision-Data

Viewer • Updated Oct 22, 2024 • 3.72M • 19.7k • 181

liked a dataset 12 days ago

yuvalkirstain/pickapic_v1

Viewer • Updated May 5, 2023 • 616k • 5.6k • 39

liked 2 datasets about 1 month ago

ds4sd/DocLayNet-v1.2

Viewer • Updated Feb 10 • 80.9k • 1.04k • 3

ds4sd/DocLayNet-v1.1

Viewer • Updated Sep 1, 2023 • 23.5k • 9.92k • 23

upvoted a paper about 1 month ago

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published Mar 7 • 118

updated a dataset about 2 months ago

KirenWH/OCR_R1_test

Viewer • Updated Feb 25 • 100 • 27

published a dataset about 2 months ago

KirenWH/OCR_R1_test

Viewer • Updated Feb 25 • 100 • 27

upvoted a paper 4 months ago

Rethinking Token Reduction in MLLMs: Towards a Unified Paradigm for Training-Free Acceleration

Paper • 2411.17686 • Published Nov 26, 2024 • 21

liked 2 models 4 months ago

IndexTeam/Index-1.9B-32K

Updated Sep 11, 2024 • 59 • 4

Qwen/QVQ-72B-Preview

Image-Text-to-Text • Updated Jan 12 • 50.1k • • 586

liked a dataset 4 months ago

echo840/Detailed_Caption

Preview • Updated Apr 23, 2024 • 68 • 20

liked a model 4 months ago

google/siglip-base-patch16-224

Zero-Shot Image Classification • Updated Sep 26, 2024 • 231k • 43

upvoted a paper 5 months ago

Identity-Preserving Text-to-Video Generation by Frequency Decomposition

Paper • 2411.17440 • Published Nov 26, 2024 • 38

liked a dataset 5 months ago

lmms-lab/LLaVA-ReCap-558K

Viewer • Updated May 28, 2024 • 558k • 893 • 25

updated a dataset 6 months ago

KirenWH/LLaVA-Finetune

Updated Oct 24, 2024 • 45

upvoted an article 6 months ago

Article

Fixing Gradient Accumulation

Oct 16, 2024

• 53

liked 2 models 8 months ago

OpenGVLab/InternVL2-1B

Image-Text-to-Text • Updated 28 days ago • 42.4k • 71

Qwen/Qwen2-7B-Instruct

Text Generation • Updated Aug 21, 2024 • 288k • • 628

liked a model 9 months ago

internlm/internlm2-1_8b-reward

Text Classification • Updated Mar 13 • 8.09k • 13