OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation Paper • 2412.09585 • Published Dec 12, 2024 • 10
Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension Paper • 2412.03704 • Published Dec 4, 2024 • 6
SlowFast-VGen: Slow-Fast Learning for Action-Driven Long Video Generation Paper • 2410.23277 • Published Oct 30 • 9
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities Paper • 2408.00765 • Published Aug 1 • 12
VideoGUI: A Benchmark for GUI Automation from Instructional Videos Paper • 2406.10227 • Published Jun 14 • 9
MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos Paper • 2406.08407 • Published Jun 12 • 24
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs Paper • 2404.16375 • Published Apr 25 • 16
Design2Code: How Far Are We From Automating Front-End Engineering? Paper • 2403.03163 • Published Mar 5 • 93
StrokeNUWA: Tokenizing Strokes for Vector Graphic Synthesis Paper • 2401.17093 • Published Jan 30 • 19
GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation Paper • 2311.07562 • Published Nov 13, 2023 • 13