Ziyang Luo

Ziyang

https://chiyeunglaw.github.io/

ChiYeungLaw

AI & ML interests

LLMs, Multimodal ML

Recent Activity

updated a dataset 26 days ago

TransferLM/mp1

updated a dataset 26 days ago

TransferLM/mp2

published a dataset 26 days ago

TransferLM/mp2

Ziyang's activity

upvoted an article about 1 month ago

✴️ ScreenSpot-Pro: GUI Grounding for Professional High-Resolution Computer Use

Article • Jan 3 • 13

upvoted a paper 3 months ago

VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation

Paper • 2411.13281 • Published Nov 20, 2024 • 18

upvoted an article 5 months ago

The Annotated Diffusion Model

Article • Jun 7, 2022 • 139

upvoted a paper 5 months ago

Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution

Paper • 2310.16834 • Published Oct 25, 2023 • 4

upvoted an article 7 months ago

The Rise of Agentic Data Generation

Article • Jul 15, 2024 • 81

upvoted a paper 8 months ago

VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs

Paper • 2406.07476 • Published Jun 11, 2024 • 34

upvoted a collection 8 months ago

From screenshots to HTML

Collection

WebSight is a dataset of 823,000 HTML/CSS codes representing synthetically generated English websites, each accompanied by a corresponding screenshot. • 4 items • Updated Apr 15, 2024 • 20

upvoted a paper 9 months ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Paper • 2404.07972 • Published Apr 11, 2024 • 48

upvoted a paper 10 months ago

MMCode: Evaluating Multi-Modal Code Large Language Models with Visually Rich Programming Problems

Paper • 2404.09486 • Published Apr 15, 2024 • 1

upvoted a paper 12 months ago

aMUSEd: An Open MUSE Reproduction

Paper • 2401.01808 • Published Jan 3, 2024 • 29

upvoted 3 papers about 1 year ago

GOAT-Bench: Safety Insights to Large Multimodal Models through Meme-Based Social Abuse

Paper • 2401.01523 • Published Jan 3, 2024 • 1

GAIA: a benchmark for General AI Assistants

Paper • 2311.12983 • Published Nov 21, 2023 • 191

LexLIP: Lexicon-Bottlenecked Language-Image Pre-Training for Large-Scale Image-Text Retrieval

Paper • 2302.02908 • Published Feb 6, 2023 • 1

upvoted a paper over 1 year ago

Demystifying GPT Self-Repair for Code Generation

Paper • 2306.09896 • Published Jun 16, 2023 • 19