Yang Yue

yueyang2000

yueyang2000

AI & ML interests

None yet

Recent Activity

liked a dataset 41 minutes ago

allenai/pixmo-points

upvoted a paper 14 days ago

Self-rewarding correction for mathematical reasoning

upvoted a paper 22 days ago

Magma: A Foundation Model for Multimodal AI Agents

View all activity

Organizations

None yet

yueyang2000's activity

liked a dataset 41 minutes ago

allenai/pixmo-points

Viewer • Updated Nov 27, 2024 • 2.38M • 640 • 22

upvoted a paper 14 days ago

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 15 days ago • 77

upvoted a paper 22 days ago

Magma: A Foundation Model for Multimodal AI Agents

Paper • 2502.13130 • Published 23 days ago • 56

liked a dataset 25 days ago

open-thoughts/OpenThoughts-114k

Viewer • Updated 22 days ago • 228k • 86.6k • 653

upvoted a paper about 1 month ago

Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE

Paper • 2502.06282 • Published Feb 10 • 5

upvoted a paper about 2 months ago

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 23

upvoted 3 papers 3 months ago

Expanding Performance Boundaries of Open-Source Multimodal Models with Model, Data, and Test-Time Scaling

Paper • 2412.05271 • Published Dec 6, 2024 • 138

Infinity: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Paper • 2412.04431 • Published Dec 5, 2024 • 18

GRAPE: Generalizing Robot Policy via Preference Alignment

Paper • 2411.19309 • Published Nov 28, 2024 • 44

liked a model 3 months ago

google/t5-v1_1-base

Text2Text Generation • Updated Jan 24, 2023 • 81.2k • • 56

liked a Space 6 months ago

7.86k

Kolors Virtual Try-On

👕

Upload images to try on clothes virtually

upvoted 3 papers 7 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 126

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Paper • 2408.11039 • Published Aug 20, 2024 • 59

Achieving Human Level Competitive Robot Table Tennis

Paper • 2408.03906 • Published Aug 7, 2024 • 27

liked a model 10 months ago

Salesforce/xgen-mm-phi3-mini-instruct-r-v1

Image-Text-to-Text • Updated Feb 3 • 1.17k • 185

upvoted a paper 10 months ago

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 111

liked a dataset 11 months ago

AbdomenAtlas/AbdomenAtlas1.0MiniBeta

Updated Jan 16 • 26 • 7

upvoted a paper 11 months ago

InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD

Paper • 2404.06512 • Published Apr 9, 2024 • 30

upvoted a paper 12 months ago

When Do We Not Need Larger Vision Models?

Paper • 2403.13043 • Published Mar 19, 2024 • 25

upvoted a paper about 1 year ago

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Paper • 2403.03507 • Published Mar 6, 2024 • 186