wuyuhao's picture

19 3

wuyuhao

mozhu

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Referring to Any Person

upvoted a paper 1 day ago

^RFLAV: Rolling Flow matching for infinite Audio Video generation

upvoted a paper 1 day ago

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

View all activity

Organizations

None yet

mozhu's activity

upvoted 6 papers 1 day ago

Referring to Any Person

Paper • 2503.08507 • Published 2 days ago • 5

^RFLAV: Rolling Flow matching for infinite Audio Video generation

Paper • 2503.08307 • Published 3 days ago • 8

REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding

Paper • 2503.07413 • Published 3 days ago • 1

What's in a Latent? Leveraging Diffusion Latent Space for Domain Generalization

Paper • 2503.06698 • Published 4 days ago • 2

NeuGrasp: Generalizable Neural Surface Reconstruction with Background Priors for Material-Agnostic Object Grasp Detection

Paper • 2503.03511 • Published 8 days ago • 1

Beyond Decoder-only: Large Language Models Can be Good Encoders for Machine Translation

Paper • 2503.06594 • Published 5 days ago • 4

upvoted a paper 21 days ago

LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models

Paper • 2502.14834 • Published 21 days ago • 24

upvoted a paper 23 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 26 days ago • 142

upvoted a paper 24 days ago

LongGenBench: Long-context Generation Benchmark

Paper • 2410.04199 • Published Oct 5, 2024 • 20

upvoted a paper about 2 months ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

upvoted 3 papers 2 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92

Efficiently Serving LLM Reasoning Programs with Certaindex

Paper • 2412.20993 • Published Dec 30, 2024 • 36

upvoted 3 papers 3 months ago

LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks

Paper • 2412.15204 • Published Dec 19, 2024 • 33

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 47

o1-Coder: an o1 Replication for Coding

Paper • 2412.00154 • Published Nov 29, 2024 • 44

upvoted 2 papers 6 months ago

Spinning the Golden Thread: Benchmarking Long-Form Generation in Language Models

Paper • 2409.02076 • Published Sep 3, 2024 • 12

No Training, No Problem: Rethinking Classifier-Free Guidance for Diffusion Models

Paper • 2407.02687 • Published Jul 2, 2024 • 24

upvoted a paper over 1 year ago

OtterHD: A High-Resolution Multi-modality Model

Paper • 2311.04219 • Published Nov 7, 2023 • 33