Ligeng Zhu's picture

Ligeng Zhu

Ligeng-Zhu

·

AI & ML interests

None yet

Recent Activity

new activity 7 days ago

Efficient-Large-Model/NVILA-Lite-2B-hf-preview:Do you have any plan to support other NVILA models such as NVILA-15B?

updated a model 7 days ago

Efficient-Large-Model/NVILA-Lite-8B-hf-preview

updated a model 7 days ago

Efficient-Large-Model/NVILA-Lite-2B-hf-preview

View all activity

Organizations

Ligeng-Zhu's activity

upvoted an article 25 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

27 days ago

• 376

upvoted a collection 5 months ago

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 21 items • Updated 23 days ago • 90

upvoted a paper 6 months ago

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 112

upvoted a paper 8 months ago

Wolf: Captioning Everything with a World Summarization Framework

Paper • 2407.18908 • Published Jul 26, 2024 • 32

upvoted a collection 10 months ago

VILA: On Pre-training for Visual Language Models

10 items • Updated Oct 31, 2024 • 53

upvoted a paper 12 months ago

A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions

Paper • 2312.08578 • Published Dec 14, 2023 • 20

upvoted 3 papers over 1 year ago

VILA: On Pre-training for Visual Language Models

Paper • 2312.07533 • Published Dec 12, 2023 • 23

Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference

Paper • 2310.04378 • Published Oct 6, 2023 • 20

PockEngine: Sparse and Efficient Fine-tuning in a Pocket

Paper • 2310.17752 • Published Oct 26, 2023 • 14