Zach Mustafa PRO

Zmu

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

DeepGlint-AI/UniME-LLaVA-1.6-7B

liked a dataset 1 day ago

OpenGVLab/InternVL-Data

liked a model 4 days ago

ostris/Flex.2-preview

Zmu's activity

upvoted 2 collections 4 days ago

Perception LM

7 items • Updated 10 days ago • 31

Perception Encoder

9 items • Updated 10 days ago • 35

upvoted a paper 4 days ago

Vidi: Large Multimodal Models for Video Understanding and Editing

Paper • 2504.15681 • Published 5 days ago • 14

upvoted 2 collections about 1 month ago

LipSync and Face Operations

18 items • Updated 17 days ago • 48

Excellent SLM & SVLM

Excellent SLM (small language models) and SVLM (small vision language models). • 29 items • Updated 26 days ago • 4

upvoted a paper about 1 month ago

Gemini Embedding: Generalizable Embeddings from Gemini

Paper • 2503.07891 • Published Mar 10 • 37

upvoted an article about 2 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 400

upvoted a paper 2 months ago

VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing

Paper • 2502.17258 • Published Feb 24 • 79

upvoted an article 2 months ago

Article

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Feb 19

• 69

upvoted a paper 2 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 143

upvoted a collection 2 months ago

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated 2 days ago • 82

upvoted an article 2 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

• 238

upvoted a paper 2 months ago

LLM Agents Making Agent Tools

Paper • 2502.11705 • Published Feb 17 • 2

upvoted an article 2 months ago

Article

Build awesome datasets for video generation

Feb 12

• 30

upvoted 2 collections 3 months ago

Temporal Preference Optimization

Temporal Preference Optimization for Long-form Video Understanding • 3 items • Updated Jan 19 • 5

VideoChat-Flash

Faster and more powerful VideoChat. • 15 items • Updated 7 days ago • 11

upvoted a paper 3 months ago

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

upvoted an article 3 months ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

Jan 23

• 173