-
Differential Transformer
Paper • 2410.05258 • Published • 178 -
PaliGemma 2: A Family of Versatile VLMs for Transfer
Paper • 2412.03555 • Published • 135 -
VisionZip: Longer is Better but Not Necessary in Vision Language Models
Paper • 2412.04467 • Published • 111 -
o1-Coder: an o1 Replication for Coding
Paper • 2412.00154 • Published • 45
BuiDoan
BuiDoan
AI & ML interests
None yet
Recent Activity
liked
a model
17 days ago
deepseek-ai/DeepSeek-V3-0324
upvoted
a
paper
25 days ago
Transformers without Normalization
updated
a collection
27 days ago
Great paper
Organizations
Collections
4
models
None public yet
datasets
None public yet