Arkajyoti Mitra's picture

9 1

Arkajyoti Mitra

aeros93

·

AI & ML interests

Deep Learning, Computer Vision, Vision Language Models, Diffusion, Gaussian Splatting

Recent Activity

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

upvoted a paper 4 months ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

liked a Space 4 months ago

akhaliq/anychat

View all activity

Organizations

aeros93's activity

upvoted a paper about 1 month ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 203

upvoted 2 papers 4 months ago

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

Paper • 2411.18613 • Published Nov 27, 2024 • 52

LLaMA-Mesh: Unifying 3D Mesh Generation with Language Models

Paper • 2411.09595 • Published Nov 14, 2024 • 73

upvoted an article 9 months ago

Article

Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models

Jun 24, 2024

• 189

upvoted 2 articles 10 months ago

Article

PaliGemma – Google's Cutting-Edge Open Vision Language Model

May 14, 2024

• 243

Article

Vision Language Models Explained

Apr 11, 2024

• 286

upvoted a collection 11 months ago

Vision Language Models Papers 🖼️💬📝

Papers about vision-language models, most important ones are on top of the list. • 27 items • Updated Apr 30, 2024 • 36

upvoted an article 11 months ago

Article

seemore: Implement a Vision Language Model from Scratch

By

•

Jun 23, 2024

• 74

upvoted a paper over 1 year ago

LucidDreamer: Domain-free Generation of 3D Gaussian Splatting Scenes

Paper • 2311.13384 • Published Nov 22, 2023 • 52