Rishit Dagli

rishitdagli

https://rishitdagli.com/

AI & ML interests

vision, learning algorithms

Recent Activity

published a model about 1 hour ago

rishitdagli/nerf-gs-datasets

updated a dataset about 1 hour ago

rishitdagli/nerf-gs-datasets

liked a dataset about 2 hours ago

rishitdagli/nerf-gs-datasets

rishitdagli's activity

upvoted a paper 1 day ago

Can Vision-Language Models Answer Face to Face Questions in the Real-World?

Paper • 2503.19356 • Published 3 days ago • 1

upvoted a paper 2 days ago

FFN Fusion: Rethinking Sequential Computation in Large Language Models

Paper • 2503.18908 • Published 3 days ago • 16

upvoted a paper 4 months ago

LLaVA-o1: Let Vision Language Models Reason Step-by-Step

Paper • 2411.10440 • Published Nov 15, 2024 • 122

upvoted 3 papers 6 months ago

NVLM: Open Frontier-Class Multimodal LLMs

Paper • 2409.11402 • Published Sep 17, 2024 • 74

Implicit Neural Representations with Fourier Kolmogorov-Arnold Networks

Paper • 2409.09323 • Published Sep 14, 2024 • 5

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published Sep 17, 2024 • 114

upvoted an article 8 months ago

SmolLM - blazingly fast and remarkably powerful

Article • Jul 16, 2024 • 346

upvoted 2 papers 9 months ago

EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Paper • 2406.13457 • Published Jun 19, 2024 • 17

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16, 2024 • 131

upvoted a collection 9 months ago

SEE-2-SOUND

5 items • Updated Nov 9, 2024 • 4

upvoted an article 9 months ago

🧨 Diffusers welcomes Stable Diffusion 3

Article • Jun 12, 2024 • 95

upvoted a paper 10 months ago

SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

Paper • 2406.06612 • Published Jun 6, 2024 • 16