7 13 62

Jaykumaran R

Jaykumaran17

Jaykumaran

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

liked a model 4 days ago

physical-intelligence/fast

upvoted an article 4 days ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

View all activity

Organizations

Jaykumaran17's activity

upvoted 3 articles 4 days ago

Article

NVIDIA's GTC 2025 Announcement for Physical AI Developers: New Open Models and Datasets

21 days ago

• 33

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 134

Article

SmolVLM - small yet mighty Vision Language Model

Nov 26, 2024

• 230

upvoted a paper 20 days ago

VGGT: Visual Geometry Grounded Transformer

Paper • 2503.11651 • Published 24 days ago • 20

upvoted a paper 21 days ago

Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models

Paper • 2503.09573 • Published 26 days ago • 68

upvoted a collection 4 months ago

Molmo

Collection

Artifacts for open multimodal language models. • 5 items • Updated 25 days ago • 300

upvoted a collection 7 months ago

ViDoRe Benchmark

Collection

Benchmark for document retrieval using visual features, introduced in the ColPali paper. Datasets are using the QA format. • 10 items • Updated Jan 23 • 15

upvoted an article 8 months ago

Article

Vision Language Models Explained

Apr 11, 2024

• 304

upvoted an article 9 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1, 2024

• 74