Shikhar Singh

AxAI

axe--

AI & ML interests

Commonsense & Language Grounding

Recent Activity

upvoted an article 3 days ago

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

upvoted an article 3 days ago

Open-source DeepResearch – Freeing our search agents

liked a dataset 5 days ago

AI-MO/NuminaMath-1.5

View all activity

Organizations

None yet

AxAI's activity

upvoted 2 articles 3 days ago

Article

SmolVLM Grows Smaller – Introducing the 250M & 500M Models!

25 days ago

• 133

Article

Open-source DeepResearch – Freeing our search agents

13 days ago

• 984

upvoted an article 5 days ago

Article

Open R1: Update #2

and 6 others •

6 days ago

• 166

upvoted a paper 23 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 25 days ago • 319

upvoted a paper about 1 month ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 255

upvoted an article about 1 month ago

Article

Introducing AuraFace: Open-Source Face Recognition and Identity Preservation Models

•

Aug 26, 2024

• 43

upvoted 2 papers about 2 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 345

Are Your LLMs Capable of Stable Reasoning?

Paper • 2412.13147 • Published Dec 17, 2024 • 92

upvoted 8 papers 2 months ago

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

Paper • 2402.00159 • Published Jan 31, 2024 • 62

YOLO-World: Real-Time Open-Vocabulary Object Detection

Paper • 2401.17270 • Published Jan 30, 2024 • 36

Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data

Paper • 2401.10891 • Published Jan 19, 2024 • 60

Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively

Paper • 2401.02955 • Published Jan 5, 2024 • 22

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 139

upvoted a collection 2 months ago

InternVL2.5

Collection

Better than InternVL 2.0 • 18 items • Updated Jan 10 • 84

upvoted a paper 2 months ago

PaliGemma 2: A Family of Versatile VLMs for Transfer

Paper • 2412.03555 • Published Dec 4, 2024 • 126

upvoted a collection 3 months ago

PixMo

Collection

A set of vision-language datasets built by Ai2 and used to train the Molmo family of models. Read more at https://molmo.allenai.org/blog • 9 items • Updated 6 days ago • 59

upvoted a paper 3 months ago

Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities

Paper • 2308.12966 • Published Aug 24, 2023 • 8