忍者

byteprobe

AI & ML interests

RL | NLP | LLM | multimodal | agent

Recent Activity

upvoted an article about 11 hours ago

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

upvoted an article about 11 hours ago

SigLIP 2: A better multilingual vision language encoder

upvoted an article about 11 hours ago

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

View all activity

Organizations

byteprobe's activity

upvoted 3 articles about 11 hours ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

2 days ago

• 232

Article

SigLIP 2: A better multilingual vision language encoder

21 days ago

• 134

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 113

upvoted 3 papers about 11 hours ago

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published 18 days ago • 68

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 24 days ago • 67

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 15 days ago • 77

upvoted 2 papers about 12 hours ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published 8 days ago • 87

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 22 days ago • 93

upvoted a paper about 23 hours ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 22 days ago • 85

upvoted 5 papers 1 day ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 22 days ago • 97

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 29 days ago • 184

Large Language Diffusion Models

Paper • 2502.09992 • Published 28 days ago • 103

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 22 days ago • 129

Unified Reward Model for Multimodal Understanding and Generation

Paper • 2503.05236 • Published 7 days ago • 104

upvoted 5 papers 2 days ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 22 days ago • 162

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 11 days ago • 72

upvoted a paper 3 days ago

ZeroBench: An Impossible Visual Benchmark for Contemporary Large Multimodal Models

Paper • 2502.09696 • Published 29 days ago • 39