sshell

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

Z1: Efficient Test-time Scaling with Code

upvoted a paper 13 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

liked a Space 15 days ago

brandonsmart/splatt3r

Organizations

sshell's activity

upvoted a paper 3 days ago

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published 4 days ago • 22

upvoted a paper 13 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 16 days ago • 46

upvoted 3 papers 21 days ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published 26 days ago • 83

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Paper • 2503.10437 • Published 23 days ago • 30

Transformers without Normalization

Paper • 2503.10622 • Published 23 days ago • 151

upvoted 2 papers about 1 month ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 169

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 189

upvoted 4 papers about 2 months ago

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 29

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17 • 43

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 69

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 147

upvoted 4 papers 3 months ago

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Paper • 2501.01830 • Published Jan 3 • 18

Infecting Generative AI With Viruses

Paper • 2501.05542 • Published Jan 9 • 13

Multiagent Finetuning: Self Improvement with Diverse Reasoning Chains

Paper • 2501.05707 • Published Jan 10 • 20

Apollo: An Exploration of Video Understanding in Large Multimodal Models

Paper • 2412.10360 • Published Dec 13, 2024 • 146