sshell

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

Z1: Efficient Test-time Scaling with Code

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

liked a Space about 1 month ago

brandonsmart/splatt3r

View all activity

Organizations

sshell's activity

upvoted a paper 22 days ago

Z1: Efficient Test-time Scaling with Code

Paper • 2504.00810 • Published 23 days ago • 26

upvoted a paper about 1 month ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 47

liked 3 Spaces about 1 month ago

Splatt3R - Zero-shot Gaussian Splatting from Uncalibarated Image Pairs

⛰

Generate 3D scenes from one or two images

1.82k

MagicQuill

🪶

Edit and enhance images with custom color and edge modifications

281

LBM Relighting

✨

Fast image relighting using Latent Bridge Matching

upvoted 3 papers about 1 month ago

LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RL

Paper • 2503.07536 • Published Mar 10 • 85

4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models

Paper • 2503.10437 • Published Mar 13 • 32

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 160

liked a dataset about 1 month ago

bigpictureio/companies-2023-q4-sm

Viewer • Updated Nov 14, 2023 • 17.2M • 92 • 8

liked a Space about 1 month ago

124

smolagents LLM leaderboard

🏆

A leaderboard for LLMs powering smolagents

liked a model about 2 months ago

qihoo360/TinyR1-32B-Preview

Text Generation • Updated 8 days ago • 3.94k • 327

upvoted a paper about 2 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

upvoted 3 papers 2 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 192

ReLearn: Unlearning via Learning for Large Language Models

Paper • 2502.11190 • Published Feb 16 • 29

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published Feb 17 • 45

liked a model 2 months ago

stepfun-ai/Step-Audio-Chat

Audio-Text-to-Text • Updated Feb 17 • 187 • 438

upvoted 2 papers 2 months ago

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 70

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

liked 2 models 2 months ago

katanemo/Arch-Function-3B

Text Generation • Updated Feb 5 • 428 • 115

microsoft/OmniParser

Image-Text-to-Text • Updated Dec 2, 2024 • 871 • 1.66k