AlgoDistill

AI & ML interests

jailbreaking

Recent Activity

liked a dataset 8 days ago

SynthLabsAI/PERSONA_subset

updated a Space 8 days ago

AlgoDistill/coming-out-announcement

published a Space 8 days ago

AlgoDistill/coming-out-announcement

AlgoDistill's activity

upvoted 2 papers about 2 months ago

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published Feb 27 • 47

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

upvoted 4 papers 2 months ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published Feb 19 • 182

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 90

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published Feb 20 • 63

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published Feb 20 • 103

upvoted 4 papers 3 months ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 120

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 39

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Paper • 2501.17161 • Published Jan 28 • 120