1 10 18

AlgoDistill

AI & ML interests

jailbreaking

Recent Activity

liked a dataset 4 days ago

RLAIF/math

liked a model 14 days ago

openai/whisper-large-v3-turbo

liked a model 14 days ago

black-forest-labs/FLUX.1-dev

View all activity

Organizations

AlgoDistill's activity

liked a dataset 4 days ago

RLAIF/math

Viewer • Updated 17 days ago • 12.5k • 530 • 1

liked 2 models 14 days ago

openai/whisper-large-v3-turbo

Automatic Speech Recognition • Updated Oct 4, 2024 • 7.81M • • 2.11k

black-forest-labs/FLUX.1-dev

Text-to-Image • Updated Aug 16, 2024 • 2.75M • • 9.34k

liked a dataset 14 days ago

WinkingFace/CryptoLM-Solana-SOL-USDT

Viewer • Updated 1 minute ago • 23.7k • 1.63k • 2

liked 2 models 14 days ago

microsoft/Magma-8B

Image-Text-to-Text • Updated 9 days ago • 12.7k • 331

answerdotai/ModernBERT-base

Fill-Mask • Updated Jan 15 • 2.86M • 793

commented a paper 14 days ago

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published 15 days ago • 44 •

upvoted 2 papers 14 days ago

R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts

Paper • 2502.20395 • Published 15 days ago • 44

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published 16 days ago • 77

liked a dataset 14 days ago

nlile/24-game

Viewer • Updated 14 days ago • 1.36k • 4.04k • 2

liked 3 datasets 17 days ago

upvoted 4 papers 21 days ago

Qwen2.5-VL Technical Report

Paper • 2502.13923 • Published 23 days ago • 164

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published 22 days ago • 85

S*: Test Time Scaling for Code Generation

Paper • 2502.14382 • Published 22 days ago • 60

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 22 days ago • 97

upvoted 3 papers about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

Reward-Guided Speculative Decoding for Efficient LLM Reasoning

Paper • 2501.19324 • Published Jan 31 • 38

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Paper • 2501.16764 • Published Jan 28 • 22