Chris Concannon's picture

133 19

Chris Concannon

choncan

·

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

upvoted a paper 3 days ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

upvoted a paper 15 days ago

Phi-4 Technical Report

View all activity

Organizations

None yet

choncan's activity

upvoted 2 papers 3 days ago

SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines

Paper • 2502.14739 • Published 24 days ago • 97

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published 24 days ago • 179

upvoted 5 papers 15 days ago

Phi-4 Technical Report

Paper • 2412.08905 • Published Dec 12, 2024 • 111

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 352

CodeElo: Benchmarking Competition-level Code Generation of LLMs with Human-comparable Elo Ratings

Paper • 2501.01257 • Published Jan 2 • 50

2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining

Paper • 2501.00958 • Published Jan 1 • 100

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 263

upvoted 12 papers 16 days ago

MiniMax-01: Scaling Foundation Models with Lightning Attention

Paper • 2501.08313 • Published Jan 14 • 276

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 348

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 204

Demystifying Long Chain-of-Thought Reasoning in LLMs

Paper • 2502.03373 • Published Feb 5 • 57

Competitive Programming with Large Reasoning Models

Paper • 2502.06807 • Published Feb 3 • 67

Expect the Unexpected: FailSafe Long Context QA for Finance

Paper • 2502.06329 • Published Feb 10 • 126

Large Language Diffusion Models

Paper • 2502.09992 • Published about 1 month ago • 103

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 28 days ago • 143

SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering?

Paper • 2502.12115 • Published 27 days ago • 43

Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity

Paper • 2502.13063 • Published 26 days ago • 67

SurveyX: Academic Survey Automation via Large Language Models

Paper • 2502.14776 • Published 24 days ago • 93

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published 24 days ago • 162

upvoted a paper about 2 months ago

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published Jan 4 • 92