Daniil Laptev

dlaptev

AI & ML interests

None yet

Recent Activity

upvoted a collection 15 days ago

Reasoning, Thinking, RL and Test-Time Scaling

upvoted a paper about 2 months ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

upvoted a paper 2 months ago

You Do Not Fully Utilize Transformer's Representation Capacity

View all activity

Organizations

None yet

dlaptev's activity

upvoted a collection 15 days ago

Reasoning, Thinking, RL and Test-Time Scaling

Collection

128 items • Updated about 8 hours ago • 7

upvoted a paper about 2 months ago

How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM?

Paper • 2502.14502 • Published Feb 20 • 90

upvoted 3 papers 2 months ago

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published Feb 13 • 37

Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling

Paper • 2502.06703 • Published Feb 10 • 151

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 39

upvoted a collection 2 months ago

RL papers

Collection

12 items • Updated Jan 31 • 2

authored a paper 2 months ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 60

upvoted a paper 2 months ago

Analyze Feature Flow to Enhance Interpretation and Steering in Language Models

Paper • 2502.03032 • Published Feb 5 • 60

liked a dataset 3 months ago

AI-MO/NuminaMath-CoT

Viewer • Updated Nov 25, 2024 • 860k • 3.71k • 443

upvoted a paper 3 months ago

The Differences Between Direct Alignment Algorithms are a Blur

Paper • 2502.01237 • Published Feb 3 • 115

liked 4 datasets 4 months ago

upvoted a paper 6 months ago

Mechanistic Permutability: Match Features Across Layers

Paper • 2410.07656 • Published Oct 10, 2024 • 19

upvoted a collection 7 months ago

Saiga datasets

Collection

Datasets used for Saiga fine-tuning • 9 items • Updated Oct 28, 2024 • 8

upvoted a collection 9 months ago

🍃 MINT-1T

Collection

Data for "MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens" • 13 items • Updated Jul 24, 2024 • 59