Denis Akhiyarov

dtanow

AI & ML interests

AI Code Generation with LLMs

Organizations

None yet

dtanow's activity

upvoted a paper 28 days ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published about 1 month ago • 24

upvoted 2 papers about 1 month ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 188

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 150

upvoted a paper about 2 months ago

CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction

Paper • 2502.07316 • Published Feb 11 • 47

upvoted 2 papers 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 272

Offline Reinforcement Learning for LLM Multi-Step Reasoning

Paper • 2412.16145 • Published Dec 20, 2024 • 38

upvoted a paper 4 months ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6, 2024 • 59

liked a model 4 months ago

mistralai/Pixtral-12B-2409

Image-Text-to-Text • Updated Dec 26, 2024 • 628

upvoted a paper 5 months ago

GitChameleon: Unmasking the Version-Switching Capabilities of Code Generation Models

Paper • 2411.05830 • Published Nov 5, 2024 • 21

liked a model 5 months ago

facebook/incoder-6B

Text Generation • Updated Jan 24, 2023 • 290 • 79

liked a Space 5 months ago

12.9k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots

liked a model 5 months ago

neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w4a16

Text Generation • Updated Feb 12 • 11.5k • 32

liked a dataset 5 months ago

coseal/codal-bench

Viewer • Updated Mar 18, 2024 • 500 • 69 • 6

liked a Space 6 months ago

193

BigCodeBench Leaderboard

🥇

Explore and analyze code evaluation data

liked 2 models 6 months ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 387k • 1.04k

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 237k • 1.5k

liked a dataset 6 months ago

nvidia/OpenMathInstruct-2

Viewer • Updated Nov 25, 2024 • 22M • 6.79k • 164

New activity in nvidia/Llama-3_1-Nemotron-51B-Instruct 6 months ago

fp8 / int8 inference - use bitsandbytes or awq

#8 opened 6 months ago by dtanow

liked a model 6 months ago

meta-llama/Llama-3.2-11B-Vision-Instruct

Image-Text-to-Text • Updated Dec 4, 2024 • 1.44M • 1.4k

liked a dataset 6 months ago

THUDM/humaneval-x

Viewer • Updated Oct 25, 2022 • 820 • 1.31k • 84