aaa

qwertyuiopasdfg

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago

latent-consistency/lcm-lora-sdxl

liked a model about 14 hours ago

nerijs/pixel-art-xl

upvoted a collection 1 day ago

DeepSeekCoder-V2

View all activity

Organizations

None yet

qwertyuiopasdfg's activity

upvoted a collection 1 day ago

DeepSeekCoder-V2

Collection

6 items • Updated Sep 5, 2024 • 93

upvoted an article 3 days ago

Article

Visualize and understand GPU memory in PyTorch

Dec 24, 2024

• 211

upvoted a collection 7 days ago

StarVector SVG Datasets (🏆SVG-Bench)

Collection

Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 10

upvoted a paper 9 days ago

Exploring Data Scaling Trends and Effects in Reinforcement Learning from Human Feedback

Paper • 2503.22230 • Published 12 days ago • 43

upvoted 2 papers 16 days ago

LIMA: Less Is More for Alignment

Paper • 2305.11206 • Published May 18, 2023 • 24

Implicit Bias-Like Patterns in Reasoning Models

Paper • 2503.11572 • Published 26 days ago • 7

upvoted a paper 17 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published 20 days ago • 46

upvoted an article 20 days ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

29 days ago

• 379

upvoted a paper 23 days ago

Large-Scale Data Selection for Instruction Tuning

Paper • 2503.01807 • Published Mar 3 • 11

upvoted a paper 24 days ago

Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond

Paper • 2503.10460 • Published 27 days ago • 27

upvoted 2 papers about 2 months ago

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!

Paper • 2502.07374 • Published Feb 11 • 39

LIMO: Less is More for Reasoning

Paper • 2502.03387 • Published Feb 5 • 61

upvoted a paper 3 months ago

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 275