Feng
VandeeeFeng
AI & ML interests
None yet
Recent Activity
Updated a collection 14 days ago: models
Updated a collection about 1 month ago: apps
Updated a collection about 1 month ago: papers
Organizations
None yet
Collections
3
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 113
- SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
  Paper • 2501.17161 • Published • 118
- The Ultra-Scale Playbook • 2.4k
  🌌 The ultimate guide to training LLMs on large GPU clusters
- The Ultimate Guide to LLM Training | The Ultra-Scale Playbook • 192
  🔥 Learn every aspect of LLM training
datasets
None public yet