zk67
zk67
AI & ML interests
None yet
Recent Activity
updated
a collection
about 16 hours ago
LLM Pre-Train
updated
a collection
about 21 hours ago
Agent AI
updated
a collection
3 days ago
LLM Reasoning Papers
Organizations
Collections
9
-
A Large Batch Optimizer Reality Check: Traditional, Generic Optimizers Suffice Across Batch Sizes
Paper • 2102.06356 • Published -
Large Batch Optimization for Deep Learning: Training BERT in 76 minutes
Paper • 1904.00962 • Published • 1 -
Decoupled Weight Decay Regularization
Paper • 1711.05101 • Published • 1
models
None public yet
datasets
None public yet