arxiv:2412.02674
hlzhang109 PRO
hlzhang109
AI & ML interests
None yet
Recent Activity
submitted
a paper
about 7 hours ago
Weight Decay Improves Language Model Plasticity
published
a dataset
2 days ago
hlzhang109/proteus-2k
updated
a model
about 1 month ago
hlzhang109/llama-4B-80BT-weightdecay1.0-seed42