chuxin-llm/Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints
arxiv: 2409.13198
License: mit
Scaling-Laws-for-Local-SGD-in-LLM-Intermediate-Checkpoints/lsgd/lsgd_0.05b (at commit 7b8a65d)
1 contributor, History: 1 commit
colourful-tree: Upload 40 files (7b8a65d, verified, 9 months ago)
File                    Size        Flags       Last commit       Updated
config.json             683 Bytes   Safe        Upload 40 files   9 months ago
generation_config.json  121 Bytes   -           Upload 40 files   9 months ago
pytorch_model.bin       846 MB      Safe, LFS   Upload 40 files   9 months ago
tokenizer.json          4.61 MB     -           Upload 40 files   9 months ago
tokenizer_config.json   794 Bytes   Safe        Upload 40 files   9 months ago