Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Website
Tasks
HuggingChat
Collections
Languages
Organizations
Community
Blog
Posts
Daily Papers
Learn
Discord
Forum
GitHub
Solutions
Team & Enterprise
Hugging Face PRO
Enterprise Support
Inference Providers
Inference Endpoints
Storage Buckets
Log In
Sign Up
tohio
/
slm-125m
like
0
Text Generation
Safetensors
English
slm
causal-lm
decoder-only
custom-architecture
rope
gqa
swiglu
base
custom_code
License:
mit
Model card
Files
Files and versions
xet
Community
Copy to bucket
new
main
slm-125m
506 MB
Ctrl+K
Ctrl+K
1 contributor
History:
2 commits
tohio
Export slm-125m (125.3M params)
8c81a35
verified
14 days ago
slm_remote
Export slm-125m (125.3M params)
14 days ago
tokenizer
Export slm-125m (125.3M params)
14 days ago
.gitattributes
Safe
1.52 kB
initial commit
14 days ago
README.md
Safe
6.94 kB
Export slm-125m (125.3M params)
14 days ago
attention.py
Safe
13.7 kB
Export slm-125m (125.3M params)
14 days ago
block.py
Safe
3.6 kB
Export slm-125m (125.3M params)
14 days ago
config.json
Safe
762 Bytes
Export slm-125m (125.3M params)
14 days ago
config.py
Safe
8.67 kB
Export slm-125m (125.3M params)
14 days ago
generation_config.json
Safe
215 Bytes
Export slm-125m (125.3M params)
14 days ago
mlp.py
Safe
2.68 kB
Export slm-125m (125.3M params)
14 days ago
model.py
Safe
19.8 kB
Export slm-125m (125.3M params)
14 days ago
model.safetensors
501 MB
xet
Export slm-125m (125.3M params)
14 days ago
norm.py
Safe
1.92 kB
Export slm-125m (125.3M params)
14 days ago